Meta is seeking a Technical Program Manager (TPM) with experience in server and rack systems design, development, and deployment at scale. This position will work with cross-functional teams in Meta’s Infrastructure organization to drive product definition, proof of concept generation, design, component selection, integration, development, validation, and end-to-end adoption of new hardware products. This position would focus on hardware system end-to-end development needed to support emerging high powered AI/ML servers, and would require engagement with external vendors as well as a range of internal engineering and specialized teams. You would focus on creating strategies and executing plans to support the development and deployment of current and new hardware platforms. These platforms are the foundation of our AI Training and Inference systems, and are a key enabler to supporting the company’s push into AI. This role would work with external and internal partners to influence and define roadmaps based on technical and business considerations, influence adoption strategies including internal customer and stakeholder alignment, and drive integration with Meta’s capacity planning tools and systems across the program space. This role would work with Infrastructure Hardware development, Infrastructure software, Capacity Planning, Data Center, Network Infrastructure and Infrastructure sourcing teams. Meta’s Infrastructure Engineering organization is responsible for the growth, management and 24x7 upkeep of all Meta’s products and services.
Locations
Menlo Park, CA, USA
Salary
167,000 - 230,000 USD / yearly
Skills Required
Hardware Systems Designadvanced (Technical)
AI/ML Server Developmentintermediate (Technical)
Program Managementadvanced (Management)
Vendor Managementintermediate (Management)
Required Qualifications
B.S. in Computer Science or a related technical discipline (degree in Computer Science)
10+ years of experience in software engineering, systems engineering, hardware engineering, or technical product/program management experience (experience, 10 years)
Experience delivering tech programs or products from inception to delivery (experience)
Knowledge of user needs, gathering requirements, and defining scope (experience)
Experience operating under your own initiative across multiple teams, demonstrated critical thinking, and thought leadership (experience)
Communication experience and experience working with technical management teams to develop systems, solutions, and products (experience)
Organizational, coordination and multi-tasking experience (experience)
Analytical and problem-solving experience with large-scale systems (experience)
Experience establishing work relationships across multidisciplinary teams and multiple partners in different time zones (experience)
Preferred Qualifications
Understanding of Hardware system architecture and related technologies (experience)
Knowledge of compute, storage and/or AI/ML server development (experience)
Hardware Systems Design experience and new product introduction lifecycle management (experience)
Web or Internet start-up environment and technical infrastructure management experience (experience)
Experience with data center deployment (experience)
Experience in large scale AI cluster build out (experience)
Experience working with Original Design Manufacturers (ODM)’s and other vendors (experience)
Responsibilities
Lead technical program management of next-generation hardware platform(s) for Meta Infrastructure in a matrix organization covering a range of areas (Data Center, Network, Hardware Systems, Infrastructure Engineering, Software Engineering, Capacity Management) and across multiple physical locations
Own overall program success, spanning the end-to-end development of the hardware product. spanning internal and external development work through successful ingestion into Meta’s infrastructure and support of production workloads at scale
Develop and manage programs including defining scope, requirements, development model, schedules, and deliverables with engineering teams, partners, and stakeholders
Influence broader roadmaps through product interception and market fit, competitive analysis, and feasibility studies
Provide hands-on program management during analysis, design, development, testing, implementation, and post implementation phases
Partner with Engineering counterparts across a range of specialties as well as other teams to define product roadmaps
Drive overall communication to leadership, stakeholders and core working teams in regular cadence
Drive internal process improvements across multiple teams and functions
Analyze infrastructure needs and produce hardware designs and prototypes to meet those needs
Manage and drive strategic vendor engagement and deliveries