Meta is seeking a Network Operations Engineer to join our Edge and Network Services (ENS) Foundation NetOps team. In this role, you will apply your knowledge of hyperscale data center structured cabling, fiber connectivity solutions, and IP networking to support the delivery of field repair services. You will work closely with internal business teams, external vendors, network design/engineering, and other cross-functional teams to develop strategies for integrating new technologies and better supporting existing technologies across our operational fleet. Your objective will be to understand technical drivers for the business and organize strategies to ensure operational supportability meets those needs. Additionally, you will represent ENS across various stakeholder groups to help deliver critical projects. Our team is dedicated to improving the operational efficiency and reliability of one of the world's most dynamic and fast-paced networks. The ideal candidate has a demonstrated track record of working in a fast-moving organization, identifying root causes in complex systems, and quickly learning new domains and technologies to implement process enhancements and automation solutions that address operational pain points.
Locations
New Albany, OH, US
Salary
Salary not disclosed
Skills Required
IP networking: intermediate
structured cabling: intermediate
fiber connectivity solutions: intermediate
optical/network communication equipment: advanced
Cisco and Juniper routers/switches: intermediate
Nexus data center switching: intermediate
physical data center design: intermediate
troubleshooting and root cause analysis: advanced
automation solutions: intermediate
Required Qualifications
Bachelor's degree in Computer Science, Computer Engineering, or relevant technical field (degree)
7+ years of experience operating, deploying, and designing large-scale network, optical, and/or physical layer infrastructure (experience)
Hands-on experience with various optical/network communication equipment, such as network/fiber/copper test gear and WDM equipment (experience)
Experience analyzing tactical situations, troubleshooting, and root-causing issues in systems and tools (experience)
Familiarity with Enterprise and Service Provider network hardware platforms and architectures (experience)
Knowledge of physical data center design: rack elevations, cabling, fiber optics, power/cooling, and facility infrastructure (experience)
Direct experience within global Network Operations Center (NOC) environments (experience)
Responsibilities
Collaborate with cross-functional teams, managed service providers, and third-party vendor partners to investigate complex technical and process issues during major incidents/site events (SEVs) on edge, caching, and network infrastructure. Work closely with team members to understand the root cause of incidents and contribute to their resolution
Identify security & business continuity issues affecting the network and infrastructure at Meta and work with the team to implement effective changes. Provide feedback on existing tools, processes, and policies to help scale with the rapid expansion of the Meta platform and customer base
Drive operations by identifying improvement opportunities across policies, processes, and procedures to improve efficiency and quality of activities. Ensure standards compliance across the network and optimize service delivery
Work closely with internal customers to address their needs and issues, and influence future iterations of data center and network designs for seamless integration of new infrastructure and scalability
Work with cross-functional teams within and outside the organization to deliver predictable business outcomes across global sites. Analyze operational datasets to detect and prioritize problems and create aligned team projects, programs, and roadmaps
Collaborate with partner teams to design and implement aligned processes that identify and manage data and asset protection risks, as well as operations continuity issues across the network
Ensure relevant operational process, procedure, and policy documentation is effectively managed, and the data required to support operations is complete and accurate in systems
Collaborate with the team to analyze operational events to identify new automation opportunities and achieve our vision of all Tier-1 faults in the network being fully remediated by software. Help others understand our requirements and drive their roadmaps, and in some cases, directly implement lightweight solutions in code
Drive quality into the metrics we report, measuring and analyzing escalation issues, fault/event trends, infrastructure capacity, and vendor performance failures. Formulate appropriate metrics and definitions of success to drive quality, efficiency, cost, and timeliness, evolving these over time to match changes to the infrastructure and business requirements
Provide clear and effective communication around personal and team goals, progress, outcomes, and lessons learned across assigned scope
International and domestic travel may be required up to 25% of the time, depending on the needs of the business
Benefits
bonus: Bonus included in compensation
equity: Equity included in compensation
health: Benefits package offered; learn more about benefits at Meta