As a Principal Software Engineer on the Azure Kubernetes Service (AKS) team, you will help build world-class container management and orchestration services for the cloud and beyond. This team develops container solutions for Microsoft Azure across both Linux and Windows platforms, enabling scalable and resilient infrastructure for a wide range of workloads. You will play a key role in defining the next generation of cloud-native infrastructure on Microsoft Azure. This position is ideal for engineers who are interested in containers and container orchestration using Kubernetes, and who are eager to build DevOps tools that simplify the development, deployment, and scaling of software in the cloud. The team is globally distributed across multiple countries and continents, and supports flexible work environments, including remote work. This role involves deep technical work focused on enhancing Kubernetes to support artificial intelligence and machine learning workloads. You will develop infrastructure controllers, onboard new Kubernetes features, integrate with other Microsoft Azure services, and build foundational components to automate and accelerate training and inference workflows. This is a unique opportunity to make a significant impact on Azure Kubernetes Service customers, especially as the artificial intelligence landscape continues to evolve rapidly. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Locations
United States (Remote)
Salary
139,900 - 274,800 USD / year
Required Qualifications
Bachelor's Degree in Computer Science or related technical field (degree in Computer Science)
technical engineering experience with coding in languages including, but not limited to, C, C++, golang, or Python (experience, 6 years)
designing, building, shipping, and operating reliable distributed systems (experience, 5 years)
hands-on experience with scalable infrastructure and fault-tolerant architectures (experience, 3 years)
delivering production-ready solutions in cloud environments (experience, 3 years)
Preferred Qualifications
Bachelor's Degree in Computer Science or related technical field (degree in Computer Science)
technical engineering experience with coding in languages including, but not limited to, C, C++, golang, or Python (experience, 10 years)
Master's Degree in Computer Science or related technical field (degree in Computer Science)
technical engineering experience with coding in languages including, but not limited to, C, C++, golang, or Python (experience, 8 years)
working with artificial intelligence and machine learning workloads, including: Integration of training and inference pipelines into distributed systems. Optimization of compute resources for performance and reliability. (experience, 1 year)
experience with container technologies and orchestration platforms, including: Use of containers such as Docker. Deployment and management of workloads using Kubernetes. (experience, 1 year)
Responsibilities
Hold accountability as a Designated Responsible Individual (DRI), mentor engineers across products/solutions, and work on-call to monitor the system/product/service for degradation, downtime, or interruptions.
Drive features from idea to production.
Work with emerging technologies, from software to hardware. Lead the designs and implementations that advance Azure Kubernetes Service for artificial intelligence (AI) workloads.
Fully leverage AI in product development.
Maintain communication with partners across the Microsoft ecosystem of engineers and provide them with clarity. Contribute to partners’ success.