
Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now!

MongoDB

Full-time | Posted: Feb 6, 2026

Job Description

Role Overview

Join MongoDB as a Lead Engineer, Inference Platform and shape the future of AI-native developer experiences on the world's most popular modern developer data platform. This hands-on technical leadership role sits at the intersection of MongoDB Atlas Vector Search, cutting-edge embedding models from our Voyage.ai acquisition, and high-scale inference infrastructure. You'll build the real-time, low-latency systems powering semantic search, hybrid retrieval, and RAG pipelines for thousands of global customers.

This hybrid role is based in Palo Alto, California or Seattle, Washington. You'll lead key projects optimizing GPU utilization, autoscaling, and observability in a multi-tenant, cloud-native environment deeply integrated with MongoDB Atlas across AWS, Google Cloud, and Microsoft Azure.

Key Responsibilities at MongoDB

  • Partner closely with Search Platform and Voyage.ai AI engineers to productionize state-of-the-art embedding models and rerankers for both batch and real-time inference
  • Lead critical projects focused on performance optimization, GPU utilization, autoscaling, and comprehensive observability for the inference platform
  • Design and implement components of our multi-tenant inference service that powers Atlas Vector Search capabilities for semantic search and hybrid retrieval
  • Build essential platform features like model versioning, safe deployment pipelines, latency-aware routing, and model health monitoring
  • Collaborate with ML, infrastructure, and product teams to establish architectural patterns ensuring high availability and low latency at scale
  • Guide technical decisions on model serving architecture leveraging tools like vLLM, ONNX Runtime, and Kubernetes container orchestration
  • Provide hands-on technical leadership and mentorship to engineers across experience levels
  • Foster a culture of technical excellence, autonomy, and continuous improvement within the team

Qualifications & Requirements

To succeed as our Lead Engineer, Inference Platform, you'll bring:

  • 8+ years of engineering experience in backend systems, ML infrastructure, or scalable platform development with proven technical leadership
  • Deep expertise serving embedding models in production environments at global scale
  • Strong systems programming in Go, Rust, C++, or Python with experience profiling and optimizing performance
  • Proven experience building cloud-native distributed systems emphasizing latency, availability, and observability
  • Familiarity with inference runtimes (vLLM, ONNX Runtime) and vector search systems (Faiss, HNSW, ScaNN)
  • 1+ years of experience serving as Technical Lead for large-scale ML inference or training platform projects
  • Track record of cross-discipline collaboration from ML researchers to junior engineers in multi-tenant SaaS environments

Nice to have: experience with hybrid retrieval, RAG, open-source ML serving contributions, or managing ML infrastructure teams.

Salary & Benefits

Lead Engineer, Inference Platform roles at MongoDB offer competitive Total Compensation packages including base salary, equity, and comprehensive benefits. Expected range: $220,000 - $350,000 USD (Palo Alto/Seattle) depending on experience.

  • Competitive base salary + significant equity ownership in high-growth public company
  • Comprehensive wellness programs including mental health support
  • MongoDB University for certifications and skill development
  • Global Family Leave with generous parental policies
  • 401(k) matching and ESPP programs
  • Unlimited PTO and flexible hybrid schedules
  • Annual learning stipend for conferences and courses
  • Premium health insurance (medical, dental, vision)

Why Join MongoDB?

Be part of redefining databases for the AI era. MongoDB powers innovators building AI-native applications on our globally distributed, multi-cloud platform. Work with Voyage.ai ML experts to bring cutting-edge research into production, solving real-time inference challenges at unprecedented scale.

  • Shape AI-native developer experiences on the #1 developer data platform
  • Solve hard problems in real-time inference and semantic retrieval for global customers
  • Thrive in a culture valuing mentorship, autonomy, and technical craft
  • Join 4,000+ talented engineers modernizing legacy workloads and unleashing AI

How to Apply

Ready to lead MongoDB's inference platform into the AI future? Apply now if you're in Palo Alto or Seattle and have production embedding model expertise. Our team reviews applications continuously—don't miss this chance to build scalable AI infrastructure at the world's leading NoSQL database platform.

Keywords: MongoDB Atlas, Vector Search, ML Inference, Kubernetes, Go, Rust, Semantic Search, RAG, Voyage.ai, GPU Optimization

Locations

  • Palo Alto, California, United States
  • Seattle, Washington, United States

Salary

Estimated Salary Range (high confidence)

231,000 - 385,000 USD per year

Source: AI estimate

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • MongoDB Atlas (intermediate)
  • Vector Search (intermediate)
  • Embedding Models (intermediate)
  • ML Inference (intermediate)
  • Kubernetes (intermediate)
  • Go Programming (intermediate)
  • Rust Development (intermediate)
  • Distributed Systems (intermediate)
  • GPU Optimization (intermediate)
  • NoSQL Databases (intermediate)
  • Semantic Search (intermediate)
  • RAG Pipelines (intermediate)
  • vLLM Serving (intermediate)
  • ONNX Runtime (intermediate)
  • Cloud Native Architecture (intermediate)

Required Qualifications

  • 8+ years of engineering experience in backend systems, ML infrastructure, or scalable platform development with technical leadership
  • Expertise in serving embedding models in production environments at scale
  • Strong systems programming skills in Go, Rust, C++, or Python with performance profiling and optimization
  • Experience building cloud-native distributed systems focused on latency, availability, and observability
  • Familiarity with inference runtimes (vLLM, ONNX Runtime) and vector search systems (Faiss, HNSW, ScaNN)
  • 1+ years serving as Technical Lead for large-scale ML inference or training platform projects
  • Proven collaboration across ML researchers, engineers, and product teams in multi-tenant SaaS environments

Responsibilities

  • Partner with Search Platform and Voyage.ai teams to productionize state-of-the-art embedding models and rerankers
  • Lead performance optimization projects for GPU utilization, autoscaling, and observability
  • Design multi-tenant inference services integrated with MongoDB Atlas Vector Search
  • Build platform features including model versioning, safe deployment pipelines, and latency-aware routing
  • Collaborate across ML, infrastructure, and product teams to define scalable architectural patterns
  • Guide model serving architecture decisions using vLLM, ONNX Runtime, and Kubernetes orchestration
  • Provide technical leadership and mentorship to junior engineers on the inference platform team
  • Ensure high availability, low latency inference at global scale for MongoDB Atlas customers
  • Drive semantic search and hybrid retrieval capabilities powering AI-native MongoDB features

Benefits

  • Competitive compensation with significant equity ownership
  • Comprehensive wellness programs including mental health support
  • MongoDB University for continuous learning and certification
  • Global Family Leave with generous parental leave policies
  • Hybrid work model in premium tech hubs (Palo Alto, Seattle)
  • 401(k) matching and employee stock purchase plans
  • Unlimited PTO and flexible work arrangements
  • Annual learning stipend for conferences and professional development
  • Comprehensive health, dental, and vision insurance
  • Volunteer time off and community impact programs



