RESUME AND JOB

Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now!

MongoDB

Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now!

MongoDB

full-timePosted: Feb 6, 2026

Job Description

Role Overview

Join MongoDB as a Lead Engineer, Inference Platform and shape the future of AI-native developer experiences on the world's most popular modern developer data platform. This hands-on technical leadership role sits at the intersection of MongoDB Atlas Vector Search, cutting-edge embedding models from our Voyage.ai acquisition, and high-scale inference infrastructure. You'll build the real-time, low-latency systems powering semantic search, hybrid retrieval, and RAG pipelines for thousands of global customers.

Based in Palo Alto, California or Seattle, Washington for our hybrid model, you'll lead key projects optimizing GPU utilization, autoscaling, and observability in a multi-tenant, cloud-native environment deeply integrated with MongoDB Atlas across AWS, Google Cloud, and Microsoft Azure.

Key Responsibilities at MongoDB

Partner closely with Search Platform and Voyage.ai AI engineers to productionize state-of-the-art embedding models and rerankers for both batch and real-time inference
Lead critical projects focused on performance optimization, GPU utilization, autoscaling, and comprehensive observability for the inference platform
Design and implement components of our multi-tenant inference service that powers Atlas Vector Search capabilities for semantic search and hybrid retrieval
Build essential platform features like model versioning, safe deployment pipelines, latency-aware routing, and model health monitoring
Collaborate with ML, infrastructure, and product teams to establish architectural patterns ensuring high availability and low latency at scale
Guide technical decisions on model serving architecture leveraging tools like vLLM, ONNX Runtime, and Kubernetes container orchestration
Provide hands-on technical leadership and mentorship to engineers across experience levels
Foster a culture of technical excellence, autonomy, and continuous improvement within the team

Qualifications & Requirements

To succeed as our Lead Engineer, Inference Platform, you'll bring:

8+ years of engineering experience in backend systems, ML infrastructure, or scalable platform development with proven technical leadership
Deep expertise serving embedding models in production environments at global scale
Strong systems programming in Go, Rust, C++, or Python with experience profiling and optimizing performance
Proven experience building cloud-native distributed systems emphasizing latency, availability, and observability
Familiarity with inference runtimes (vLLM, ONNX Runtime) and vector search systems (Faiss, HNSW, ScaNN)
1+ years experience serving as Technical Lead for large-scale ML inference or training platform projects
Track record of cross-discipline collaboration from ML researchers to junior engineers in multi-tenant SaaS environments

Nice to haves: Experience with hybrid retrieval, RAG, open-source ML serving contributions, or managing ML infrastructure teams.

Salary & Benefits

Lead Engineer, Inference Platform roles at MongoDB offer competitive Total Compensation packages including base salary, equity, and comprehensive benefits. Expected range: $220,000 - $350,000 USD (Palo Alto/Seattle) depending on experience.

Competitive base salary + significant equity ownership in high-growth public company
Comprehensive wellness programs including mental health support
MongoDB University for certifications and skill development
Global Family Leave with generous parental policies
401(k) matching and ESPP programs
Unlimited PTO and flexible hybrid schedules
Annual learning stipend for conferences and courses
Premium health insurance (medical, dental, vision)

Why Join MongoDB?

Be part of redefining databases for the AI era. MongoDB powers innovators building AI-native applications on our globally distributed, multi-cloud platform. Work with Voyage.ai ML experts to bring cutting-edge research into production, solving real-time inference challenges at unprecedented scale.

Shape AI-native developer experiences on the #1 developer data platform
Solve hard problems in real-time inference and semantic retrieval for global customers
Thrive in a culture valuing mentorship, autonomy, and technical craft
Join 4,000+ talented engineers modernizing legacy workloads and unleashing AI

How to Apply

Ready to lead MongoDB's inference platform into the AI future? Apply now if you're in Palo Alto or Seattle and have production embedding model expertise. Our team reviews applications continuously—don't miss this chance to build scalable AI infrastructure at the world's leading NoSQL database platform.

Keywords: MongoDB Atlas, Vector Search, ML Inference, Kubernetes, Go, Rust, Semantic Search, RAG, Voyage.ai, GPU Optimization

Locations

Palo Alto, California, United States
Seattle, Washington, United States

Salary

Estimated Salary Rangehigh confidence

231,000 - 385,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

MongoDB Atlasintermediate
Vector Searchintermediate
Embedding Modelsintermediate
ML Inferenceintermediate
Kubernetesintermediate
Go Programmingintermediate
Rust Developmentintermediate
Distributed Systemsintermediate
GPU Optimizationintermediate
NoSQL Databasesintermediate
Semantic Searchintermediate
RAG Pipelinesintermediate
vLLM Servingintermediate
ONNX Runtimeintermediate
Cloud Native Architectureintermediate

Required Qualifications

8+ years of engineering experience in backend systems, ML infrastructure, or scalable platform development with technical leadership (experience)
Expertise in serving embedding models in production environments at scale (experience)
Strong systems programming skills in Go, Rust, C++, or Python with performance profiling and optimization (experience)
Experience building cloud-native distributed systems focused on latency, availability, and observability (experience)
Familiarity with inference runtimes (vLLM, ONNX Runtime) and vector search systems (Faiss, HNSW, ScaNN) (experience)
1+ years serving as Technical Lead for large-scale ML inference or training platform projects (experience)
Proven collaboration across ML researchers, engineers, and product teams in multi-tenant SaaS environments (experience)

Responsibilities

Partner with Search Platform and Voyage.ai teams to productionize state-of-the-art embedding models and rerankers
Lead performance optimization projects for GPU utilization, autoscaling, and observability
Design multi-tenant inference services integrated with MongoDB Atlas Vector Search
Build platform features including model versioning, safe deployment pipelines, and latency-aware routing
Collaborate across ML, infrastructure, and product teams to define scalable architectural patterns
Guide model serving architecture decisions using vLLM, ONNX Runtime, and Kubernetes orchestration
Provide technical leadership and mentorship to junior engineers on the inference platform team
Ensure high availability, low latency inference at global scale for MongoDB Atlas customers
Drive semantic search and hybrid retrieval capabilities powering AI-native MongoDB features

Benefits

general: Competitive compensation with significant equity ownership
general: Comprehensive wellness programs including mental health support
general: MongoDB University for continuous learning and certification
general: Global Family Leave with generous parental leave policies
general: Hybrid work model in premium tech hubs (Palo Alto, Seattle)
general: 401(k) matching and employee stock purchase plans
general: Unlimited PTO and flexible work arrangements
general: Annual learning stipend for conferences and professional development
general: Comprehensive health, dental, and vision insurance
general: Volunteer time off and community impact programs

Target Your Resume for "Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now!" , MongoDB

Get personalized recommendations to optimize your resume specifically for Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now!. Takes only 15 seconds!

AI-powered keyword optimization

Skills matching & gap analysis

Experience alignment suggestions

Check Your ATS Score for "Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now!" , MongoDB

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check

Keyword optimization analysis

Skill matching & gap identification

Format & readability score

Tags & Categories

MongoDBAIMachine LearningInferenceVector SearchKubernetesGoRustPalo AltoSeattleTechnical LeadershipML InfrastructureLead Engineer Inference Platform MongoDBMongoDB Atlas Vector Search jobsML inference engineer Palo AltoEmbedding models production engineerKubernetes ML serving Seattle jobsVoyage.ai MongoDB engineering careersGPU optimization engineer MongoDBSemantic search platform leadRAG pipeline infrastructure jobsvLLM ONNX Runtime engineerMulti-tenant ML inference leadMongoDB AI platform engineeringHybrid retrieval systems engineerTechnical lead ML infrastructureNoSQL database AI engineerCloud native inference platformFaiss HNSW vector search jobsPTO Atlas Search

Answer 10 quick questions to check your fit for Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now! @ MongoDB.

10 Questions

~2 Minutes

Instant Score

Related Books and Jobs

No related jobs found at the moment.

Privacy Terms & Conditions About Us Refund Policy Recruiter Login Sitemap

Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now!

MongoDB

Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now!

MongoDB

full-timePosted: Feb 6, 2026

Job Description

Role Overview

Key Responsibilities at MongoDB

Partner closely with Search Platform and Voyage.ai AI engineers to productionize state-of-the-art embedding models and rerankers for both batch and real-time inference
Lead critical projects focused on performance optimization, GPU utilization, autoscaling, and comprehensive observability for the inference platform
Design and implement components of our multi-tenant inference service that powers Atlas Vector Search capabilities for semantic search and hybrid retrieval
Build essential platform features like model versioning, safe deployment pipelines, latency-aware routing, and model health monitoring
Collaborate with ML, infrastructure, and product teams to establish architectural patterns ensuring high availability and low latency at scale
Guide technical decisions on model serving architecture leveraging tools like vLLM, ONNX Runtime, and Kubernetes container orchestration
Provide hands-on technical leadership and mentorship to engineers across experience levels
Foster a culture of technical excellence, autonomy, and continuous improvement within the team

Qualifications & Requirements

To succeed as our Lead Engineer, Inference Platform, you'll bring:

8+ years of engineering experience in backend systems, ML infrastructure, or scalable platform development with proven technical leadership
Deep expertise serving embedding models in production environments at global scale
Strong systems programming in Go, Rust, C++, or Python with experience profiling and optimizing performance
Proven experience building cloud-native distributed systems emphasizing latency, availability, and observability
Familiarity with inference runtimes (vLLM, ONNX Runtime) and vector search systems (Faiss, HNSW, ScaNN)
1+ years experience serving as Technical Lead for large-scale ML inference or training platform projects
Track record of cross-discipline collaboration from ML researchers to junior engineers in multi-tenant SaaS environments

Nice to haves: Experience with hybrid retrieval, RAG, open-source ML serving contributions, or managing ML infrastructure teams.

Salary & Benefits

Competitive base salary + significant equity ownership in high-growth public company
Comprehensive wellness programs including mental health support
MongoDB University for certifications and skill development
Global Family Leave with generous parental policies
401(k) matching and ESPP programs
Unlimited PTO and flexible hybrid schedules
Annual learning stipend for conferences and courses
Premium health insurance (medical, dental, vision)

Why Join MongoDB?

Shape AI-native developer experiences on the #1 developer data platform
Solve hard problems in real-time inference and semantic retrieval for global customers
Thrive in a culture valuing mentorship, autonomy, and technical craft
Join 4,000+ talented engineers modernizing legacy workloads and unleashing AI

How to Apply

Keywords: MongoDB Atlas, Vector Search, ML Inference, Kubernetes, Go, Rust, Semantic Search, RAG, Voyage.ai, GPU Optimization

Locations

Palo Alto, California, United States
Seattle, Washington, United States

Salary

Estimated Salary Rangehigh confidence

231,000 - 385,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

MongoDB Atlasintermediate
Vector Searchintermediate
Embedding Modelsintermediate
ML Inferenceintermediate
Kubernetesintermediate
Go Programmingintermediate
Rust Developmentintermediate
Distributed Systemsintermediate
GPU Optimizationintermediate
NoSQL Databasesintermediate
Semantic Searchintermediate
RAG Pipelinesintermediate
vLLM Servingintermediate
ONNX Runtimeintermediate
Cloud Native Architectureintermediate

Required Qualifications

8+ years of engineering experience in backend systems, ML infrastructure, or scalable platform development with technical leadership (experience)
Expertise in serving embedding models in production environments at scale (experience)
Strong systems programming skills in Go, Rust, C++, or Python with performance profiling and optimization (experience)
Experience building cloud-native distributed systems focused on latency, availability, and observability (experience)
Familiarity with inference runtimes (vLLM, ONNX Runtime) and vector search systems (Faiss, HNSW, ScaNN) (experience)
1+ years serving as Technical Lead for large-scale ML inference or training platform projects (experience)
Proven collaboration across ML researchers, engineers, and product teams in multi-tenant SaaS environments (experience)

Responsibilities

Partner with Search Platform and Voyage.ai teams to productionize state-of-the-art embedding models and rerankers
Lead performance optimization projects for GPU utilization, autoscaling, and observability
Design multi-tenant inference services integrated with MongoDB Atlas Vector Search
Build platform features including model versioning, safe deployment pipelines, and latency-aware routing
Collaborate across ML, infrastructure, and product teams to define scalable architectural patterns
Guide model serving architecture decisions using vLLM, ONNX Runtime, and Kubernetes orchestration
Provide technical leadership and mentorship to junior engineers on the inference platform team
Ensure high availability, low latency inference at global scale for MongoDB Atlas customers
Drive semantic search and hybrid retrieval capabilities powering AI-native MongoDB features

Benefits

general: Competitive compensation with significant equity ownership
general: Comprehensive wellness programs including mental health support
general: MongoDB University for continuous learning and certification
general: Global Family Leave with generous parental leave policies
general: Hybrid work model in premium tech hubs (Palo Alto, Seattle)
general: 401(k) matching and employee stock purchase plans
general: Unlimited PTO and flexible work arrangements
general: Annual learning stipend for conferences and professional development
general: Comprehensive health, dental, and vision insurance
general: Volunteer time off and community impact programs

Target Your Resume for "Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now!" , MongoDB

AI-powered keyword optimization

Skills matching & gap analysis

Experience alignment suggestions

Check Your ATS Score for "Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now!" , MongoDB

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check

Keyword optimization analysis

Skill matching & gap identification

Format & readability score

Tags & Categories

Answer 10 quick questions to check your fit for Lead Engineer, Inference Platform Careers at MongoDB - Palo Alto, California, United States | Apply Now! @ MongoDB.

10 Questions

~2 Minutes

Instant Score

Related Books and Jobs

No related jobs found at the moment.

Privacy Terms & Conditions About Us Refund Policy Recruiter Login Sitemap