RESUME AND JOB

Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now!

OpenAI

Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now!

OpenAI

full-timePosted: Feb 10, 2026

Job Description

Software Engineer, Data Infrastructure at OpenAI - San Francisco Careers

Join OpenAI's Data Platform team as a Software Engineer, Data Infrastructure in San Francisco, California. This is your chance to build the foundational data stack powering AI innovation at the world's leading AI research company. We're seeking experienced engineers to scale massive Spark compute fleets, design exabyte-scale data lakes with Iceberg and Delta Lake, and operate high-throughput streaming platforms using Kafka and Flink.

Role Overview

The Data Platform team at OpenAI owns the core infrastructure that supports critical product, research, and analytics workflows. We run some of the largest Spark compute fleets in production and are pushing the boundaries of data engineering with modern tools like Airflow for orchestration, Chronon for ML feature engineering, and Trino for federated querying.

In this role, you'll take full ownership of the data infrastructure lifecycle—from architecture design to production operations and on-call support. Based in our San Francisco headquarters with a hybrid model (3 days in office), you'll collaborate with top researchers and engineers to deliver reliable, secure, and efficient data access at unprecedented scale.

Our vision goes beyond traditional data infrastructure. We're building intelligent, AI-assisted workflows that redefine how teams interact with data, making it faster, more intuitive, and more reliable. If you thrive in ambiguity, love debugging distributed systems, and want to work on exabyte-scale challenges in AI, this role is for you.

OpenAI offers relocation assistance and comprehensive benefits to support your transition to the Bay Area.

Key Responsibilities

As a Software Engineer, Data Infrastructure, your day-to-day will include:

Designing, building, and maintaining distributed compute systems such as massive Spark fleets ensuring scalability to exabyte levels.
Architecting data lakes and metadata systems using Apache Iceberg and Delta Lake for petabyte-scale storage.
Operating high-throughput streaming platforms with Kafka and Flink to handle real-time data flows.
Implementing Airflow-based orchestration for complex data pipelines across research and product teams.
Developing low-latency data ingestion systems to support ML training and real-time analytics.
Enabling secure, governed data access with fine-grained permissions for ML engineers and analysts.
Scaling platforms by orders of magnitude while maintaining 99.99% reliability and efficiency.
Debugging and optimizing large-scale distributed systems during production incidents.
Collaborating cross-functionally with product, research, and analytics teams to unlock new AI capabilities.
Owning system reliability through on-call rotation and proactive monitoring.
Empowering fellow engineers with intuitive data tooling and self-service platforms.
Contributing to AI-assisted data workflows that leverage OpenAI models for intelligent data management.
Using Terraform and IaC practices to deploy and manage infrastructure reliably.

Expect hands-on work with cutting-edge technologies in a fast-paced environment where your contributions directly impact OpenAI's products like ChatGPT and DALL-E.

Qualifications

We're looking for engineers who excel in data infrastructure. You might thrive if you have:

4+ years experience in data infrastructure engineering or infrastructure engineering with strong data focus.
Hands-on experience supporting platforms like Spark, Kafka, Flink, Airflow, Trino, Iceberg, or Delta Lake.
Proficiency with infrastructure-as-code tools like Terraform, Kubernetes, or similar.
Demonstrated expertise debugging and operating large-scale distributed systems.
Experience with data lakes, streaming systems, and ML infrastructure tooling like Chronon.
Comfort taking full ownership from design through production operations and on-call.
Strong systems design skills for scalability, reliability, security at extreme scale.
Ability to navigate ambiguity, rapid iteration, and evolving priorities.
Passion for learning new technologies and clearly communicating complex ideas.
Experience with exabyte-scale architecture planning a plus.

San Francisco location required with hybrid schedule. Visa sponsorship available for exceptional candidates.

Salary & Benefits

OpenAI offers competitive compensation for Software Engineer, Data Infrastructure roles in San Francisco, typically ranging from $250,000 to $450,000 base salary plus equity, depending on experience. Total compensation includes:

Performance bonuses and profit-sharing.
Comprehensive medical, dental, vision coverage.
401(k) with generous company match.
Unlimited PTO and flexible holidays.
Relocation package including housing assistance.
Wellness stipend, mental health support, and family leave.
Learning budget for conferences and courses.
Equity in OpenAI with significant upside potential.
Daily catered meals and fully-stocked kitchens at HQ.

This package positions OpenAI in the top percentile for Bay Area tech compensation.

Why Join OpenAI?

OpenAI is at the forefront of artificial general intelligence, ensuring AGI benefits humanity. Our Data Platform team powers breakthroughs in AI research and deployment. You'll work with the brightest minds on infrastructure that supports world-changing products.

Recent achievements include scaling to handle billions of API calls daily while maintaining sub-second latencies. Join us to redefine data infrastructure for the AI era in San Francisco's vibrant tech ecosystem.

Culture emphasizes impact, ownership, and rapid learning. With hybrid flexibility and top-tier perks, OpenAI invests in your growth and well-being.

How to Apply

Ready to build the data foundation for AGI? Submit your resume and a brief note on your experience with distributed data systems. Our team reviews applications on a rolling basis.

Keywords: OpenAI data engineer jobs San Francisco, Spark engineer careers, Kafka streaming roles, AI infrastructure positions.

Total word count: 1,856

Locations

San Francisco, California, United States

Salary

Estimated Salary Rangehigh confidence

262,500 - 495,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

Apache Sparkintermediate
Kafkaintermediate
Flinkintermediate
Airflowintermediate
Trinointermediate
Icebergintermediate
Delta Lakeintermediate
Terraformintermediate
Distributed Systemsintermediate
Data Lakesintermediate
Streaming Platformsintermediate
ML Feature Engineeringintermediate
Chrononintermediate
Big Data Computeintermediate
Data Orchestrationintermediate
Infrastructure as Codeintermediate
On-call Operationsintermediate
Scalable Storage Systemsintermediate
Data Governanceintermediate
Low Latency Ingestionintermediate

Required Qualifications

4+ years in data infrastructure engineering (experience)
4+ years in infrastructure engineering with strong data interest (experience)
Experience supporting Spark, Kafka, Flink, Airflow, Trino, or Iceberg platforms (experience)
Proficiency in infrastructure tooling like Terraform (experience)
Expertise in debugging large-scale distributed systems (experience)
Comfortable with full lifecycle ownership: architecture, implementation, operations (experience)
Experience with high-throughput streaming systems (experience)
Knowledge of data lakes and metadata systems on Iceberg and Delta (experience)
Familiarity with exabyte-scale architecture design (experience)
Strong skills in ensuring scalability, reliability, and security (experience)
Ability to handle ambiguity and rapid change (experience)
Intrinsic desire to learn and share knowledge clearly (experience)

Responsibilities

Design and build distributed compute systems like Spark fleets
Maintain data orchestration platforms using Airflow
Develop and operate distributed storage systems with Iceberg and Delta
Build high-throughput streaming infrastructure on Kafka and Flink
Implement low-latency data ingestion pipelines
Enable secure and governed data access for ML and analytics teams
Scale data platforms by orders of magnitude while maintaining reliability
Debug and optimize large-scale distributed systems
Collaborate with product, research, and analytics teams on technical foundations
Own production operations including on-call rotation for critical incidents
Empower engineers with excellent data tooling and systems
Design for reliability and performance at extreme scale
Accelerate AI-assisted data workflows and intelligent interfaces

Benefits

general: Competitive salary with equity package
general: Comprehensive health, dental, and vision insurance
general: 401(k) matching program
general: Relocation assistance to San Francisco
general: Hybrid work model: 3 days in office per week
general: Unlimited PTO and flexible vacation policy
general: Mental health and wellness programs
general: Parental leave and family planning benefits
general: Learning and development stipend
general: Gym membership and fitness reimbursements
general: Commuter benefits and free lunches
general: Stock options in a high-growth AI company
general: On-site amenities at SF headquarters
general: Generous employee referral bonuses

Target Your Resume for "Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now!" , OpenAI

Get personalized recommendations to optimize your resume specifically for Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now!. Takes only 15 seconds!

AI-powered keyword optimization

Skills matching & gap analysis

Experience alignment suggestions

Check Your ATS Score for "Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now!" , OpenAI

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check

Keyword optimization analysis

Skill matching & gap identification

Format & readability score

Tags & Categories

software engineer data infrastructure openaiopenai data engineer jobs san franciscospark engineer careers californiakafka flink jobs openaidata platform engineer openaiiceberg delta lake engineerairflow orchestration jobs sfdistributed systems engineer openaiml infrastructure jobs san franciscoterraform data infrastructure rolesexabyte scale data engineeropenai san francisco careersbig data compute engineerstreaming platform engineer kafkaai data infrastructure jobsopenai hybrid jobs californiadata lake architect openaitrino spark jobs bay areasoftware engineer openai salarydata engineering openai sfchronon ml feature jobsApplied AI

Answer 10 quick questions to check your fit for Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now! @ OpenAI.

10 Questions

~2 Minutes

Instant Score

Related Books and Jobs

No related jobs found at the moment.

Privacy Terms & Conditions About Us Refund Policy Recruiter Login Sitemap

Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now!

OpenAI

Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now!

OpenAI

full-timePosted: Feb 10, 2026

Job Description

Software Engineer, Data Infrastructure at OpenAI - San Francisco Careers

Role Overview

OpenAI offers relocation assistance and comprehensive benefits to support your transition to the Bay Area.

Key Responsibilities

As a Software Engineer, Data Infrastructure, your day-to-day will include:

Designing, building, and maintaining distributed compute systems such as massive Spark fleets ensuring scalability to exabyte levels.
Architecting data lakes and metadata systems using Apache Iceberg and Delta Lake for petabyte-scale storage.
Operating high-throughput streaming platforms with Kafka and Flink to handle real-time data flows.
Implementing Airflow-based orchestration for complex data pipelines across research and product teams.
Developing low-latency data ingestion systems to support ML training and real-time analytics.
Enabling secure, governed data access with fine-grained permissions for ML engineers and analysts.
Scaling platforms by orders of magnitude while maintaining 99.99% reliability and efficiency.
Debugging and optimizing large-scale distributed systems during production incidents.
Collaborating cross-functionally with product, research, and analytics teams to unlock new AI capabilities.
Owning system reliability through on-call rotation and proactive monitoring.
Empowering fellow engineers with intuitive data tooling and self-service platforms.
Contributing to AI-assisted data workflows that leverage OpenAI models for intelligent data management.
Using Terraform and IaC practices to deploy and manage infrastructure reliably.

Expect hands-on work with cutting-edge technologies in a fast-paced environment where your contributions directly impact OpenAI's products like ChatGPT and DALL-E.

Qualifications

We're looking for engineers who excel in data infrastructure. You might thrive if you have:

4+ years experience in data infrastructure engineering or infrastructure engineering with strong data focus.
Hands-on experience supporting platforms like Spark, Kafka, Flink, Airflow, Trino, Iceberg, or Delta Lake.
Proficiency with infrastructure-as-code tools like Terraform, Kubernetes, or similar.
Demonstrated expertise debugging and operating large-scale distributed systems.
Experience with data lakes, streaming systems, and ML infrastructure tooling like Chronon.
Comfort taking full ownership from design through production operations and on-call.
Strong systems design skills for scalability, reliability, security at extreme scale.
Ability to navigate ambiguity, rapid iteration, and evolving priorities.
Passion for learning new technologies and clearly communicating complex ideas.
Experience with exabyte-scale architecture planning a plus.

San Francisco location required with hybrid schedule. Visa sponsorship available for exceptional candidates.

Salary & Benefits

Performance bonuses and profit-sharing.
Comprehensive medical, dental, vision coverage.
401(k) with generous company match.
Unlimited PTO and flexible holidays.
Relocation package including housing assistance.
Wellness stipend, mental health support, and family leave.
Learning budget for conferences and courses.
Equity in OpenAI with significant upside potential.
Daily catered meals and fully-stocked kitchens at HQ.

This package positions OpenAI in the top percentile for Bay Area tech compensation.

Why Join OpenAI?

Culture emphasizes impact, ownership, and rapid learning. With hybrid flexibility and top-tier perks, OpenAI invests in your growth and well-being.

How to Apply

Ready to build the data foundation for AGI? Submit your resume and a brief note on your experience with distributed data systems. Our team reviews applications on a rolling basis.

Keywords: OpenAI data engineer jobs San Francisco, Spark engineer careers, Kafka streaming roles, AI infrastructure positions.

Total word count: 1,856

Locations

San Francisco, California, United States

Salary

Estimated Salary Rangehigh confidence

262,500 - 495,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

Apache Sparkintermediate
Kafkaintermediate
Flinkintermediate
Airflowintermediate
Trinointermediate
Icebergintermediate
Delta Lakeintermediate
Terraformintermediate
Distributed Systemsintermediate
Data Lakesintermediate
Streaming Platformsintermediate
ML Feature Engineeringintermediate
Chrononintermediate
Big Data Computeintermediate
Data Orchestrationintermediate
Infrastructure as Codeintermediate
On-call Operationsintermediate
Scalable Storage Systemsintermediate
Data Governanceintermediate
Low Latency Ingestionintermediate

Required Qualifications

4+ years in data infrastructure engineering (experience)
4+ years in infrastructure engineering with strong data interest (experience)
Experience supporting Spark, Kafka, Flink, Airflow, Trino, or Iceberg platforms (experience)
Proficiency in infrastructure tooling like Terraform (experience)
Expertise in debugging large-scale distributed systems (experience)
Comfortable with full lifecycle ownership: architecture, implementation, operations (experience)
Experience with high-throughput streaming systems (experience)
Knowledge of data lakes and metadata systems on Iceberg and Delta (experience)
Familiarity with exabyte-scale architecture design (experience)
Strong skills in ensuring scalability, reliability, and security (experience)
Ability to handle ambiguity and rapid change (experience)
Intrinsic desire to learn and share knowledge clearly (experience)

Responsibilities

Design and build distributed compute systems like Spark fleets
Maintain data orchestration platforms using Airflow
Develop and operate distributed storage systems with Iceberg and Delta
Build high-throughput streaming infrastructure on Kafka and Flink
Implement low-latency data ingestion pipelines
Enable secure and governed data access for ML and analytics teams
Scale data platforms by orders of magnitude while maintaining reliability
Debug and optimize large-scale distributed systems
Collaborate with product, research, and analytics teams on technical foundations
Own production operations including on-call rotation for critical incidents
Empower engineers with excellent data tooling and systems
Design for reliability and performance at extreme scale
Accelerate AI-assisted data workflows and intelligent interfaces

Benefits

general: Competitive salary with equity package
general: Comprehensive health, dental, and vision insurance
general: 401(k) matching program
general: Relocation assistance to San Francisco
general: Hybrid work model: 3 days in office per week
general: Unlimited PTO and flexible vacation policy
general: Mental health and wellness programs
general: Parental leave and family planning benefits
general: Learning and development stipend
general: Gym membership and fitness reimbursements
general: Commuter benefits and free lunches
general: Stock options in a high-growth AI company
general: On-site amenities at SF headquarters
general: Generous employee referral bonuses

Target Your Resume for "Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now!" , OpenAI

Get personalized recommendations to optimize your resume specifically for Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now!. Takes only 15 seconds!

AI-powered keyword optimization

Skills matching & gap analysis

Experience alignment suggestions

Check Your ATS Score for "Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now!" , OpenAI

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check

Keyword optimization analysis

Skill matching & gap identification

Format & readability score

Tags & Categories

Answer 10 quick questions to check your fit for Software Engineer, Data Infrastructure Careers at OpenAI - San Francisco, California | Apply Now! @ OpenAI.

10 Questions

~2 Minutes

Instant Score

Related Books and Jobs

No related jobs found at the moment.

Privacy Terms & Conditions About Us Refund Policy Recruiter Login Sitemap