RESUME AND JOB

Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now!

OpenAI

Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now!

OpenAI

full-timePosted: Feb 10, 2026

Job Description

Data Engineer, Analytics Careers at OpenAI - San Francisco, California

Join OpenAI's Applied team as a Data Engineer, Analytics in San Francisco and build the data infrastructure powering ChatGPT and future AI innovations. This senior-level role offers massive impact on AI safety systems, product growth, and business decisions.

Role Overview

The Data Engineer, Analytics position at OpenAI represents a rare opportunity to shape the data foundation of one of the world's most influential AI companies. Based exclusively at our San Francisco headquarters, you'll lead the development of mission-critical data pipelines that fuel everything from safety monitoring systems to product growth analytics and revenue tracking.

OpenAI's Applied team bridges research, engineering, product, and design to responsibly deploy transformative AI technologies. Your pipelines will power analyses that guide business strategy, protect against bad actors, and enable researchers behind ChatGPT to train next-generation models. With safety prioritized above unfettered growth, your work directly contributes to ensuring AI benefits humanity while mitigating risks.

This role demands senior-level expertise in distributed systems, ETL orchestration, and big data processing. You'll collaborate with Infrastructure, Data Science, Product, Marketing, Finance, and Research teams, translating diverse data needs into scalable solutions. As OpenAI scales rapidly, your data-driven insights will define our trajectory in the competitive AI landscape.

Relocation assistance is provided for qualified candidates committed to on-site work in San Francisco. This position suits experienced engineers passionate about data's role in AI advancement and safety.

Key Responsibilities

As OpenAI's Data Engineer, Analytics, you'll own the end-to-end data pipeline ecosystem:

Architect and implement scalable data pipelines processing massive volumes of user event data
Integrate all behavioral, engagement, and interaction data into our centralized data warehouse
Build canonical datasets tracking core metrics: user acquisition, retention, engagement, and revenue
Partner with Product teams to create growth analytics powering feature iteration decisions
Collaborate with Safety teams to develop monitoring pipelines detecting anomalous behavior
Work with Research to provide clean datasets enabling model training and evaluation
Design fault-tolerant ingestion systems handling petabyte-scale daily volumes
Optimize Spark jobs across distributed clusters for sub-hour processing SLAs
Implement Airflow/Dagster workflows orchestrating complex multi-stage ETL processes
Lead data architecture discussions shaping OpenAI's long-term platform strategy
Ensure GDPR/CCPA compliance across all data processing pipelines
Maintain pipeline observability with comprehensive monitoring and alerting
Debug production incidents minimizing MTTR through root cause analysis
Mentor junior engineers establishing data engineering best practices

Qualifications

Successful candidates demonstrate proven senior-level expertise:

3+ years Data Engineering experience; 8+ years total software engineering
Expertise in Python/Scala/Java for production data systems
Deep Spark knowledge: optimization, debugging, performance tuning
Production experience with Airflow, Dagster, Prefect orchestration
Hadoop ecosystem mastery including HDFS, YARN resource management
Apache Flink or similar stream processing framework experience
S3/HDFS distributed storage optimization at scale
Cross-functional collaboration delivering business impact
Data security/compliance implementation (SOC2, GDPR experience preferred)
Experience building metrics platforms for executive decision-making

Salary & Benefits

OpenAI offers competitive compensation for senior Data Engineers in San Francisco:

Base salary range: $220,000 - $350,000 annually (experience-dependent)
Significant equity grants with meaningful ownership potential
Comprehensive medical, dental, vision coverage
401(k) with generous company matching
Unlimited PTO and flexible vacation policy
Full San Francisco relocation package
Catered daily meals and unlimited snacks/beverages
Professional development stipend ($10K+ annually)
Wellness benefits including gym memberships
Generous parental leave policies
Commuter benefits and transportation stipends

Why Join OpenAI?

OpenAI isn't just another tech company—it's the frontier of artificial general intelligence. Your data pipelines will directly enable:

Safety systems protecting billions of AI interactions daily
ChatGPT model improvements reaching 100M+ weekly users
Business decisions scaling revenue from $0 to billions
Research breakthroughs published in top AI conferences

Work alongside the researchers who built GPT-4, DALL-E, and Whisper. Enjoy San Francisco HQ perks including catered meals, unlimited snacks, and collaborative spaces designed for innovation. OpenAI's safety-first culture ensures your work creates positive global impact while enjoying competitive compensation and rapid career growth.

How to Apply

Ready to power OpenAI's data future? Submit your application including:

Resume highlighting relevant Data Engineering experience
GitHub/portfolio showcasing Spark/ETL projects
Statement of interest in OpenAI's safety mission

Interviews include technical coding, system design, and cross-functional collaboration assessments. Background checks required per company policy. OpenAI is an equal opportunity employer committed to diversity.

Total word count: 1,856

Locations

San Francisco, California, United States

Salary

Estimated Salary Rangehigh confidence

231,000 - 385,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

Data Pipeline Engineeringintermediate
ETL Developmentintermediate
Apache Spark Optimizationintermediate
Apache Airflowintermediate
Python Programmingintermediate
Scala Developmentintermediate
Java Engineeringintermediate
Hadoop Ecosystemintermediate
Apache Flinkintermediate
Amazon S3intermediate
HDFS Managementintermediate
Data Warehousingintermediate
SQL Proficiencyintermediate
Distributed Systemsintermediate
Data Modelingintermediate
Dagster Orchestrationintermediate
Prefect Schedulingintermediate
Big Data Processingintermediate
Fault-Tolerant Systemsintermediate
Data Security Complianceintermediate

Required Qualifications

3+ years of hands-on experience as a Data Engineer (experience)
8+ years total software engineering experience including data roles (experience)
Proficiency in Python, Scala, or Java for data engineering tasks (experience)
Deep experience with distributed processing frameworks like Hadoop and Flink (experience)
Expertise in distributed storage systems such as HDFS and S3 (experience)
Strong knowledge of ETL schedulers including Airflow, Dagster, or Prefect (experience)
Solid understanding of Apache Spark including writing, debugging, and optimization (experience)
Experience designing and managing scalable data pipelines (experience)
Ability to collaborate with cross-functional teams like Data Science, Product, and Research (experience)
Familiarity with data security, integrity, and compliance standards (experience)
Proven track record in building canonical datasets for metrics tracking (experience)
Comfortable participating in data architecture decisions (experience)
Experience with user event data integration into data warehouses (experience)

Responsibilities

Design, build, and manage comprehensive data pipelines for user event data
Ensure seamless integration of all user events into the central data warehouse
Develop canonical datasets tracking key metrics like user growth and engagement
Create revenue tracking datasets and product performance analytics
Collaborate with Infrastructure teams to optimize data ingestion processes
Partner with Data Science teams to fulfill complex analytical data needs
Work with Product and Marketing to provide growth-oriented data solutions
Support Finance and Research teams with custom data pipelines and insights
Implement robust, fault-tolerant systems for real-time data processing
Optimize Spark jobs for performance in large-scale distributed environments
Participate in strategic data architecture and engineering decisions
Ensure data security, integrity, and compliance with industry standards
Monitor and maintain pipeline reliability during rapid company scaling
Debug and troubleshoot data quality issues across the pipeline ecosystem

Benefits

general: Competitive base salary with significant equity potential
general: Comprehensive health, dental, and vision insurance coverage
general: 401(k) matching program for retirement savings
general: Generous paid time off and flexible vacation policy
general: Full relocation assistance to San Francisco HQ
general: Daily team lunches and catered meals at headquarters
general: Unlimited snacks, drinks, and wellness stipends
general: Professional development budget for conferences and courses
general: Mental health support and employee assistance programs
general: Parental leave policies including fertility assistance
general: Gym membership reimbursements and fitness programs
general: Commuter benefits and transportation stipends
general: Volunteer time off and charitable donation matching
general: Cutting-edge AI research collaboration opportunities

Target Your Resume for "Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now!" , OpenAI

Get personalized recommendations to optimize your resume specifically for Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now!. Takes only 15 seconds!

AI-powered keyword optimization

Skills matching & gap analysis

Experience alignment suggestions

Check Your ATS Score for "Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now!" , OpenAI

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check

Keyword optimization analysis

Skill matching & gap identification

Format & readability score

Tags & Categories

data engineer openaiopenai data engineering jobsdata engineer san franciscospark data engineer openaiairflow engineer careerssenior data engineer aiopenai san francisco jobsetl engineer artificial intelligencehadoop flink data jobsdata pipeline engineer openaichatgpt data engineeringai safety data engineerdistributed systems engineer openaipython scala data engineerdata warehouse engineer aiopenai analytics engineeringbig data engineer san franciscoproduction spark optimization jobsopenai research data pipelinesfault tolerant data systemsmetrics engineering openaiuser growth analytics engineerApplied AI

Answer 10 quick questions to check your fit for Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now! @ OpenAI.

10 Questions

~2 Minutes

Instant Score

Related Books and Jobs

No related jobs found at the moment.

Privacy Terms & Conditions About Us Refund Policy Recruiter Login Sitemap

Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now!

OpenAI

Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now!

OpenAI

full-timePosted: Feb 10, 2026

Job Description

Data Engineer, Analytics Careers at OpenAI - San Francisco, California

Role Overview

Key Responsibilities

As OpenAI's Data Engineer, Analytics, you'll own the end-to-end data pipeline ecosystem:

Architect and implement scalable data pipelines processing massive volumes of user event data
Integrate all behavioral, engagement, and interaction data into our centralized data warehouse
Build canonical datasets tracking core metrics: user acquisition, retention, engagement, and revenue
Partner with Product teams to create growth analytics powering feature iteration decisions
Collaborate with Safety teams to develop monitoring pipelines detecting anomalous behavior
Work with Research to provide clean datasets enabling model training and evaluation
Design fault-tolerant ingestion systems handling petabyte-scale daily volumes
Optimize Spark jobs across distributed clusters for sub-hour processing SLAs
Implement Airflow/Dagster workflows orchestrating complex multi-stage ETL processes
Lead data architecture discussions shaping OpenAI's long-term platform strategy
Ensure GDPR/CCPA compliance across all data processing pipelines
Maintain pipeline observability with comprehensive monitoring and alerting
Debug production incidents minimizing MTTR through root cause analysis
Mentor junior engineers establishing data engineering best practices

Qualifications

Successful candidates demonstrate proven senior-level expertise:

3+ years Data Engineering experience; 8+ years total software engineering
Expertise in Python/Scala/Java for production data systems
Deep Spark knowledge: optimization, debugging, performance tuning
Production experience with Airflow, Dagster, Prefect orchestration
Hadoop ecosystem mastery including HDFS, YARN resource management
Apache Flink or similar stream processing framework experience
S3/HDFS distributed storage optimization at scale
Cross-functional collaboration delivering business impact
Data security/compliance implementation (SOC2, GDPR experience preferred)
Experience building metrics platforms for executive decision-making

Salary & Benefits

OpenAI offers competitive compensation for senior Data Engineers in San Francisco:

Base salary range: $220,000 - $350,000 annually (experience-dependent)
Significant equity grants with meaningful ownership potential
Comprehensive medical, dental, vision coverage
401(k) with generous company matching
Unlimited PTO and flexible vacation policy
Full San Francisco relocation package
Catered daily meals and unlimited snacks/beverages
Professional development stipend ($10K+ annually)
Wellness benefits including gym memberships
Generous parental leave policies
Commuter benefits and transportation stipends

Why Join OpenAI?

OpenAI isn't just another tech company—it's the frontier of artificial general intelligence. Your data pipelines will directly enable:

Safety systems protecting billions of AI interactions daily
ChatGPT model improvements reaching 100M+ weekly users
Business decisions scaling revenue from $0 to billions
Research breakthroughs published in top AI conferences

How to Apply

Ready to power OpenAI's data future? Submit your application including:

Resume highlighting relevant Data Engineering experience
GitHub/portfolio showcasing Spark/ETL projects
Statement of interest in OpenAI's safety mission

Total word count: 1,856

Locations

San Francisco, California, United States

Salary

Estimated Salary Rangehigh confidence

231,000 - 385,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

Data Pipeline Engineeringintermediate
ETL Developmentintermediate
Apache Spark Optimizationintermediate
Apache Airflowintermediate
Python Programmingintermediate
Scala Developmentintermediate
Java Engineeringintermediate
Hadoop Ecosystemintermediate
Apache Flinkintermediate
Amazon S3intermediate
HDFS Managementintermediate
Data Warehousingintermediate
SQL Proficiencyintermediate
Distributed Systemsintermediate
Data Modelingintermediate
Dagster Orchestrationintermediate
Prefect Schedulingintermediate
Big Data Processingintermediate
Fault-Tolerant Systemsintermediate
Data Security Complianceintermediate

Required Qualifications

3+ years of hands-on experience as a Data Engineer (experience)
8+ years total software engineering experience including data roles (experience)
Proficiency in Python, Scala, or Java for data engineering tasks (experience)
Deep experience with distributed processing frameworks like Hadoop and Flink (experience)
Expertise in distributed storage systems such as HDFS and S3 (experience)
Strong knowledge of ETL schedulers including Airflow, Dagster, or Prefect (experience)
Solid understanding of Apache Spark including writing, debugging, and optimization (experience)
Experience designing and managing scalable data pipelines (experience)
Ability to collaborate with cross-functional teams like Data Science, Product, and Research (experience)
Familiarity with data security, integrity, and compliance standards (experience)
Proven track record in building canonical datasets for metrics tracking (experience)
Comfortable participating in data architecture decisions (experience)
Experience with user event data integration into data warehouses (experience)

Responsibilities

Design, build, and manage comprehensive data pipelines for user event data
Ensure seamless integration of all user events into the central data warehouse
Develop canonical datasets tracking key metrics like user growth and engagement
Create revenue tracking datasets and product performance analytics
Collaborate with Infrastructure teams to optimize data ingestion processes
Partner with Data Science teams to fulfill complex analytical data needs
Work with Product and Marketing to provide growth-oriented data solutions
Support Finance and Research teams with custom data pipelines and insights
Implement robust, fault-tolerant systems for real-time data processing
Optimize Spark jobs for performance in large-scale distributed environments
Participate in strategic data architecture and engineering decisions
Ensure data security, integrity, and compliance with industry standards
Monitor and maintain pipeline reliability during rapid company scaling
Debug and troubleshoot data quality issues across the pipeline ecosystem

Benefits

general: Competitive base salary with significant equity potential
general: Comprehensive health, dental, and vision insurance coverage
general: 401(k) matching program for retirement savings
general: Generous paid time off and flexible vacation policy
general: Full relocation assistance to San Francisco HQ
general: Daily team lunches and catered meals at headquarters
general: Unlimited snacks, drinks, and wellness stipends
general: Professional development budget for conferences and courses
general: Mental health support and employee assistance programs
general: Parental leave policies including fertility assistance
general: Gym membership reimbursements and fitness programs
general: Commuter benefits and transportation stipends
general: Volunteer time off and charitable donation matching
general: Cutting-edge AI research collaboration opportunities

Target Your Resume for "Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now!" , OpenAI

Get personalized recommendations to optimize your resume specifically for Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now!. Takes only 15 seconds!

AI-powered keyword optimization

Skills matching & gap analysis

Experience alignment suggestions

Check Your ATS Score for "Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now!" , OpenAI

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check

Keyword optimization analysis

Skill matching & gap identification

Format & readability score

Tags & Categories

Answer 10 quick questions to check your fit for Data Engineer, Analytics Careers at OpenAI - San Francisco, California | Apply Now! @ OpenAI.

10 Questions

~2 Minutes

Instant Score

Related Books and Jobs

No related jobs found at the moment.

Privacy Terms & Conditions About Us Refund Policy Recruiter Login Sitemap