Resume and JobRESUME AND JOB
xAI logo

Site Reliability Engineer (SRE)

xAI

Site Reliability Engineer (SRE)

full-timePosted: Dec 29, 2025

Job Description

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the team

You will work on the team that is responsible for the backend services that power our products such as grok.com and the API. We focus on writing and maintaining highly scalable and reliable services that can efficiently process tens of thousands of queries per second. The services are hosted on a number of Kubernetes clusters (on-prem & cloud).

About the role

An ideal candidate meets at least the following requirements:

  • Expert knowledge of Kubernetes,
  • Expert knowledge of continuous deployment systems such as Buildkite and ArgoCD,
  • Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty,
  • Expert knowledge of infrastructure as code technologies such as Pulumi or Terraform.
  • Experience with traffic management and HTTP proxies such as nginx and envoy.

Location

This position is in-person in London, UK. We usually work from the office 5 days a week but allow for work-from-home days when required. Candidates must be willing to attend late meetings at least once a week to coordinate with the rest of our team in Palo Alto.

Interview process

After submitting your application, the team reviews your statement of exceptional work and CV. If your application passes this stage, the interview process is as follows:

  1. Initial technical screening during which a member of our team will ask some basic technical questions (15 minutes)
  2. Coding interview in Go (45 minutes)
  3. Distributed system design interview (45 minutes)
  4. Final stage with founding engineer Toby Pohlen (30 minutes)

All interviews will be conducted via Google Meet.

Benefits

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to our Aviva pension plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

Privacy Policy

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

Locations

  • London, UK,

Salary

Salary details available upon request

Estimated Salary Rangemedium confidence

300,000 - 800,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • Kubernetesintermediate
  • Buildkiteintermediate
  • ArgoCDintermediate
  • Prometheusintermediate
  • Grafanaintermediate
  • PagerDutyintermediate
  • Pulumiintermediate
  • Terraformintermediate
  • nginxintermediate
  • envoyintermediate
  • Gointermediate
  • strong communication skillsintermediate
  • strong prioritization skillsintermediate

Required Qualifications

  • Expert knowledge of Kubernetes (experience)
  • Expert knowledge of continuous deployment systems such as Buildkite and ArgoCD (experience)
  • Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty (experience)
  • Expert knowledge of infrastructure as code technologies such as Pulumi or Terraform (experience)
  • Experience with traffic management and HTTP proxies such as nginx and envoy (experience)

Benefits

  • general: equity
  • general: comprehensive medical, vision, and dental coverage
  • general: access to our Aviva pension plan
  • general: short & long-term disability insurance
  • general: life insurance
  • general: various other discounts and perks

Target Your Resume for "Site Reliability Engineer (SRE)" , xAI

Get personalized recommendations to optimize your resume specifically for Site Reliability Engineer (SRE). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Site Reliability Engineer (SRE)" , xAI

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

ProductProduct
Quiz Challenge

Answer 10 quick questions to check your fit for Site Reliability Engineer (SRE) @ xAI.

10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

xAI logo

Site Reliability Engineer (SRE)

xAI

Site Reliability Engineer (SRE)

full-timePosted: Dec 29, 2025

Job Description

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the team

You will work on the team that is responsible for the backend services that power our products such as grok.com and the API. We focus on writing and maintaining highly scalable and reliable services that can efficiently process tens of thousands of queries per second. The services are hosted on a number of Kubernetes clusters (on-prem & cloud).

About the role

An ideal candidate meets at least the following requirements:

  • Expert knowledge of Kubernetes,
  • Expert knowledge of continuous deployment systems such as Buildkite and ArgoCD,
  • Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty,
  • Expert knowledge of infrastructure as code technologies such as Pulumi or Terraform.
  • Experience with traffic management and HTTP proxies such as nginx and envoy.

Location

This position is in-person in London, UK. We usually work from the office 5 days a week but allow for work-from-home days when required. Candidates must be willing to attend late meetings at least once a week to coordinate with the rest of our team in Palo Alto.

Interview process

After submitting your application, the team reviews your statement of exceptional work and CV. If your application passes this stage, the interview process is as follows:

  1. Initial technical screening during which a member of our team will ask some basic technical questions (15 minutes)
  2. Coding interview in Go (45 minutes)
  3. Distributed system design interview (45 minutes)
  4. Final stage with founding engineer Toby Pohlen (30 minutes)

All interviews will be conducted via Google Meet.

Benefits

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to our Aviva pension plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

Privacy Policy

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

Locations

  • London, UK,

Salary

Salary details available upon request

Estimated Salary Rangemedium confidence

300,000 - 800,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • Kubernetesintermediate
  • Buildkiteintermediate
  • ArgoCDintermediate
  • Prometheusintermediate
  • Grafanaintermediate
  • PagerDutyintermediate
  • Pulumiintermediate
  • Terraformintermediate
  • nginxintermediate
  • envoyintermediate
  • Gointermediate
  • strong communication skillsintermediate
  • strong prioritization skillsintermediate

Required Qualifications

  • Expert knowledge of Kubernetes (experience)
  • Expert knowledge of continuous deployment systems such as Buildkite and ArgoCD (experience)
  • Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty (experience)
  • Expert knowledge of infrastructure as code technologies such as Pulumi or Terraform (experience)
  • Experience with traffic management and HTTP proxies such as nginx and envoy (experience)

Benefits

  • general: equity
  • general: comprehensive medical, vision, and dental coverage
  • general: access to our Aviva pension plan
  • general: short & long-term disability insurance
  • general: life insurance
  • general: various other discounts and perks

Target Your Resume for "Site Reliability Engineer (SRE)" , xAI

Get personalized recommendations to optimize your resume specifically for Site Reliability Engineer (SRE). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Site Reliability Engineer (SRE)" , xAI

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

ProductProduct
Quiz Challenge

Answer 10 quick questions to check your fit for Site Reliability Engineer (SRE) @ xAI.

10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.