Resume and JobRESUME AND JOB
Grammarly logo

Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!

Grammarly

Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!

full-timePosted: Jan 16, 2026

Job Description

Site Reliability Engineer (SRE) at Grammarly - San Francisco, CA (Hybrid)

Role Overview

Grammarly, now proudly part of Superhuman—the leading AI productivity platform—is seeking a talented Site Reliability Engineer (SRE) to join our infrastructure team in San Francisco, CA. This hybrid role offers the perfect balance of focused remote work and in-person collaboration to build trust, drive innovation, and cultivate a thriving team culture.

Superhuman empowers over 40 million people, 50,000 organizations, and 3,000 educational institutions worldwide with AI tools like Grammarly's writing assistance, Coda's collaborative workspaces, Mail's inbox management, and Go's proactive AI assistant. As an SRE, you'll play a pivotal role in scaling our Kubernetes-based systems that process billions of events daily, ensuring rock-solid reliability for our global user base.

We're transitioning from a 'you build it, you own it' model, partnering with our EU production engineering teams to implement world-class SRE practices. If you thrive in fast-paced environments, love automating away toil, and want to impact products used by millions, this is your opportunity to shine at Grammarly.

Our engineers have unprecedented freedom to innovate, influence the product roadmap, and tackle complex challenges in AI, ML deployment, and massive-scale infrastructure. Dive into our technical blog to hear directly from the team.

Key Responsibilities

As a Site Reliability Engineer at Grammarly, you'll wear multiple hats—builder, operator, and innovator. Here's what your day-to-day will look like:

  • Scale our Kubernetes control plane: Handle billions of events per day with zero downtime.
  • Enhance automation: Build reactive systems that adapt to dynamic workloads in real-time.
  • Deploy ML models company-wide: Ensure seamless integration and high availability for AI features.
  • Collaborate cross-functionally: Partner with developers to design reliable back-end systems from day one.
  • Incident leadership: Lead post-mortems, reduce MTTR, and prevent recurrence through automation.
  • Infrastructure planning: Forecast growth and architect scalable solutions for the future.
  • Observability mastery: Implement Prometheus, Grafana, and tracing for full-stack visibility.
  • CI/CD optimization: Streamline pipelines using Terraform, Docker, and Kubernetes.
  • On-call excellence: Participate in rotations with generous stipends and comprehensive support.
  • Runbook development: Create golden paths for common issues and knowledge sharing.
  • Performance tuning: Profile systems, eliminate bottlenecks, and boost efficiency.
  • Security & compliance: Harden infrastructure against threats while maintaining agility.
  • Mentorship: Guide junior engineers and contribute to SRE best practices.
  • Innovation: Experiment with cutting-edge tools to push reliability boundaries.

Expect to work on greenfield projects alongside mature production systems, balancing immediate impact with long-term strategy.

Qualifications

We're looking for battle-tested SREs who embody our EAGER values (Ethical, Adaptable, Gritty, Empathetic, Remarkable) and MOVE principles (Move fast, Obsess about customer value, Value impact, Embrace disagreement):

  • 5+ years as SRE, DevOps, or Production Engineer
  • Deep experience with incident management and blameless post-mortems
  • Expertise in Docker, Linux, and Terraform IaC
  • Hands-on with AWS, Azure, or GCP (multi-cloud experience a plus)
  • Kubernetes and Java proficiency preferred
  • Proven independence: minimal guidance, maximum ownership
  • Cross-functional collaboration wizard
  • Thrives in ambiguity and high-velocity environments

No degree required—demonstrate your skills through experience and passion for reliability.

Salary & Benefits

Competitive Compensation: $180,000–$250,000 base + equity + bonuses, depending on experience. Total comp can exceed $350K for top performers.

World-Class Benefits:

  • Comprehensive medical, dental, vision, mental health, fertility coverage
  • Disability & life insurance
  • 401(k) with match
  • Unlimited PTO + holidays
  • Hybrid SF model + remote flexibility
  • Professional growth stipend
  • Home office setup
  • Team events, wellness reimbursements
  • Generous parental leave

Why Join Grammarly?

Grammarly isn't just another company—it's the AI backbone for how the world communicates. Join us to:

  • Work on products with 40M+ users
  • Innovate at the intersection of AI and infrastructure
  • Grow with a values-driven culture (read our values)
  • Enjoy SF's tech ecosystem + hybrid perks

Superhuman's mission: unlock superhuman potential. Your reliability engineering will make it possible.

How to Apply

Ready to build the future of AI reliability? Submit your resume and a brief note on your favorite SRE project. We review applications weekly and respond within 48 hours. Let's chat!

No agencies, please. Grammarly is an equal opportunity employer.

Locations

  • San Francisco, California, United States

Salary

Estimated Salary Rangehigh confidence

189,000 - 275,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • Site Reliability Engineeringintermediate
  • Kubernetesintermediate
  • Dockerintermediate
  • Terraformintermediate
  • Linux Administrationintermediate
  • AWSintermediate
  • Azureintermediate
  • GCPintermediate
  • Java Programmingintermediate
  • Incident Managementintermediate
  • DevOps Practicesintermediate
  • CI/CD Pipelinesintermediate
  • Monitoring Toolsintermediate
  • Prometheusintermediate
  • Grafanaintermediate
  • ELK Stackintermediate
  • Infrastructure as Codeintermediate
  • Microservices Architectureintermediate
  • Load Balancingintermediate
  • Automation Scriptingintermediate

Required Qualifications

  • 5+ years of relevant experience as an SRE or DevOps engineer (experience)
  • Proven experience in participating in incident management processes (experience)
  • Strong familiarity with Docker containerization (experience)
  • Expertise in Linux system administration (experience)
  • Hands-on experience with Terraform for infrastructure as code (experience)
  • Experience using AWS, Azure, or GCP cloud platforms (experience)
  • Java programming skills preferred (experience)
  • Kubernetes orchestration skills preferred (experience)
  • Demonstrated ability to work independently with minimal guidance (experience)
  • Proactively manages tasks and priorities across multiple projects (experience)
  • Analyzes and executes work efficiently (experience)
  • Collaborates effectively with cross-functional teams (experience)
  • Thrives in fast-paced, results-driven environments (experience)
  • Embodies EAGER values: ethical, adaptable, gritty, empathetic, remarkable (experience)
  • Inspired by MOVE principles: move fast, obsess about customer value (experience)

Responsibilities

  • Scale Kubernetes-based control plane processing billions of events per day
  • Improve automation mechanisms that react to workload changes
  • Deploy ML systems across the company infrastructure
  • Build software to ensure reliability of back-end systems
  • Collaborate with engineers developing back-end systems
  • Plan for future infrastructure growth and scalability
  • Work with EU production engineering teams during transition
  • Participate in incident management and on-call rotations
  • Monitor system performance and reliability metrics
  • Implement observability tools like Prometheus and Grafana
  • Optimize CI/CD pipelines for faster deployments
  • Troubleshoot and resolve production issues efficiently
  • Develop runbooks and automation for common incidents
  • Conduct capacity planning and performance tuning

Benefits

  • general: Excellent health care including medical, dental, vision coverage
  • general: Wide range of mental health benefits
  • general: Fertility benefits and family planning support
  • general: Disability insurance options
  • general: Life insurance coverage
  • general: Competitive base salary
  • general: Equity in a high-growth AI company
  • general: 401(k) retirement plan with company match
  • general: Unlimited PTO policy
  • general: Hybrid work model with flexibility
  • general: Professional development stipend
  • general: Home office setup allowance
  • general: Weekly team lunches and social events
  • general: Parental leave benefits
  • general: Wellness and gym membership reimbursements

Target Your Resume for "Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!" , Grammarly

Get personalized recommendations to optimize your resume specifically for Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!" , Grammarly

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

site reliability engineer grammarlysre jobs san franciscokubernetes engineer hybriddevops engineer grammarlysre kubernetes awssite reliability engineering careersgrammarly infrastructure jobssre san francisco hybridterraform docker linux jobsml deployment engineersre incident managementsuperhuman sre positionsbackend reliability engineercloud sre gcp azurejava kubernetes developerproduction engineering grammarlysre eu team collaborationai infrastructure jobs sfscalability engineer grammarlydevops hybrid san franciscosre billions events kubernetesEngineering

Answer 10 quick questions to check your fit for Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now! @ Grammarly.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

Grammarly logo

Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!

Grammarly

Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!

full-timePosted: Jan 16, 2026

Job Description

Site Reliability Engineer (SRE) at Grammarly - San Francisco, CA (Hybrid)

Role Overview

Grammarly, now proudly part of Superhuman—the leading AI productivity platform—is seeking a talented Site Reliability Engineer (SRE) to join our infrastructure team in San Francisco, CA. This hybrid role offers the perfect balance of focused remote work and in-person collaboration to build trust, drive innovation, and cultivate a thriving team culture.

Superhuman empowers over 40 million people, 50,000 organizations, and 3,000 educational institutions worldwide with AI tools like Grammarly's writing assistance, Coda's collaborative workspaces, Mail's inbox management, and Go's proactive AI assistant. As an SRE, you'll play a pivotal role in scaling our Kubernetes-based systems that process billions of events daily, ensuring rock-solid reliability for our global user base.

We're transitioning from a 'you build it, you own it' model, partnering with our EU production engineering teams to implement world-class SRE practices. If you thrive in fast-paced environments, love automating away toil, and want to impact products used by millions, this is your opportunity to shine at Grammarly.

Our engineers have unprecedented freedom to innovate, influence the product roadmap, and tackle complex challenges in AI, ML deployment, and massive-scale infrastructure. Dive into our technical blog to hear directly from the team.

Key Responsibilities

As a Site Reliability Engineer at Grammarly, you'll wear multiple hats—builder, operator, and innovator. Here's what your day-to-day will look like:

  • Scale our Kubernetes control plane: Handle billions of events per day with zero downtime.
  • Enhance automation: Build reactive systems that adapt to dynamic workloads in real-time.
  • Deploy ML models company-wide: Ensure seamless integration and high availability for AI features.
  • Collaborate cross-functionally: Partner with developers to design reliable back-end systems from day one.
  • Incident leadership: Lead post-mortems, reduce MTTR, and prevent recurrence through automation.
  • Infrastructure planning: Forecast growth and architect scalable solutions for the future.
  • Observability mastery: Implement Prometheus, Grafana, and tracing for full-stack visibility.
  • CI/CD optimization: Streamline pipelines using Terraform, Docker, and Kubernetes.
  • On-call excellence: Participate in rotations with generous stipends and comprehensive support.
  • Runbook development: Create golden paths for common issues and knowledge sharing.
  • Performance tuning: Profile systems, eliminate bottlenecks, and boost efficiency.
  • Security & compliance: Harden infrastructure against threats while maintaining agility.
  • Mentorship: Guide junior engineers and contribute to SRE best practices.
  • Innovation: Experiment with cutting-edge tools to push reliability boundaries.

Expect to work on greenfield projects alongside mature production systems, balancing immediate impact with long-term strategy.

Qualifications

We're looking for battle-tested SREs who embody our EAGER values (Ethical, Adaptable, Gritty, Empathetic, Remarkable) and MOVE principles (Move fast, Obsess about customer value, Value impact, Embrace disagreement):

  • 5+ years as SRE, DevOps, or Production Engineer
  • Deep experience with incident management and blameless post-mortems
  • Expertise in Docker, Linux, and Terraform IaC
  • Hands-on with AWS, Azure, or GCP (multi-cloud experience a plus)
  • Kubernetes and Java proficiency preferred
  • Proven independence: minimal guidance, maximum ownership
  • Cross-functional collaboration wizard
  • Thrives in ambiguity and high-velocity environments

No degree required—demonstrate your skills through experience and passion for reliability.

Salary & Benefits

Competitive Compensation: $180,000–$250,000 base + equity + bonuses, depending on experience. Total comp can exceed $350K for top performers.

World-Class Benefits:

  • Comprehensive medical, dental, vision, mental health, fertility coverage
  • Disability & life insurance
  • 401(k) with match
  • Unlimited PTO + holidays
  • Hybrid SF model + remote flexibility
  • Professional growth stipend
  • Home office setup
  • Team events, wellness reimbursements
  • Generous parental leave

Why Join Grammarly?

Grammarly isn't just another company—it's the AI backbone for how the world communicates. Join us to:

  • Work on products with 40M+ users
  • Innovate at the intersection of AI and infrastructure
  • Grow with a values-driven culture (read our values)
  • Enjoy SF's tech ecosystem + hybrid perks

Superhuman's mission: unlock superhuman potential. Your reliability engineering will make it possible.

How to Apply

Ready to build the future of AI reliability? Submit your resume and a brief note on your favorite SRE project. We review applications weekly and respond within 48 hours. Let's chat!

No agencies, please. Grammarly is an equal opportunity employer.

Locations

  • San Francisco, California, United States

Salary

Estimated Salary Rangehigh confidence

189,000 - 275,000 USD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • Site Reliability Engineeringintermediate
  • Kubernetesintermediate
  • Dockerintermediate
  • Terraformintermediate
  • Linux Administrationintermediate
  • AWSintermediate
  • Azureintermediate
  • GCPintermediate
  • Java Programmingintermediate
  • Incident Managementintermediate
  • DevOps Practicesintermediate
  • CI/CD Pipelinesintermediate
  • Monitoring Toolsintermediate
  • Prometheusintermediate
  • Grafanaintermediate
  • ELK Stackintermediate
  • Infrastructure as Codeintermediate
  • Microservices Architectureintermediate
  • Load Balancingintermediate
  • Automation Scriptingintermediate

Required Qualifications

  • 5+ years of relevant experience as an SRE or DevOps engineer (experience)
  • Proven experience in participating in incident management processes (experience)
  • Strong familiarity with Docker containerization (experience)
  • Expertise in Linux system administration (experience)
  • Hands-on experience with Terraform for infrastructure as code (experience)
  • Experience using AWS, Azure, or GCP cloud platforms (experience)
  • Java programming skills preferred (experience)
  • Kubernetes orchestration skills preferred (experience)
  • Demonstrated ability to work independently with minimal guidance (experience)
  • Proactively manages tasks and priorities across multiple projects (experience)
  • Analyzes and executes work efficiently (experience)
  • Collaborates effectively with cross-functional teams (experience)
  • Thrives in fast-paced, results-driven environments (experience)
  • Embodies EAGER values: ethical, adaptable, gritty, empathetic, remarkable (experience)
  • Inspired by MOVE principles: move fast, obsess about customer value (experience)

Responsibilities

  • Scale Kubernetes-based control plane processing billions of events per day
  • Improve automation mechanisms that react to workload changes
  • Deploy ML systems across the company infrastructure
  • Build software to ensure reliability of back-end systems
  • Collaborate with engineers developing back-end systems
  • Plan for future infrastructure growth and scalability
  • Work with EU production engineering teams during transition
  • Participate in incident management and on-call rotations
  • Monitor system performance and reliability metrics
  • Implement observability tools like Prometheus and Grafana
  • Optimize CI/CD pipelines for faster deployments
  • Troubleshoot and resolve production issues efficiently
  • Develop runbooks and automation for common incidents
  • Conduct capacity planning and performance tuning

Benefits

  • general: Excellent health care including medical, dental, vision coverage
  • general: Wide range of mental health benefits
  • general: Fertility benefits and family planning support
  • general: Disability insurance options
  • general: Life insurance coverage
  • general: Competitive base salary
  • general: Equity in a high-growth AI company
  • general: 401(k) retirement plan with company match
  • general: Unlimited PTO policy
  • general: Hybrid work model with flexibility
  • general: Professional development stipend
  • general: Home office setup allowance
  • general: Weekly team lunches and social events
  • general: Parental leave benefits
  • general: Wellness and gym membership reimbursements

Target Your Resume for "Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!" , Grammarly

Get personalized recommendations to optimize your resume specifically for Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!" , Grammarly

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

site reliability engineer grammarlysre jobs san franciscokubernetes engineer hybriddevops engineer grammarlysre kubernetes awssite reliability engineering careersgrammarly infrastructure jobssre san francisco hybridterraform docker linux jobsml deployment engineersre incident managementsuperhuman sre positionsbackend reliability engineercloud sre gcp azurejava kubernetes developerproduction engineering grammarlysre eu team collaborationai infrastructure jobs sfscalability engineer grammarlydevops hybrid san franciscosre billions events kubernetesEngineering

Answer 10 quick questions to check your fit for Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now! @ Grammarly.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.