Site Reliability Engineer Careers at Grammarly - San Francisco, CA | Apply Now!

Grammarly

Full-time | Posted: Jan 16, 2026

Job Description

Site Reliability Engineer (SRE) at Grammarly - San Francisco, CA (Hybrid)

Role Overview

Grammarly, now proudly part of Superhuman—the leading AI productivity platform—is seeking a talented Site Reliability Engineer (SRE) to join our infrastructure team in San Francisco, CA. This hybrid role offers the perfect balance of focused remote work and in-person collaboration to build trust, drive innovation, and cultivate a thriving team culture.

Superhuman empowers over 40 million people, 50,000 organizations, and 3,000 educational institutions worldwide with AI tools like Grammarly's writing assistance, Coda's collaborative workspaces, Mail's inbox management, and Go's proactive AI assistant. As an SRE, you'll play a pivotal role in scaling our Kubernetes-based systems that process billions of events daily, ensuring rock-solid reliability for our global user base.

We're transitioning from a 'you build it, you own it' model, partnering with our EU production engineering teams to implement world-class SRE practices. If you thrive in fast-paced environments, love automating away toil, and want to impact products used by millions, this is your opportunity to shine at Grammarly.

Our engineers have unprecedented freedom to innovate, influence the product roadmap, and tackle complex challenges in AI, ML deployment, and massive-scale infrastructure. Dive into our technical blog to hear directly from the team.

Key Responsibilities

As a Site Reliability Engineer at Grammarly, you'll wear multiple hats—builder, operator, and innovator. Here's what your day-to-day will look like:

Scale our Kubernetes control plane: Handle billions of events per day with zero downtime.
Enhance automation: Build reactive systems that adapt to dynamic workloads in real time.
Deploy ML models company-wide: Ensure seamless integration and high availability for AI features.
Collaborate cross-functionally: Partner with developers to design reliable back-end systems from day one.
Incident leadership: Lead post-mortems, reduce MTTR, and prevent recurrence through automation.
Infrastructure planning: Forecast growth and architect scalable solutions for the future.
Observability mastery: Implement Prometheus, Grafana, and tracing for full-stack visibility.
CI/CD optimization: Streamline pipelines using Terraform, Docker, and Kubernetes.
On-call excellence: Participate in rotations with generous stipends and comprehensive support.
Runbook development: Create golden paths for common issues and knowledge sharing.
Performance tuning: Profile systems, eliminate bottlenecks, and boost efficiency.
Security & compliance: Harden infrastructure against threats while maintaining agility.
Mentorship: Guide junior engineers and contribute to SRE best practices.
Innovation: Experiment with cutting-edge tools to push reliability boundaries.

Expect to work on greenfield projects alongside mature production systems, balancing immediate impact with long-term strategy.

Qualifications

We're looking for battle-tested SREs who embody our EAGER values (Ethical, Adaptable, Gritty, Empathetic, Remarkable) and MOVE principles (Move fast, Obsess about customer value, Value impact, Embrace disagreement):

5+ years as SRE, DevOps, or Production Engineer
Deep experience with incident management and blameless post-mortems
Expertise in Docker, Linux, and Terraform IaC
Hands-on with AWS, Azure, or GCP (multi-cloud experience a plus)
Kubernetes and Java proficiency preferred
Proven independence: minimal guidance, maximum ownership
Cross-functional collaboration wizard
Thrives in ambiguity and high-velocity environments

No degree required—demonstrate your skills through experience and passion for reliability.

Salary & Benefits

Competitive Compensation: $180,000–$250,000 base + equity + bonuses, depending on experience. Total comp can exceed $350K for top performers.

World-Class Benefits:

Comprehensive medical, dental, vision, mental health, fertility coverage
Disability & life insurance
401(k) with match
Unlimited PTO + holidays
Hybrid SF model + remote flexibility
Professional growth stipend
Home office setup
Team events, wellness reimbursements
Generous parental leave

Why Join Grammarly?

Grammarly isn't just another company—it's the AI backbone for how the world communicates. Join us to:

Work on products with 40M+ users
Innovate at the intersection of AI and infrastructure
Grow with a values-driven culture (read our values)
Enjoy SF's tech ecosystem + hybrid perks

Superhuman's mission: unlock superhuman potential. Your reliability engineering will make it possible.

How to Apply

Ready to build the future of AI reliability? Submit your resume and a brief note on your favorite SRE project. We review applications weekly and respond within 48 hours. Let's chat!

No agencies, please. Grammarly is an equal opportunity employer.

Locations

San Francisco, California, United States

Salary

Estimated Salary Range (high confidence)

189,000 - 275,000 USD per year

Source: AI-estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

Site Reliability Engineering (intermediate)
Kubernetes (intermediate)
Docker (intermediate)
Terraform (intermediate)
Linux Administration (intermediate)
AWS (intermediate)
Azure (intermediate)
GCP (intermediate)
Java Programming (intermediate)
Incident Management (intermediate)
DevOps Practices (intermediate)
CI/CD Pipelines (intermediate)
Monitoring Tools (intermediate)
Prometheus (intermediate)
Grafana (intermediate)
ELK Stack (intermediate)
Infrastructure as Code (intermediate)
Microservices Architecture (intermediate)
Load Balancing (intermediate)
Automation Scripting (intermediate)

Required Qualifications

5+ years of relevant experience as an SRE or DevOps engineer
Proven experience participating in incident management processes
Strong familiarity with Docker containerization
Expertise in Linux system administration
Hands-on experience with Terraform for infrastructure as code
Experience using AWS, Azure, or GCP cloud platforms
Java programming skills preferred
Kubernetes orchestration skills preferred
Demonstrated ability to work independently with minimal guidance
Proactively manages tasks and priorities across multiple projects
Analyzes and executes work efficiently
Collaborates effectively with cross-functional teams
Thrives in fast-paced, results-driven environments
Embodies EAGER values: ethical, adaptable, gritty, empathetic, remarkable
Inspired by MOVE principles: move fast, obsess about customer value

Responsibilities

Scale Kubernetes-based control plane processing billions of events per day
Improve automation mechanisms that react to workload changes
Deploy ML systems across the company infrastructure
Build software to ensure reliability of back-end systems
Collaborate with engineers developing back-end systems
Plan for future infrastructure growth and scalability
Work with EU production engineering teams during transition
Participate in incident management and on-call rotations
Monitor system performance and reliability metrics
Implement observability tools like Prometheus and Grafana
Optimize CI/CD pipelines for faster deployments
Troubleshoot and resolve production issues efficiently
Develop runbooks and automation for common incidents
Conduct capacity planning and performance tuning

Benefits

Excellent health care including medical, dental, and vision coverage
Wide range of mental health benefits
Fertility benefits and family planning support
Disability insurance options
Life insurance coverage
Competitive base salary
Equity in a high-growth AI company
401(k) retirement plan with company match
Unlimited PTO policy
Hybrid work model with flexibility
Professional development stipend
Home office setup allowance
Weekly team lunches and social events
Parental leave benefits
Wellness and gym membership reimbursements

Tags & Categories

Tags: site reliability engineer grammarly, sre jobs san francisco, kubernetes engineer hybrid, devops engineer grammarly, sre kubernetes aws, site reliability engineering careers, grammarly infrastructure jobs, sre san francisco hybrid, terraform docker linux jobs, ml deployment engineer, sre incident management, superhuman sre positions, backend reliability engineer, cloud sre gcp azure, java kubernetes developer, production engineering grammarly, sre eu team collaboration, ai infrastructure jobs sf, scalability engineer grammarly, devops hybrid san francisco, sre billions events kubernetes

Category: Engineering

