Resume and JobRESUME AND JOB
GitLab logo

Senior Site Reliability Engineer, Environment Automation

GitLab

Engineering Jobs

Senior Site Reliability Engineer, Environment Automation

full-timePosted: Dec 18, 2025

Job Description

About this Role

Join GitLab as a Senior Site Reliability Engineer specializing in Environment Automation to empower teams worldwide with seamless DevSecOps innovation.

Our mission is to enable everyone to contribute to and co-create the software powering our world, accelerating human progress through open collaboration.

Dive into the excitement of operating and automating hundreds of GitLab environments, from provisioning to maintenance, at massive scale.

Combine pragmatic operations with elite software engineering to drive automation, slash toil, and fortify platform resilience.

Craft infrastructure that provisions isolated, secure GitLab instances across cloud ecosystems, ensuring consistency and reliability.

Thrive in a high-performance culture where AI amplifies productivity, innovation blooms, and every voice shapes the future.

Lead the charge in eliminating manual operations, delivering a fully managed single-tenant GitLab experience without infrastructure headaches.

Collaborate with industry leaders to solve intricate challenges in distributed systems, scalability, and operational excellence.

Experience the thrill of building tools that orchestrate IaC workflows, deploy microservices on Kubernetes, and enhance observability stacks.

Co-create the future at GitLab, where your expertise in automation will unlock unprecedented efficiency and impact in software delivery.

Locations

  • Americas, Remote, Canada (Remote)

Salary

Salary details available upon request

Estimated Salary Rangemedium confidence

140,000 - 220,000 CAD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • Terraformintermediate
  • Kubernetesintermediate
  • Infrastructure as Code (IaC)intermediate
  • Go programmingintermediate
  • Ruby programmingintermediate
  • Prometheusintermediate
  • ELK stackintermediate
  • Ansibleintermediate
  • Cloud infrastructure (GCP, AWS)intermediate
  • Observability and monitoring toolsintermediate

Required Qualifications

  • Proven ability to operate and troubleshoot production workloads across multiple tenants or environments (experience)
  • Deep understanding of how distributed systems fail at scale and how to build in resilience (experience)
  • Strong hands-on experience with Terraform, including workspace strategies and state management (experience)
  • Skilled at diagnosing deployment failures, interpreting pod logs, and debugging Kubernetes scheduling issues (experience)
  • Ability to read and debug code in Go and/or Ruby for performance and scalability concerns (experience)
  • Experience supporting infrastructure for many customers or environments simultaneously (experience)
  • Able to reason through complex systems and operational challenges with on-call experience (experience)
  • Proven ability to work across teams to solve technical problems while maintaining service commitments (experience)
  • Comfortable managing isolation, scaling, monitoring, and incident response across diverse workloads (experience)
  • Experience leading technical discussions and incident resolution efforts under pressure (experience)

Preferred Qualifications

  • Experience with Ansible and templating tools like Jsonnet (experience)
  • Proficiency with GitLab platform operations and workflows (experience)
  • Background in cloud provider ecosystems like GCP and AWS (experience)
  • Knowledge of IAM, networking, and storage integration (experience)
  • Expertise in Prometheus, ELK, and observability stacks (experience)
  • Familiarity with microservices deployment on Kubernetes at scale (experience)
  • Skills in automated version upgrades and configuration rollouts (experience)
  • Understanding of cloud security best practices implementation (experience)
  • Experience with capacity prediction and bottleneck detection (experience)
  • Contributions to infrastructure tooling through code analysis (experience)

Responsibilities

  • Design and implement automation to provision and manage hundreds of isolated GitLab environments
  • Troubleshoot issues across Kubernetes clusters, cloud services, and GitLab apps to ensure continuity
  • Replace manual workflows with infrastructure-as-code solutions for upgrades and provisioning
  • Build observability systems to detect bottlenecks and predict capacity needs
  • Lead incident response, postmortem efforts, and establish standards to reduce future risk
  • Influence architectural decisions around automation, scalability, and operational excellence
  • Partner with engineering teams to improve platform resilience and production-readiness
  • Manage complex state strategies and workspace configurations for maintainability at scale
  • Deploy and manage microservices on Kubernetes clusters across multiple tenants
  • Champion cloud security best practices in automated infrastructure workflows

Benefits

  • general: Comprehensive benefits to support your health, finances, and well-being
  • general: Flexible Paid Time Off policy
  • general: Team Member Resource Groups for inclusion and belonging
  • general: Equity Compensation and Employee Stock Purchase Plan
  • general: Growth and Development Fund for professional advancement
  • general: Generous Parental leave
  • general: Home office support and equipment
  • general: Bonuses and incentive pay opportunities
  • general: Continuous knowledge exchange and learning resources
  • general: High-performance culture with career acceleration potential

Target Your Resume for "Senior Site Reliability Engineer, Environment Automation" , GitLab

Get personalized recommendations to optimize your resume specifically for Senior Site Reliability Engineer, Environment Automation. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Senior Site Reliability Engineer, Environment Automation" , GitLab

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

Platforms EngineeringTechnologySoftware

Answer 10 quick questions to check your fit for Senior Site Reliability Engineer, Environment Automation @ GitLab.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

GitLab logo

Senior Site Reliability Engineer, Environment Automation

GitLab

Engineering Jobs

Senior Site Reliability Engineer, Environment Automation

full-timePosted: Dec 18, 2025

Job Description

About this Role

Join GitLab as a Senior Site Reliability Engineer specializing in Environment Automation to empower teams worldwide with seamless DevSecOps innovation.

Our mission is to enable everyone to contribute to and co-create the software powering our world, accelerating human progress through open collaboration.

Dive into the excitement of operating and automating hundreds of GitLab environments, from provisioning to maintenance, at massive scale.

Combine pragmatic operations with elite software engineering to drive automation, slash toil, and fortify platform resilience.

Craft infrastructure that provisions isolated, secure GitLab instances across cloud ecosystems, ensuring consistency and reliability.

Thrive in a high-performance culture where AI amplifies productivity, innovation blooms, and every voice shapes the future.

Lead the charge in eliminating manual operations, delivering a fully managed single-tenant GitLab experience without infrastructure headaches.

Collaborate with industry leaders to solve intricate challenges in distributed systems, scalability, and operational excellence.

Experience the thrill of building tools that orchestrate IaC workflows, deploy microservices on Kubernetes, and enhance observability stacks.

Co-create the future at GitLab, where your expertise in automation will unlock unprecedented efficiency and impact in software delivery.

Locations

  • Americas, Remote, Canada (Remote)

Salary

Salary details available upon request

Estimated Salary Rangemedium confidence

140,000 - 220,000 CAD / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • Terraformintermediate
  • Kubernetesintermediate
  • Infrastructure as Code (IaC)intermediate
  • Go programmingintermediate
  • Ruby programmingintermediate
  • Prometheusintermediate
  • ELK stackintermediate
  • Ansibleintermediate
  • Cloud infrastructure (GCP, AWS)intermediate
  • Observability and monitoring toolsintermediate

Required Qualifications

  • Proven ability to operate and troubleshoot production workloads across multiple tenants or environments (experience)
  • Deep understanding of how distributed systems fail at scale and how to build in resilience (experience)
  • Strong hands-on experience with Terraform, including workspace strategies and state management (experience)
  • Skilled at diagnosing deployment failures, interpreting pod logs, and debugging Kubernetes scheduling issues (experience)
  • Ability to read and debug code in Go and/or Ruby for performance and scalability concerns (experience)
  • Experience supporting infrastructure for many customers or environments simultaneously (experience)
  • Able to reason through complex systems and operational challenges with on-call experience (experience)
  • Proven ability to work across teams to solve technical problems while maintaining service commitments (experience)
  • Comfortable managing isolation, scaling, monitoring, and incident response across diverse workloads (experience)
  • Experience leading technical discussions and incident resolution efforts under pressure (experience)

Preferred Qualifications

  • Experience with Ansible and templating tools like Jsonnet (experience)
  • Proficiency with GitLab platform operations and workflows (experience)
  • Background in cloud provider ecosystems like GCP and AWS (experience)
  • Knowledge of IAM, networking, and storage integration (experience)
  • Expertise in Prometheus, ELK, and observability stacks (experience)
  • Familiarity with microservices deployment on Kubernetes at scale (experience)
  • Skills in automated version upgrades and configuration rollouts (experience)
  • Understanding of cloud security best practices implementation (experience)
  • Experience with capacity prediction and bottleneck detection (experience)
  • Contributions to infrastructure tooling through code analysis (experience)

Responsibilities

  • Design and implement automation to provision and manage hundreds of isolated GitLab environments
  • Troubleshoot issues across Kubernetes clusters, cloud services, and GitLab apps to ensure continuity
  • Replace manual workflows with infrastructure-as-code solutions for upgrades and provisioning
  • Build observability systems to detect bottlenecks and predict capacity needs
  • Lead incident response, postmortem efforts, and establish standards to reduce future risk
  • Influence architectural decisions around automation, scalability, and operational excellence
  • Partner with engineering teams to improve platform resilience and production-readiness
  • Manage complex state strategies and workspace configurations for maintainability at scale
  • Deploy and manage microservices on Kubernetes clusters across multiple tenants
  • Champion cloud security best practices in automated infrastructure workflows

Benefits

  • general: Comprehensive benefits to support your health, finances, and well-being
  • general: Flexible Paid Time Off policy
  • general: Team Member Resource Groups for inclusion and belonging
  • general: Equity Compensation and Employee Stock Purchase Plan
  • general: Growth and Development Fund for professional advancement
  • general: Generous Parental leave
  • general: Home office support and equipment
  • general: Bonuses and incentive pay opportunities
  • general: Continuous knowledge exchange and learning resources
  • general: High-performance culture with career acceleration potential

Target Your Resume for "Senior Site Reliability Engineer, Environment Automation" , GitLab

Get personalized recommendations to optimize your resume specifically for Senior Site Reliability Engineer, Environment Automation. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Senior Site Reliability Engineer, Environment Automation" , GitLab

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

Platforms EngineeringTechnologySoftware

Answer 10 quick questions to check your fit for Senior Site Reliability Engineer, Environment Automation @ GitLab.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.