Production Services Site Reliability Engineer

Apple

Full-time
Posted: May 9, 2025

Job Description

The Production Services Site Reliability Engineer (SRE) role resides within the Software Delivery organization, which is at the core of the Apple software release process. This role is responsible for applying SRE practices in maintaining Atlassian services, which are used by software engineers and project managers to develop Apple software for delivery to customers around the world.

The Atlassian Services team drives reliability and performance engineering of data center applications, instruments observability of services, responds to incident alerts, and reports on SLI/SLO metrics for visibility across the organization. This SRE role is essential in maintaining the production systems of Bitbucket, Confluence, and Jira that are used to deliver state-of-the-art operating systems, applications, and firmware to Apple customers.

Locations

  • San Diego, California, United States 92128

Salary

Estimated Salary Range (medium confidence)

2,500,000 - 4,500,000 INR / yearly

Source: AI estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • SRE practices (intermediate)
  • maintaining Atlassian services (intermediate)
  • reliability engineering (intermediate)
  • performance engineering (intermediate)
  • data center applications (intermediate)
  • observability of services (intermediate)
  • incident response (intermediate)
  • SLI/SLO metrics (intermediate)
  • configuration and monitoring (intermediate)
  • on-prem dependencies (intermediate)
  • cloud-based dependencies (intermediate)
  • automate continuous integration (CI) (intermediate)
  • automate continuous delivery (CD) (intermediate)
  • maintain staging environments (intermediate)
  • maintain production environments (intermediate)
  • maximizing uptimes (intermediate)
  • implement observability (intermediate)
  • monitoring (intermediate)
  • alerting (intermediate)
  • metrics reporting (intermediate)
  • generate reports (intermediate)
  • service metrics (intermediate)
  • performance metrics (intermediate)
  • availability metrics (intermediate)
  • reliability metrics (intermediate)
  • change control management (intermediate)
  • proactively communicate (intermediate)
  • communicate status (intermediate)
  • follow through on time-sensitive tasks (intermediate)
  • ask for clarification (intermediate)
  • increase awareness (intermediate)
  • explore solutions (intermediate)
  • evaluate risk vs reward (intermediate)
  • execute best approach (intermediate)
  • communicate asynchronously (intermediate)
  • global team communication (intermediate)
  • document new processes (intermediate)
  • update documentation (intermediate)
  • learn across technology stacks (intermediate)

Required Qualifications

  • B.S. in Computer Science or related work experience
  • Passion for building reliable, scalable, and performant distributed systems
  • Understanding of distributed systems with respect to application, networking, and security
  • SRE or DevOps experience in managing customer-facing systems in a 24/7 environment
  • Experience in managing and monitoring fleets of *nix systems or container platforms

Preferred Qualifications

  • Excellent judgment and integrity with ability to make timely and sound decisions
  • Ability to anticipate the needs of others and adapt to changing conditions

Responsibilities

As a Production Services Site Reliability Engineer, responsibilities include:

  • Configure and monitor on-prem and cloud-based dependencies
  • Automate continuous integration (CI) and continuous delivery (CD) pipelines
  • Maintain staging and production environments with the goal of maximizing uptime
  • Implement observability of systems for monitoring, alerting, and metrics reporting
  • Generate reports regarding service metrics on performance, availability, and reliability
  • Champion practices regarding change control management and incident response

A successful Production Services Site Reliability Engineer will be expected to:

  • Proactively communicate status of Atlassian services to stakeholders and follow through on time-sensitive tasks
  • Demonstrate willingness to ask for clarification and increase awareness of the larger context
  • Explore solutions to problems, evaluate risk vs reward, then execute the best approach
  • Communicate asynchronously with a global team across multiple timezones
  • Document new processes or update existing documentation pages
  • Be eager and curious to learn across multiple technology stacks

Tags & Categories

Hardware
