Resume and JobRESUME AND JOB
Apple logo

Software Engineer (Technical Operations & Site Reliability Engineer)

Apple

Software and Technology Jobs

Software Engineer (Technical Operations & Site Reliability Engineer)

full-timePosted: Sep 11, 2025

Job Description

At Apple, customer experience is at the forefront of everything we do. Apple Customer Systems Operations team is looking for a highly skilled and motivated Software Engineer (Technical Operations & Site Reliability Engineer) to join our Operations team. The team is responsible for maintaining the reliability, availability, and performance of business-critical, globally distributed systems. If you have the desire and motivation to design and develop automation solutions to streamline system sustenance, monitoring, and operational workflows, while collaborating closely with support, engineering and business operations teams, this profile is for you. Ideal candidates will combine strong software engineering skills with a passion for operational excellence, and thrive in a fast-paced, change-driven environment focused on continuous improvement and flawless delivery. Our Production Operations team: -Manage large-scale production outages, leading incident response and improving efficiency. -Design, build, and maintain automation solutions to streamline the monitoring, sustenance, and management of large-scale distributed systems. -Develop tools and software (using Java/JEE, REST, Swift/Objective C, Python, Go, or Bash) to automate repetitive operational tasks, reduce manual intervention, and improve system reliability. We utilise AI & LLM models to gain Operations Excellence in application support. -Plan and execute actionable system health monitoring, incident response and communication across critical global applications. We drive operational metrics and KPI identification and alignment. -Partner with multi-functional teams to improve reliability, efficiency, stability and processes. -Are self-directed problem-solvers exhibiting deftness to handle multiple simultaneous competing priorities and deliver solutions in a timely manner. -Create and maintain accurate, up-to-date documentation reflecting architecture, infra configuration and procedures. We write status and incident reports. We write training material and train users in complex topics. -Lead a team of highly skilled engineers across the globe and guide their work towards operations excellence, gaining efficiency. -Builds a culture where the regional members are responsible for cultivating strong in-region relationships and getting results for our business partners ensuring they remain informed about significant incidents and problems.

Locations

  • Cork, County Cork, Ireland

Salary

Estimated Salary Rangemedium confidence

2,500,000 - 5,000,000 INR / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • software engineering skillsintermediate
  • design and develop automation solutionsintermediate
  • streamline system sustenanceintermediate
  • monitoringintermediate
  • operational workflowsintermediate
  • collaborating with teamsintermediate
  • operational excellenceintermediate
  • manage large-scale production outagesintermediate
  • leading incident responseintermediate
  • improving efficiencyintermediate
  • design, build, and maintain automation solutionsintermediate
  • streamline the monitoringintermediate
  • sustenance and management of large-scale distributed systemsintermediate
  • develop tools and softwareintermediate
  • proficiency in Java/JEEintermediate
  • proficiency in RESTintermediate
  • proficiency in Swiftintermediate
  • proficiency in Objective Cintermediate
  • proficiency in Pythonintermediate
  • proficiency in Gointermediate
  • proficiency in Bashintermediate
  • automate repetitive operational tasksintermediate
  • reduce manual interventionintermediate
  • improve system reliabilityintermediate
  • utilise AI & LLM modelsintermediate
  • plan and execute system health monitoringintermediate
  • incident response and communicationintermediate
  • drive operational metrics and KPI identificationintermediate
  • partner with multi-functional teamsintermediate
  • improve reliability, efficiency, stability and processesintermediate
  • self-directed problem-solversintermediate
  • handle multiple simultaneous competing prioritiesintermediate
  • deliver solutions in a timely mannerintermediate
  • create and maintain documentationintermediate
  • reflecting architectureintermediate
  • infra configurationintermediate
  • proceduresintermediate
  • write status and incident reportsintermediate
  • write training materialintermediate
  • train users in complex topicsintermediate
  • lead a team of engineersintermediate
  • guide work towards operations excellenceintermediate
  • gaining efficiencyintermediate
  • build a cultureintermediate
  • cultivating strong in-region relationshipsintermediate
  • getting results for business partnersintermediate
  • ensuring informed about incidents and problemsintermediate

Required Qualifications

  • Experience in using AI and Large Language Models (LLMs) to enhance operational efficiency through tasks such as model training, optimisation (including areas like Model Context Protocol or similar methods), and designing effective model utilities. (experience)
  • Understanding of standard networking protocols and components such as: HTTP, DNS, TCP/IP, ICMP, the OSI Model, Subnetting and Load Balancing. (experience)
  • Experience in Java, JEE, REST, Swift/Objective C, database schema design and data access technologies. (experience)
  • Experience in driving operations teams for large scale mission critical applications working in a 24x7 operations across multiple locations and geographies. (experience)
  • Experience in interpreting data from systems like Hubble, ExtraHop, Splunk and other monitoring tools along with hands on experience of production monitoring systems, log analysis, troubleshooting, support dashboards and proficiency in scripting languages and automation tools. (experience)
  • Excellent interpersonal skills. Proactive, with a strong sense of personal ownership. (experience)
  • Bachelor’s degree in Engineering or equivalent. (degree in engineering or equivalent)

Preferred Qualifications

  • Experience in strategising and achieving operational excellence in global distributed systems. (experience)
  • Fundamental understanding of distributed systems including: Micro services, Messaging Brokers and Versioning. (experience)
  • Understanding of the Linux Operating System, including Kernel, Memory, Process, Threads, Static / Shared Libraries, IPC and Signals. (experience)
  • Excellent organisational and documentation skills. (experience)
  • Worried that you don’t quite tick all the above boxes? If you are excited about this role but your experience doesn’t align exactly with every part of the job description, we encourage you to apply anyway. You may very well be the right candidate. (experience)

Responsibilities

  • Our Production Operations team:
  • -Manage large-scale production outages, leading incident response and improving efficiency.
  • -Design, build, and maintain automation solutions to streamline the monitoring, sustenance, and management of large-scale distributed systems.
  • -Develop tools and software (using Java/JEE, REST, Swift/Objective C, Python, Go, or Bash) to automate repetitive operational tasks, reduce manual intervention, and improve system reliability. We utilise AI & LLM models to gain Operations Excellence in application support.
  • -Plan and execute actionable system health monitoring, incident response and communication across critical global applications. We drive operational metrics and KPI identification and alignment.
  • -Partner with multi-functional teams to improve reliability, efficiency, stability and processes.
  • -Are self-directed problem-solvers exhibiting deftness to handle multiple simultaneous competing priorities and deliver solutions in a timely manner.
  • -Create and maintain accurate, up-to-date documentation reflecting architecture, infra configuration and procedures. We write status and incident reports. We write training material and train users in complex topics.
  • -Lead a team of highly skilled engineers across the globe and guide their work towards operations excellence, gaining efficiency.
  • -Builds a culture where the regional members are responsible for cultivating strong in-region relationships and getting results for our business partners ensuring they remain informed about significant incidents and problems.

Target Your Resume for "Software Engineer (Technical Operations & Site Reliability Engineer)" , Apple

Get personalized recommendations to optimize your resume specifically for Software Engineer (Technical Operations & Site Reliability Engineer). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Software Engineer (Technical Operations & Site Reliability Engineer)" , Apple

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

Hardware

Answer 10 quick questions to check your fit for Software Engineer (Technical Operations & Site Reliability Engineer) @ Apple.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

Apple logo

Software Engineer (Technical Operations & Site Reliability Engineer)

Apple

Software and Technology Jobs

Software Engineer (Technical Operations & Site Reliability Engineer)

full-timePosted: Sep 11, 2025

Job Description

At Apple, customer experience is at the forefront of everything we do. Apple Customer Systems Operations team is looking for a highly skilled and motivated Software Engineer (Technical Operations & Site Reliability Engineer) to join our Operations team. The team is responsible for maintaining the reliability, availability, and performance of business-critical, globally distributed systems. If you have the desire and motivation to design and develop automation solutions to streamline system sustenance, monitoring, and operational workflows, while collaborating closely with support, engineering and business operations teams, this profile is for you. Ideal candidates will combine strong software engineering skills with a passion for operational excellence, and thrive in a fast-paced, change-driven environment focused on continuous improvement and flawless delivery. Our Production Operations team: -Manage large-scale production outages, leading incident response and improving efficiency. -Design, build, and maintain automation solutions to streamline the monitoring, sustenance, and management of large-scale distributed systems. -Develop tools and software (using Java/JEE, REST, Swift/Objective C, Python, Go, or Bash) to automate repetitive operational tasks, reduce manual intervention, and improve system reliability. We utilise AI & LLM models to gain Operations Excellence in application support. -Plan and execute actionable system health monitoring, incident response and communication across critical global applications. We drive operational metrics and KPI identification and alignment. -Partner with multi-functional teams to improve reliability, efficiency, stability and processes. -Are self-directed problem-solvers exhibiting deftness to handle multiple simultaneous competing priorities and deliver solutions in a timely manner. -Create and maintain accurate, up-to-date documentation reflecting architecture, infra configuration and procedures. We write status and incident reports. We write training material and train users in complex topics. -Lead a team of highly skilled engineers across the globe and guide their work towards operations excellence, gaining efficiency. -Builds a culture where the regional members are responsible for cultivating strong in-region relationships and getting results for our business partners ensuring they remain informed about significant incidents and problems.

Locations

  • Cork, County Cork, Ireland

Salary

Estimated Salary Rangemedium confidence

2,500,000 - 5,000,000 INR / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • software engineering skillsintermediate
  • design and develop automation solutionsintermediate
  • streamline system sustenanceintermediate
  • monitoringintermediate
  • operational workflowsintermediate
  • collaborating with teamsintermediate
  • operational excellenceintermediate
  • manage large-scale production outagesintermediate
  • leading incident responseintermediate
  • improving efficiencyintermediate
  • design, build, and maintain automation solutionsintermediate
  • streamline the monitoringintermediate
  • sustenance and management of large-scale distributed systemsintermediate
  • develop tools and softwareintermediate
  • proficiency in Java/JEEintermediate
  • proficiency in RESTintermediate
  • proficiency in Swiftintermediate
  • proficiency in Objective Cintermediate
  • proficiency in Pythonintermediate
  • proficiency in Gointermediate
  • proficiency in Bashintermediate
  • automate repetitive operational tasksintermediate
  • reduce manual interventionintermediate
  • improve system reliabilityintermediate
  • utilise AI & LLM modelsintermediate
  • plan and execute system health monitoringintermediate
  • incident response and communicationintermediate
  • drive operational metrics and KPI identificationintermediate
  • partner with multi-functional teamsintermediate
  • improve reliability, efficiency, stability and processesintermediate
  • self-directed problem-solversintermediate
  • handle multiple simultaneous competing prioritiesintermediate
  • deliver solutions in a timely mannerintermediate
  • create and maintain documentationintermediate
  • reflecting architectureintermediate
  • infra configurationintermediate
  • proceduresintermediate
  • write status and incident reportsintermediate
  • write training materialintermediate
  • train users in complex topicsintermediate
  • lead a team of engineersintermediate
  • guide work towards operations excellenceintermediate
  • gaining efficiencyintermediate
  • build a cultureintermediate
  • cultivating strong in-region relationshipsintermediate
  • getting results for business partnersintermediate
  • ensuring informed about incidents and problemsintermediate

Required Qualifications

  • Experience in using AI and Large Language Models (LLMs) to enhance operational efficiency through tasks such as model training, optimisation (including areas like Model Context Protocol or similar methods), and designing effective model utilities. (experience)
  • Understanding of standard networking protocols and components such as: HTTP, DNS, TCP/IP, ICMP, the OSI Model, Subnetting and Load Balancing. (experience)
  • Experience in Java, JEE, REST, Swift/Objective C, database schema design and data access technologies. (experience)
  • Experience in driving operations teams for large scale mission critical applications working in a 24x7 operations across multiple locations and geographies. (experience)
  • Experience in interpreting data from systems like Hubble, ExtraHop, Splunk and other monitoring tools along with hands on experience of production monitoring systems, log analysis, troubleshooting, support dashboards and proficiency in scripting languages and automation tools. (experience)
  • Excellent interpersonal skills. Proactive, with a strong sense of personal ownership. (experience)
  • Bachelor’s degree in Engineering or equivalent. (degree in engineering or equivalent)

Preferred Qualifications

  • Experience in strategising and achieving operational excellence in global distributed systems. (experience)
  • Fundamental understanding of distributed systems including: Micro services, Messaging Brokers and Versioning. (experience)
  • Understanding of the Linux Operating System, including Kernel, Memory, Process, Threads, Static / Shared Libraries, IPC and Signals. (experience)
  • Excellent organisational and documentation skills. (experience)
  • Worried that you don’t quite tick all the above boxes? If you are excited about this role but your experience doesn’t align exactly with every part of the job description, we encourage you to apply anyway. You may very well be the right candidate. (experience)

Responsibilities

  • Our Production Operations team:
  • -Manage large-scale production outages, leading incident response and improving efficiency.
  • -Design, build, and maintain automation solutions to streamline the monitoring, sustenance, and management of large-scale distributed systems.
  • -Develop tools and software (using Java/JEE, REST, Swift/Objective C, Python, Go, or Bash) to automate repetitive operational tasks, reduce manual intervention, and improve system reliability. We utilise AI & LLM models to gain Operations Excellence in application support.
  • -Plan and execute actionable system health monitoring, incident response and communication across critical global applications. We drive operational metrics and KPI identification and alignment.
  • -Partner with multi-functional teams to improve reliability, efficiency, stability and processes.
  • -Are self-directed problem-solvers exhibiting deftness to handle multiple simultaneous competing priorities and deliver solutions in a timely manner.
  • -Create and maintain accurate, up-to-date documentation reflecting architecture, infra configuration and procedures. We write status and incident reports. We write training material and train users in complex topics.
  • -Lead a team of highly skilled engineers across the globe and guide their work towards operations excellence, gaining efficiency.
  • -Builds a culture where the regional members are responsible for cultivating strong in-region relationships and getting results for our business partners ensuring they remain informed about significant incidents and problems.

Target Your Resume for "Software Engineer (Technical Operations & Site Reliability Engineer)" , Apple

Get personalized recommendations to optimize your resume specifically for Software Engineer (Technical Operations & Site Reliability Engineer). Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Software Engineer (Technical Operations & Site Reliability Engineer)" , Apple

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

Hardware

Answer 10 quick questions to check your fit for Software Engineer (Technical Operations & Site Reliability Engineer) @ Apple.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.