RESUME AND JOB

Software Engineer (Technical Operations & Site Reliability Engineer)

Apple

Full-time

Posted: Sep 11, 2025

Job Description

At Apple, customer experience is at the forefront of everything we do. The Apple Customer Systems Operations team is looking for a highly skilled and motivated Software Engineer (Technical Operations & Site Reliability Engineer) to join our Operations team. The team is responsible for maintaining the reliability, availability, and performance of business-critical, globally distributed systems. If you have the desire and motivation to design and develop automation solutions that streamline system sustenance, monitoring, and operational workflows, while collaborating closely with support, engineering and business operations teams, this role is for you. Ideal candidates will combine strong software engineering skills with a passion for operational excellence, and thrive in a fast-paced, change-driven environment focused on continuous improvement and flawless delivery.

Locations

Cork, County Cork, Ireland

Salary

Estimated Salary Range (medium confidence)

2,500,000 - 5,000,000 INR / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

software engineering skills (intermediate)
design and develop automation solutions (intermediate)
streamline system sustenance (intermediate)
monitoring (intermediate)
operational workflows (intermediate)
collaborating with teams (intermediate)
operational excellence (intermediate)
manage large-scale production outages (intermediate)
leading incident response (intermediate)
improving efficiency (intermediate)
design, build, and maintain automation solutions (intermediate)
streamline the monitoring (intermediate)
sustenance and management of large-scale distributed systems (intermediate)
develop tools and software (intermediate)
proficiency in Java/JEE (intermediate)
proficiency in REST (intermediate)
proficiency in Swift (intermediate)
proficiency in Objective C (intermediate)
proficiency in Python (intermediate)
proficiency in Go (intermediate)
proficiency in Bash (intermediate)
automate repetitive operational tasks (intermediate)
reduce manual intervention (intermediate)
improve system reliability (intermediate)
utilise AI & LLM models (intermediate)
plan and execute system health monitoring (intermediate)
incident response and communication (intermediate)
drive operational metrics and KPI identification (intermediate)
partner with multi-functional teams (intermediate)
improve reliability, efficiency, stability and processes (intermediate)
self-directed problem-solvers (intermediate)
handle multiple simultaneous competing priorities (intermediate)
deliver solutions in a timely manner (intermediate)
create and maintain documentation (intermediate)
reflecting architecture (intermediate)
infra configuration (intermediate)
procedures (intermediate)
write status and incident reports (intermediate)
write training material (intermediate)
train users in complex topics (intermediate)
lead a team of engineers (intermediate)
guide work towards operations excellence (intermediate)
gaining efficiency (intermediate)
build a culture (intermediate)
cultivating strong in-region relationships (intermediate)
getting results for business partners (intermediate)
ensuring informed about incidents and problems (intermediate)

Required Qualifications

Experience in using AI and Large Language Models (LLMs) to enhance operational efficiency through tasks such as model training, optimisation (including areas like Model Context Protocol or similar methods), and designing effective model utilities.
Understanding of standard networking protocols and components such as HTTP, DNS, TCP/IP, ICMP, the OSI Model, Subnetting and Load Balancing.
Experience in Java, JEE, REST, Swift/Objective C, database schema design and data access technologies.
Experience in driving operations teams for large-scale, mission-critical applications working in 24x7 operations across multiple locations and geographies.
Experience in interpreting data from systems like Hubble, ExtraHop, Splunk and other monitoring tools, along with hands-on experience of production monitoring systems, log analysis, troubleshooting and support dashboards, and proficiency in scripting languages and automation tools.
Excellent interpersonal skills. Proactive, with a strong sense of personal ownership.
Bachelor’s degree in Engineering or equivalent.

Preferred Qualifications

Experience in strategising and achieving operational excellence in global distributed systems.
Fundamental understanding of distributed systems, including microservices, messaging brokers and versioning.
Understanding of the Linux Operating System, including Kernel, Memory, Process, Threads, Static / Shared Libraries, IPC and Signals.
Excellent organisational and documentation skills.

Worried that you don’t quite tick all the above boxes? If you are excited about this role but your experience doesn’t align exactly with every part of the job description, we encourage you to apply anyway. You may very well be the right candidate.

Responsibilities

Our Production Operations team:
- Manage large-scale production outages, leading incident response and improving efficiency.
- Design, build, and maintain automation solutions to streamline the monitoring, sustenance, and management of large-scale distributed systems.
- Develop tools and software (using Java/JEE, REST, Swift/Objective C, Python, Go, or Bash) to automate repetitive operational tasks, reduce manual intervention, and improve system reliability. We utilise AI & LLM models to gain Operations Excellence in application support.
- Plan and execute actionable system health monitoring, incident response and communication across critical global applications. We drive operational metrics and KPI identification and alignment.
- Partner with multi-functional teams to improve reliability, efficiency, stability and processes.
- Are self-directed problem-solvers exhibiting deftness to handle multiple simultaneous competing priorities and deliver solutions in a timely manner.
- Create and maintain accurate, up-to-date documentation reflecting architecture, infra configuration and procedures. We write status and incident reports. We write training material and train users in complex topics.
- Lead a team of highly skilled engineers across the globe and guide their work towards operations excellence, gaining efficiency.
- Build a culture where the regional members are responsible for cultivating strong in-region relationships and getting results for our business partners, ensuring they remain informed about significant incidents and problems.

Tags & Categories

Hardware

