Resume and JobRESUME AND JOB
BAE Systems logo

Site Reliability Engineer – NS London

BAE Systems

Software and Technology Jobs

Site Reliability Engineer – NS London

full-timePosted: Jan 7, 2026

Job Description

 

Location(s): [[mfield3]] 

 

BAE Systems Digital Intelligence is home to 4,500 digital, cyber and intelligence experts. We work collaboratively across 10 countries to collect, connect and understand complex data, so that governments, nation states, armed forces and commercial businesses can unlock digital advantage in the most demanding environments.

Site Reliability Engineering is a rapidly growing concept in industry, with a remit to drive the quality, reliability and performance of essential systems. As a Site Reliability Engineer you'll be part of a team in BAE Systems at the forefront of this, delivering these benefits to a key national security customer. We are in the process of building our team and tools, and with your help will create a culture of continual improvement to revolutionise the way our customer’s systems are built and maintained. This role blends operational product support with software engineering to create applications to understand the overall health of our systems. The SRE team sits within a wider programme at the core of the customer mission.

 

The role holder:

As an SRE, fundamentally you will be doing work that has historically been done by an operations team, but using software and systems engineering expertise to substitute automation for human labour, with the objective of limiting traditional manual operations work (incident tickets, on-call etc.) to no more than half of the SRE team's time (and aiming for considerably less). You will have an enthusiasm to learn and experiment, to develop tools to understand application health and improve their reliability to support the customer mission.
 

Role accountabilities include:
    Supporting and maintaining essential service that support core mission applications, proactively enhancing their availability, performance and stability.
    Being part of the 24/7 on call rota, supporting critical production systems out of business hours, for which additional on call allowances and overtime benefits will be paid.
    Finding innovative solutions to problems rather than undertaking repetitive work, automating everything you can. You will work alongside development teams, advising them of good practice in how to design and build systems, learning from what you know works well. 
    You will design and deploy monitoring products, creating bespoke tools where required, to provide comprehensive and intelligent observations to meet the customer requirements and demonstrate the improvements the team are making on a daily basis. You will be well versed in the relationship between software and infrastructure, understanding the characteristics of systems that enable them to be scalable and resilient to failure, and how to get the best out of the infrastructure they are deployed to.
    Participating in the wider DevOps/SRE community within the organisation.

 

Competancies

 

    It is desirable for you to have experience in the areas below. However more valued for this role is that you have excitement and enthusiasm to learn new technologies, and to deal with hard problems. Training, knowledge sharing and on the job development will enable you to plug any knowledge gaps.

o    Software development in web technologies and object oriented programming
o    Database technologies such as Oracle SQL, Mongo, Postgres
o    Know your way around Linux and Windows command lines, e.g. Bash and PowerShell
o    Monitoring large systems using technologies such as Grafana, Prometheus, ELK, Splunk
o    Experience of working in Agile teams, and the tooling that supports it, e.g. Atlassian
o    Diagnosing and troubleshooting application issues resulting in service outages
o    Troubleshooting skills across different levels of the stack
o    Understanding of ITIL
o    Micro-services architectures, Docker and container platforms such as Openshift, Kubernetes

  Awareness and insight into technology trends to adopt new cutting edge tools

 

Security Clearance

Due to the nature of our work, successful candidates for this role will be required to hold an active eDV before applying for this opportunity.

Life at BAE Systems Digital Intelligence 

We are embracing Hybrid Working. This means you and your colleagues may be working in different locations, such as from home, another BAE Systems office or client site, some or all of the time, and work might be going on at different times of the day.

By embracing technology, we can interact, collaborate and create together, even when we’re working remotely from one another. Hybrid Working allows for increased flexibility in when and where we work, helping us to balance our work and personal life more effectively, and enhance well-being.

Diversity and inclusion are integral to the success of BAE Systems Digital Intelligence. We are proud to have an organisational culture where employees with varying perspectives, skills, life experiences and backgrounds – the best and brightest minds – can work together to achieve excellence and realise individual and organisational potential. 

Division overview: Capabilities

At BAE Systems Digital Intelligence, we pride ourselves in being a leader in the cyber defence industry, and Capabilities is the engine that keeps the business moving forward. It is the largest area of Digital Intelligence, containing our Engineering, Consulting and Project Management teams that design and implement the defence solutions and digital transformation projects that make us a globally recognised brand in both the public and private sector.

As a member of the Capabilities team, you will be creating and managing the solutions that earn us our place in an ever changing digital world. We all have a role to play in defending our clients, and this is yours. 

Locations

  • London, United Kingdom

Salary

Estimated Salary Rangemedium confidence

60,000 - 80,000 GBP / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • Software and systems engineering expertiseintermediate
  • Automation skills to substitute human labourintermediate
  • Enthusiasm to learn and experimentintermediate
  • Ability to understand application health and improve reliabilityintermediate
  • Knowledge of the relationship between software and infrastructureintermediate
  • Understanding of scalable and resilient system characteristicsintermediate
  • Ability to get the best out of the infrastructureintermediate
  • Innovative problem-solving skillsintermediate
  • Collaboration and communication skillsintermediate

Required Qualifications

  • Software development in web technologies and object oriented programming (experience)
  • Database technologies such as Oracle SQL, Mongo, Postgres (experience)
  • Experience with Linux and Windows command lines, e.g. Bash and PowerShell (experience)
  • Monitoring large systems using technologies such as Grafana, Prometheus, ELK, Splunk (experience)
  • Experience of working in Agile teams, and the tooling that supports it, e.g. Atlassian (experience)
  • Diagnosing and troubleshooting application issues resulting in service outages (experience)
  • Troubleshooting skills across different levels of the stack (experience)
  • Understanding of ITIL (experience)
  • Micro-services architectures, Docker and container platforms such as Openshift, Kubernetes (experience)

Preferred Qualifications

  • Awareness and insight into technology trends to adopt new cutting edge tools (experience)
  • Excitement and enthusiasm to learn new technologies and deal with hard problems (experience)

Responsibilities

  • Supporting and maintaining essential services that support core mission applications, proactively enhancing their availability, performance and stability
  • Being part of the 24/7 on call rota, supporting critical production systems out of business hours
  • Finding innovative solutions to problems rather than undertaking repetitive work, automating everything possible
  • Working alongside development teams, advising them on good practice in system design and build
  • Designing and deploying monitoring products, creating bespoke tools where required
  • Participating in the wider DevOps/SRE community within the organisation

Benefits

  • general: Additional on call allowances and overtime benefits
  • general: Hybrid Working for increased flexibility and enhanced well-being

Target Your Resume for "Site Reliability Engineer – NS London" , BAE Systems

Get personalized recommendations to optimize your resume specifically for Site Reliability Engineer – NS London. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Site Reliability Engineer – NS London" , BAE Systems

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

Digital IntelligenceEngineeringExperienced professionalsDigital IntelligenceEngineeringExperienced professionals

Answer 10 quick questions to check your fit for Site Reliability Engineer – NS London @ BAE Systems.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.

BAE Systems logo

Site Reliability Engineer – NS London

BAE Systems

Software and Technology Jobs

Site Reliability Engineer – NS London

full-timePosted: Jan 7, 2026

Job Description

 

Location(s): [[mfield3]] 

 

BAE Systems Digital Intelligence is home to 4,500 digital, cyber and intelligence experts. We work collaboratively across 10 countries to collect, connect and understand complex data, so that governments, nation states, armed forces and commercial businesses can unlock digital advantage in the most demanding environments.

Site Reliability Engineering is a rapidly growing concept in industry, with a remit to drive the quality, reliability and performance of essential systems. As a Site Reliability Engineer you'll be part of a team in BAE Systems at the forefront of this, delivering these benefits to a key national security customer. We are in the process of building our team and tools, and with your help will create a culture of continual improvement to revolutionise the way our customer’s systems are built and maintained. This role blends operational product support with software engineering to create applications to understand the overall health of our systems. The SRE team sits within a wider programme at the core of the customer mission.

 

The role holder:

As an SRE, fundamentally you will be doing work that has historically been done by an operations team, but using software and systems engineering expertise to substitute automation for human labour, with the objective of limiting traditional manual operations work (incident tickets, on-call etc.) to no more than half of the SRE team's time (and aiming for considerably less). You will have an enthusiasm to learn and experiment, to develop tools to understand application health and improve their reliability to support the customer mission.
 

Role accountabilities include:
    Supporting and maintaining essential service that support core mission applications, proactively enhancing their availability, performance and stability.
    Being part of the 24/7 on call rota, supporting critical production systems out of business hours, for which additional on call allowances and overtime benefits will be paid.
    Finding innovative solutions to problems rather than undertaking repetitive work, automating everything you can. You will work alongside development teams, advising them of good practice in how to design and build systems, learning from what you know works well. 
    You will design and deploy monitoring products, creating bespoke tools where required, to provide comprehensive and intelligent observations to meet the customer requirements and demonstrate the improvements the team are making on a daily basis. You will be well versed in the relationship between software and infrastructure, understanding the characteristics of systems that enable them to be scalable and resilient to failure, and how to get the best out of the infrastructure they are deployed to.
    Participating in the wider DevOps/SRE community within the organisation.

 

Competancies

 

    It is desirable for you to have experience in the areas below. However more valued for this role is that you have excitement and enthusiasm to learn new technologies, and to deal with hard problems. Training, knowledge sharing and on the job development will enable you to plug any knowledge gaps.

o    Software development in web technologies and object oriented programming
o    Database technologies such as Oracle SQL, Mongo, Postgres
o    Know your way around Linux and Windows command lines, e.g. Bash and PowerShell
o    Monitoring large systems using technologies such as Grafana, Prometheus, ELK, Splunk
o    Experience of working in Agile teams, and the tooling that supports it, e.g. Atlassian
o    Diagnosing and troubleshooting application issues resulting in service outages
o    Troubleshooting skills across different levels of the stack
o    Understanding of ITIL
o    Micro-services architectures, Docker and container platforms such as Openshift, Kubernetes

  Awareness and insight into technology trends to adopt new cutting edge tools

 

Security Clearance

Due to the nature of our work, successful candidates for this role will be required to hold an active eDV before applying for this opportunity.

Life at BAE Systems Digital Intelligence 

We are embracing Hybrid Working. This means you and your colleagues may be working in different locations, such as from home, another BAE Systems office or client site, some or all of the time, and work might be going on at different times of the day.

By embracing technology, we can interact, collaborate and create together, even when we’re working remotely from one another. Hybrid Working allows for increased flexibility in when and where we work, helping us to balance our work and personal life more effectively, and enhance well-being.

Diversity and inclusion are integral to the success of BAE Systems Digital Intelligence. We are proud to have an organisational culture where employees with varying perspectives, skills, life experiences and backgrounds – the best and brightest minds – can work together to achieve excellence and realise individual and organisational potential. 

Division overview: Capabilities

At BAE Systems Digital Intelligence, we pride ourselves in being a leader in the cyber defence industry, and Capabilities is the engine that keeps the business moving forward. It is the largest area of Digital Intelligence, containing our Engineering, Consulting and Project Management teams that design and implement the defence solutions and digital transformation projects that make us a globally recognised brand in both the public and private sector.

As a member of the Capabilities team, you will be creating and managing the solutions that earn us our place in an ever changing digital world. We all have a role to play in defending our clients, and this is yours. 

Locations

  • London, United Kingdom

Salary

Estimated Salary Rangemedium confidence

60,000 - 80,000 GBP / yearly

Source: ai estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

  • Software and systems engineering expertiseintermediate
  • Automation skills to substitute human labourintermediate
  • Enthusiasm to learn and experimentintermediate
  • Ability to understand application health and improve reliabilityintermediate
  • Knowledge of the relationship between software and infrastructureintermediate
  • Understanding of scalable and resilient system characteristicsintermediate
  • Ability to get the best out of the infrastructureintermediate
  • Innovative problem-solving skillsintermediate
  • Collaboration and communication skillsintermediate

Required Qualifications

  • Software development in web technologies and object oriented programming (experience)
  • Database technologies such as Oracle SQL, Mongo, Postgres (experience)
  • Experience with Linux and Windows command lines, e.g. Bash and PowerShell (experience)
  • Monitoring large systems using technologies such as Grafana, Prometheus, ELK, Splunk (experience)
  • Experience of working in Agile teams, and the tooling that supports it, e.g. Atlassian (experience)
  • Diagnosing and troubleshooting application issues resulting in service outages (experience)
  • Troubleshooting skills across different levels of the stack (experience)
  • Understanding of ITIL (experience)
  • Micro-services architectures, Docker and container platforms such as Openshift, Kubernetes (experience)

Preferred Qualifications

  • Awareness and insight into technology trends to adopt new cutting edge tools (experience)
  • Excitement and enthusiasm to learn new technologies and deal with hard problems (experience)

Responsibilities

  • Supporting and maintaining essential services that support core mission applications, proactively enhancing their availability, performance and stability
  • Being part of the 24/7 on call rota, supporting critical production systems out of business hours
  • Finding innovative solutions to problems rather than undertaking repetitive work, automating everything possible
  • Working alongside development teams, advising them on good practice in system design and build
  • Designing and deploying monitoring products, creating bespoke tools where required
  • Participating in the wider DevOps/SRE community within the organisation

Benefits

  • general: Additional on call allowances and overtime benefits
  • general: Hybrid Working for increased flexibility and enhanced well-being

Target Your Resume for "Site Reliability Engineer – NS London" , BAE Systems

Get personalized recommendations to optimize your resume specifically for Site Reliability Engineer – NS London. Takes only 15 seconds!

AI-powered keyword optimization
Skills matching & gap analysis
Experience alignment suggestions

Check Your ATS Score for "Site Reliability Engineer – NS London" , BAE Systems

Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.

ATS compatibility check
Keyword optimization analysis
Skill matching & gap identification
Format & readability score

Tags & Categories

Digital IntelligenceEngineeringExperienced professionalsDigital IntelligenceEngineeringExperienced professionals

Answer 10 quick questions to check your fit for Site Reliability Engineer – NS London @ BAE Systems.

Quiz Challenge
10 Questions
~2 Minutes
Instant Score

Related Books and Jobs

No related jobs found at the moment.