Lead Site Reliability Engineer

JP Morgan Chase

Full-time | Posted: Nov 7, 2025

Job Description

Location: Plano, TX, United States

Job Family: Software Engineering

About the Role

At JP Morgan Chase, we are at the forefront of financial innovation, powering the world's leading banks with cutting-edge technology. As a Lead Site Reliability Engineer in our Plano, TX office, you will play a pivotal role in ensuring the reliability and scalability of our mission-critical platforms that support global banking operations, payments, and investment services. This position within the Software Engineering category involves leading resiliency design reviews, breaking down complex problems, and serving as a technical lead for medium to large-sized products. You will collaborate with elite teams to build systems that withstand the rigors of the financial industry, where downtime can have significant implications.

In this leadership role, you will conduct in-depth resiliency assessments, identifying potential failure points and architecting solutions that align with JP Morgan Chase's commitment to operational excellence. Expect to mentor junior engineers, drive the adoption of SRE methodologies like error budgets and service level objectives (SLOs), and integrate security best practices to comply with stringent regulations such as those from the SEC and Federal Reserve. Your work will directly impact products handling trillions in transactions, requiring a blend of technical expertise and strategic foresight to enhance system performance in a high-stakes environment.

We value innovation and reliability equally, offering you the chance to work on diverse projects from real-time trading platforms to AI-enhanced risk management tools. Joining JP Morgan Chase means becoming part of a collaborative culture that invests in your growth through world-class training and resources. If you thrive in dynamic settings and are passionate about engineering resilient financial infrastructures, this role provides an unparalleled opportunity to lead transformative initiatives in one of the most influential firms in the industry.

Key Responsibilities

  • Lead resiliency design reviews for medium to large-sized products, ensuring alignment with JP Morgan Chase's enterprise standards
  • Conduct thorough analysis of system failures and recommend improvements to enhance platform reliability in financial services
  • Act as a technical lead, mentoring engineers and guiding the decomposition of complex problems into scalable solutions
  • Collaborate with cross-functional teams including software developers, product managers, and compliance officers to build robust infrastructures
  • Implement monitoring, alerting, and automation tools to maintain 99.99% uptime for critical banking applications
  • Drive adoption of SRE best practices, such as error budgets and chaos engineering, tailored to the financial industry's risk profile
  • Contribute to the strategic roadmap for site reliability, focusing on scalability and security in a regulated environment
  • Perform code reviews and provide technical guidance on infrastructure as code (IaC) practices using tools like Terraform
  • Ensure systems comply with financial regulations and internal JP Morgan policies during design and deployment phases
  • Foster a culture of continuous improvement by sharing insights from post-incident reviews across global teams

Required Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent work experience
  • 7+ years of experience in software engineering, site reliability engineering, or DevOps roles
  • Proven experience leading technical teams in designing and implementing resilient systems in a financial services environment
  • Strong understanding of cloud infrastructure, preferably AWS or Azure, with hands-on experience in high-availability architectures
  • Experience with resiliency design reviews and conducting failure mode analysis in mission-critical applications
  • Demonstrated ability to break down complex technical problems into actionable solutions
  • Familiarity with regulatory compliance standards such as SOX, GDPR, and PCI-DSS in the banking sector

Preferred Qualifications

  • Master's degree in Computer Science or related field
  • Experience working at a large-scale financial institution like JP Morgan Chase
  • Certifications such as AWS Certified Solutions Architect or Google Professional SRE
  • Prior leadership in agile methodologies for medium to large product teams
  • Knowledge of machine learning operations (MLOps) for AI-driven financial products

Required Skills

  • Site Reliability Engineering (SRE) principles and practices
  • Cloud computing platforms (AWS, Azure, GCP)
  • Infrastructure as Code (Terraform, CloudFormation)
  • Containerization and orchestration (Docker, Kubernetes)
  • Monitoring and observability tools (Prometheus, Grafana, ELK Stack)
  • CI/CD pipelines (Jenkins, GitLab CI, CircleCI)
  • Programming languages (Python, Go, Java)
  • Network security and encryption protocols
  • Agile and Scrum methodologies
  • Problem-solving and analytical thinking
  • Leadership and team mentoring
  • Communication and stakeholder management
  • Risk assessment in financial systems
  • Chaos engineering tools (Chaos Monkey, Gremlin)
  • Data privacy and compliance knowledge

Benefits

  • Comprehensive health, dental, and vision insurance plans with employer contributions
  • 401(k) retirement savings plan with generous company matching
  • Paid time off including vacation, sick days, and parental leave
  • Professional development opportunities through JP Morgan's internal training programs and tuition reimbursement
  • Employee stock purchase plan and performance-based bonuses
  • Wellness programs including gym memberships and mental health support
  • Flexible work arrangements with hybrid options in Plano, TX
  • Access to on-site amenities and community volunteer opportunities

JP Morgan Chase is an equal opportunity employer.

Locations

  • Plano, US

Salary

Estimated Salary Range (high confidence)

180,000 - 280,000 USD per year

Source: AI estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Tags & Categories

Software Engineering, Financial Services, Banking, JP Morgan
