RESUME AND JOB
Crusoe
As a Senior Site Reliability Engineer (SRE) specializing in Managed AI at Crusoe, you will be at the forefront of building and maintaining the infrastructure that powers the AI revolution. You will work on ensuring the reliability, scalability, and performance of Crusoe's AI-optimized cloud platform, with a specific focus on large language models (LLMs). This role is crucial for delivering highly available, performant, and cost-efficient AI infrastructure to our customers, enabling them to tackle compute-intensive and latency-sensitive workloads.
Your primary responsibility will be to design, build, and operate reliable managed AI services, with a strong emphasis on serving and scaling LLM workloads. You will develop automation and reliability tooling to support distributed AI pipelines and inference services. Defining and measuring Service Level Indicators (SLIs) and Service Level Objectives (SLOs) across AI workloads will be essential to ensure performance and reliability targets are met. Collaboration with AI, platform, and infrastructure teams will be vital for optimizing large-scale training and inference clusters. Furthermore, you will automate observability, create telemetry, and devise performance tuning strategies for latency-sensitive AI services.
Investigating and resolving reliability issues in distributed AI systems using telemetry, logs, and profiling tools will be a regular part of your work. You will also contribute to the architecture of next-generation distributed systems purpose-built for AI-first environments, playing a key role in shaping the future of AI infrastructure.
Here’s what a typical day might look like for a Senior Site Reliability Engineer at Crusoe:
San Francisco is a global hub for technology and innovation, making it an ideal location for a Senior Site Reliability Engineer working on cutting-edge AI infrastructure. The city boasts a vibrant ecosystem of startups, established tech companies, and research institutions, creating a wealth of opportunities for professional growth and networking. Furthermore, San Francisco is renowned for its diverse culture, world-class dining, and access to outdoor activities, making it a desirable place to live and work.
At Crusoe, we are committed to the growth and development of our employees. A Senior Site Reliability Engineer can advance their career in several directions:
The estimated salary range for a Senior Site Reliability Engineer (Managed AI) in San Francisco is $180,000 to $280,000 per year. Note that salary ranges can vary based on experience, skills, and other factors.
Crusoe offers a comprehensive benefits package, including:
Crusoe is committed to accelerating the abundance of energy and intelligence by crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability. Crusoe is a fast-paced, mission-driven environment where innovation and collaboration are highly valued. We are looking for individuals who are passionate about making a tangible impact and contributing to a team that is setting the pace for responsible, transformative cloud infrastructure. We value integrity, intellectual honesty, and a commitment to excellence.
Interested candidates are encouraged to apply through the Crusoe careers page. Please submit your resume and a cover letter highlighting your relevant experience and qualifications.
198,000 - 308,000 USD / yearly
Source: ai estimated
* This is an estimated range based on market data and may vary based on experience and qualifications.
Get personalized recommendations to optimize your resume specifically for Senior Site Reliability Engineer (Managed AI) Careers at Crusoe - San Francisco, California | Apply Now!. Takes only 15 seconds!
Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.
Answer 10 quick questions to check your fit for Senior Site Reliability Engineer (Managed AI) Careers at Crusoe - San Francisco, California | Apply Now! @ Crusoe.

No related jobs found at the moment.

© 2026 Pointers. All rights reserved.

Crusoe
As a Senior Site Reliability Engineer (SRE) specializing in Managed AI at Crusoe, you will be at the forefront of building and maintaining the infrastructure that powers the AI revolution. You will work on ensuring the reliability, scalability, and performance of Crusoe's AI-optimized cloud platform, with a specific focus on large language models (LLMs). This role is crucial for delivering highly available, performant, and cost-efficient AI infrastructure to our customers, enabling them to tackle compute-intensive and latency-sensitive workloads.
Your primary responsibility will be to design, build, and operate reliable managed AI services, with a strong emphasis on serving and scaling LLM workloads. You will develop automation and reliability tooling to support distributed AI pipelines and inference services. Defining and measuring Service Level Indicators (SLIs) and Service Level Objectives (SLOs) across AI workloads will be essential to ensure performance and reliability targets are met. Collaboration with AI, platform, and infrastructure teams will be vital for optimizing large-scale training and inference clusters. Furthermore, you will automate observability, create telemetry, and devise performance tuning strategies for latency-sensitive AI services.
Investigating and resolving reliability issues in distributed AI systems using telemetry, logs, and profiling tools will be a regular part of your work. You will also contribute to the architecture of next-generation distributed systems purpose-built for AI-first environments, playing a key role in shaping the future of AI infrastructure.
Here’s what a typical day might look like for a Senior Site Reliability Engineer at Crusoe:
San Francisco is a global hub for technology and innovation, making it an ideal location for a Senior Site Reliability Engineer working on cutting-edge AI infrastructure. The city boasts a vibrant ecosystem of startups, established tech companies, and research institutions, creating a wealth of opportunities for professional growth and networking. Furthermore, San Francisco is renowned for its diverse culture, world-class dining, and access to outdoor activities, making it a desirable place to live and work.
At Crusoe, we are committed to the growth and development of our employees. A Senior Site Reliability Engineer can advance their career in several directions:
The estimated salary range for a Senior Site Reliability Engineer (Managed AI) in San Francisco is $180,000 to $280,000 per year. Note that salary ranges can vary based on experience, skills, and other factors.
Crusoe offers a comprehensive benefits package, including:
Crusoe is committed to accelerating the abundance of energy and intelligence by crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability. Crusoe is a fast-paced, mission-driven environment where innovation and collaboration are highly valued. We are looking for individuals who are passionate about making a tangible impact and contributing to a team that is setting the pace for responsible, transformative cloud infrastructure. We value integrity, intellectual honesty, and a commitment to excellence.
Interested candidates are encouraged to apply through the Crusoe careers page. Please submit your resume and a cover letter highlighting your relevant experience and qualifications.
198,000 - 308,000 USD / yearly
Source: ai estimated
* This is an estimated range based on market data and may vary based on experience and qualifications.
Get personalized recommendations to optimize your resume specifically for Senior Site Reliability Engineer (Managed AI) Careers at Crusoe - San Francisco, California | Apply Now!. Takes only 15 seconds!
Find out how well your resume matches this job's requirements. Get comprehensive analysis including ATS compatibility, keyword matching, skill gaps, and personalized recommendations.
Answer 10 quick questions to check your fit for Senior Site Reliability Engineer (Managed AI) Careers at Crusoe - San Francisco, California | Apply Now! @ Crusoe.

No related jobs found at the moment.

© 2026 Pointers. All rights reserved.