RESUME AND JOB

Senior Site Reliability Engineer, Storage Careers at Crusoe - San Francisco, California | Apply Now!

Crusoe

Full-time | Posted: Oct 29, 2025

Job Description

Senior Site Reliability Engineer, Storage at Crusoe - San Francisco, CA

Overview

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability. Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure. As a Senior Site Reliability Engineer (SRE) specializing in Storage, you will be pivotal in maintaining the performance, reliability, and scalability of Crusoe’s AI-optimized cloud infrastructure. This role is ideal for someone passionate about distributed systems, storage technologies, and ensuring high availability in a demanding environment.

A Day in the Life

Your day-to-day activities will be a mix of proactive monitoring, reactive troubleshooting, and strategic planning. You will:

Develop and maintain automation tools for monitoring and managing Crusoe’s distributed cloud storage infrastructure.
Respond to and resolve storage-related incidents, utilizing telemetry, logs, and performance profiling.
Collaborate with storage engineers to enhance data replication, encryption, backup, and failover mechanisms.
Work with hardware and kernel teams to optimize I/O paths, cache policies, and file systems.
Participate in the design and architecture of fault-tolerant storage backends for AI-driven cloud environments.
Support user-facing storage services, ensuring high availability and adherence to error budgets.
Contribute to documentation and knowledge sharing within the team.

Why San Francisco?

San Francisco is a global hub for technology and innovation, offering a vibrant ecosystem for professionals in the tech industry. By working in San Francisco, you'll be at the heart of a dynamic community, surrounded by leading companies, startups, and research institutions. The city provides unparalleled opportunities for networking, learning, and career advancement.

Career Path

This Senior SRE role offers significant opportunities for career advancement within Crusoe. You could grow into a Principal SRE, leading critical projects and initiatives, or transition into a management role, overseeing a team of SREs. Additionally, you might specialize further in a specific storage technology or architecture, becoming a subject matter expert within the organization.

Salary & Benefits

Crusoe provides a competitive salary and benefits package, reflecting the company's commitment to attracting and retaining top talent. While the exact salary will depend on experience and qualifications, the expected range for this role in San Francisco is $170,000 - $250,000 annually. To support employees' well-being and professional development, Crusoe's benefits include medical, dental, and vision insurance; paid time off and holidays; retirement and savings plans; parental leave; life insurance; disability insurance; mental health support; commuter benefits; employee assistance programs; and education benefits.

Crusoe Culture

Crusoe is committed to building a diverse and inclusive workplace where everyone feels valued and respected. The company encourages collaboration, innovation, and continuous learning. Employees are empowered to take ownership of their work and contribute to the company's mission of accelerating the abundance of energy and intelligence in a sustainable way.

How to Apply

Interested candidates are encouraged to apply online through the Crusoe Energy Systems careers page. Please submit your resume and a cover letter highlighting your relevant experience and qualifications. Be sure to emphasize your experience with distributed storage systems, automation tools, and Linux internals.

FAQ

What is Crusoe's mission?
Crusoe's mission is to accelerate the abundance of energy and intelligence.
What is the role of the SRE team at Crusoe?
The SRE team plays a mission-critical role in maintaining the performance and reliability of Crusoe’s AI-optimized cloud infrastructure.
What storage systems will I be working with?
You will be working with distributed storage systems such as Ceph, GlusterFS, and OpenEBS, as well as object, block, and file storage paradigms.
What programming languages are important for this role?
Proficiency in a programming language such as Python, Go, Java, or C is highly desirable.
What is the company culture like at Crusoe?
Crusoe is committed to building a diverse and inclusive workplace where everyone feels valued and respected.
What are the benefits of working in San Francisco?
San Francisco is a global hub for technology and innovation, offering a vibrant ecosystem for professionals in the tech industry.
What opportunities for career advancement are available?
You could grow into a Principal SRE, lead critical projects, or transition into a management role.
What is the expected salary range for this role?
The expected salary range for this role in San Francisco is $170,000 - $250,000 annually.
What kind of health insurance does Crusoe offer?
Crusoe offers comprehensive health insurance, including medical, dental, and vision coverage.
How does Crusoe support employee well-being?
Crusoe provides a variety of benefits to support employees' well-being, including mental health support and employee assistance programs.

Locations

San Francisco, California, United States

Salary

Estimated Salary Range (medium confidence)

187,000 - 275,000 USD per year

Source: AI-estimated

* This is an estimated range based on market data and may vary based on experience and qualifications.

Skills Required

Site Reliability Engineering (SRE): intermediate
Distributed Storage Systems: intermediate
Ceph: intermediate
GlusterFS: intermediate
OpenEBS: intermediate
Object Storage: intermediate
Block Storage: intermediate
File Storage: intermediate
Python: intermediate
Go: intermediate
Java: intermediate
C++: intermediate
Infrastructure as Code: intermediate
Terraform: intermediate
Ansible: intermediate
Puppet: intermediate
Linux Internals: intermediate
I/O Subsystems: intermediate
Memory Management: intermediate
Storage Scheduling: intermediate
NFS: intermediate
SMB: intermediate
iSCSI: intermediate
NVMe-oF: intermediate
Containerization: intermediate
Kubernetes: intermediate
Docker: intermediate
Incident Response: intermediate
Troubleshooting: intermediate
Documentation: intermediate
Cloud Storage Services (AWS, GCP, Azure): intermediate
High-throughput networking: intermediate
RoCE: intermediate
RDMA: intermediate
InfiniBand: intermediate

Required Qualifications

5+ years of professional experience in SRE, systems, or storage engineering
Hands-on experience with distributed storage systems (e.g., Ceph, GlusterFS, OpenEBS)
Deep understanding of object, block, and file storage paradigms
Proficiency in a programming language such as Python, Go, Java, or C
Experience with Infrastructure as Code and deployment tooling such as Terraform, Ansible, or Puppet
Deep knowledge of Linux internals with a focus on I/O subsystems, memory management, and storage scheduling
Familiarity with storage protocols like NFS, SMB, iSCSI, or NVMe-oF
Strong experience working with containerized workloads and orchestration platforms (e.g., Kubernetes, Docker)
Excellent incident response, troubleshooting, and documentation practices
Experience with building and operating managed services at scale such as object, file, and block storage (AWS, GCP, Azure)
Excellent communication skills
Must be able to pass a background check
Embody the Company values
Contributions to open-source storage projects or the Linux storage stack (Bonus)
Experience with hybrid storage models across on-prem and cloud environments (Bonus)
Familiarity with high-throughput network topologies for storage backplanes (e.g., RoCE, RDMA, InfiniBand) (Bonus)

Responsibilities

Build automation and self-healing tools to monitor and maintain Crusoe’s distributed cloud storage infrastructure.
Maintain Crusoe’s distributed cloud storage infrastructure, including block, file, and object storage systems.
Drive reliability initiatives focused on data replication, encryption, backup and restore strategies, and robust failover mechanisms.
Collaborate closely with storage engineers to implement and maintain high-performance NVMe- and SSD-backed volumes.
Support large-scale AI compute clusters.
Support user-facing storage services with a focus on availability and performance tuning.
Adhere to error budgets for storage services.
Investigate and resolve storage-related incidents using deep telemetry, logs, and performance profiling.
Partner with hardware and kernel teams to diagnose low-level I/O issues.
Optimize I/O paths, cache policies, and file systems.
Contribute to the architecture of fault-tolerant, scalable storage backends tailored for AI-first cloud environments.
Ensure the availability, performance, and scalability of Crusoe’s cloud storage products and services.
Optimize storage systems that power compute-intensive, latency-sensitive workloads for AI and HPC use cases.

Benefits

Contribute to a company accelerating the abundance of energy and intelligence.
Be part of the AI revolution with sustainable technology.
Drive meaningful innovation.
Make a tangible impact.
Join a team setting the pace for responsible, transformative cloud infrastructure.
Work in a mission-critical role maintaining the performance and reliability of AI-optimized cloud infrastructure.
Build and optimize distributed, fault-tolerant storage systems at scale.
Competitive salary and benefits package
Opportunity for professional growth and development
Collaborative and supportive work environment
Chance to work on cutting-edge technology
Be a part of a company committed to sustainability.
Health insurance
Paid time off
Retirement plan

Tags & Categories

SRE, Storage, Cloud, AI, San Francisco, Full-time, Site Reliability Engineer, Storage Engineer, California, Artificial Intelligence, Cloud Infrastructure, Distributed Systems, Ceph, GlusterFS, OpenEBS, Object Storage, Block Storage, File Storage, Linux, I/O, Automation, Terraform, Ansible, Kubernetes, Docker, NVMe, SSD, High Performance Computing, Data Replication, Fault Tolerance, Crusoe Energy Systems, Sustainable Technology, Green Tech, AI Infrastructure, Cloud, Engineering

