<Back to Search
Senior Cloud Support Engineer
San Jose, CAApril 1st, 2026
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.About This Role:Crusoe Cloud is revolutionizing high-performance computing by offering sustainable, low-cost GPU compute power. As a Senior Cloud Support Engineer, you'll play a crucial role in empowering our customers to leverage this technology for groundbreaking advancements in fields like AI/ML, physics simulations, and computational biology. You will be the primary point of contact for technical support, ensuring our customers can seamlessly utilize Crusoe Cloud to achieve their goals. This role directly impacts Crusoe's mission by enabling our customers to accelerate their research and development, contributing to a more sustainable future. You will be involved in exciting projects, working with cutting-edge technologies and collaborating with a talented team to solve complex challenges. The ideal candidate is a highly motivated and experienced technical professional with a passion for customer success, a deep understanding of cloud technologies, and a commitment to Crusoe's values. This is a full-time position.What You'll Be Working On:Customer Support: Provide exceptional technical support to customers via Zendesk, meeting SLAs and maintaining high CSAT (95%+).On-Call Rotation: Participate in a 24/7 on-call rotation to ensure timely resolution of critical issues.Troubleshooting: Diagnose and resolve issues related to VMs, hardware failures, and scaling tests using CLI and internal tools.Alert Triage and Maintenance: Manage alert triage, prepare for maintenance windows, and conduct node delivery testing.Collaboration: Work closely with SRE, Networking, and Storage teams from initial triage to root cause analysis (RCA) delivery.Global Teamwork: Adhere to global team collaboration and handoff processes for ticketing and on-call procedures.Knowledge Sharing: Develop onboarding/training materials, knowledge base documentation, and standard operating procedures (SOPs).What You'll Bring to the Team:Education/Experience: Bachelor's degree in IT, Computer Science, Engineering, or a related field, or 4+ years of equivalent technical experience.Linux Proficiency: Strong command-line interface (CLI) skills in Linux environments.Version Control: Proficiency with Git for code management and collaboration.Customer Support Experience: 5+ years of experience in a customer support role, ideally within cloud, storage, or networking environments.Cloud Technologies: Experience with container orchestration (e.g., Kubernetes), workload management (e.g., Slurm, Terraform), and monitoring tools (e.g., Grafana).Public Cloud Knowledge: Familiarity with other public cloud platforms (e.g., AWS, Azure, GCP).Communication Skills: Excellent communication and customer service skills, including the ability to prioritize competing escalations.HPC Knowledge: Understanding of HPC technologies such as Infiniband, RDMA, RoCE, and Software Defined Networking (SDN).Bonus Points:Certifications: CKA, CKAD, CKS, KCNA, AWS Machine Learning - Specialty, Data Analytics - Specialty, Solutions Architect - Professional, Developer - Associate, NVIDIA AI Infrastructure and Operations, Generative AI and LLMs, Generative AI Multi-modal, Infiniband, Linux Foundation IT Associate, System Administrator.Cloud Expertise: Deep understanding of specific cloud platforms and services.Automation Skills: Experience with automation tools and scripting languages.Problem-Solving Abilities: Demonstrated ability to analyze complex technical issues and develop effective solutions.Collaboration and Mentorship: Proven ability to mentor, train, and onboard colleagues.Passion for Sustainability: A strong interest in contributing to a more sustainable future through technology.Benefits:Industry competitive payRestricted Stock Units in a fast growing, well-funded technology companyHealth insurance package options that include HDHP and PPO, vision, and dental for you and your dependentsEmployer contributions to HSA accountsPaid Parental LeavePaid life insurance, short-term and long-term disabilityTeladoc401(k) with a 100% match up to 4% of salaryGenerous paid time off and holiday scheduleCell phone reimbursementTuition reimbursementSubscription to the Calm appMetLife LegalCompany paid commuter benefit; $300 per pay periodCompensation:Compensation will be paid between $125,000 and $151,000 + Bonus. Restricted Stock Units are included in all offers. Salary will be determined by the applicant's education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
368 matching similar jobs near San Jose, CA
- Leader of Data Center Delivery
- Staff C/C++ Systems Engineer - Nanolog & Linux Kernel
- Data Center Networking Software Engineer - Switch Design
- Lead Cloud Architect - AI Infrastructure & Automation
- Staff DevOps Engineer, FedRAMP GovCloud & IaC Lead
- Senior Distributed Storage Architect
- Senior Backend Engineer, Data Platform & Cloud
- Backend Engineer II: Cloud Infra & Go
- Senior TPM: Hosted Infra & Cloud Ops (Remote Eligible)
- Data Center Partnerships Lead
- System Engineer
- Chief Platform Architect for AI-Driven Security
- Lead AI Engineer (Gen AI Platform, Agentic AI andamp; LLM Infrastructure andamp; Orchestration)
- ASIC Engineer - SDC
- Full-Stack Engineer, Enterprise GenAI
- Staff Operations Engineer (MLOps)
- Oracle Planning and Budgeting Cloud Services (PBCS) Technical Lead (ESTA)
- Staff Site Reliability Engineer
- Senior Vulnerability Analyst
- Sr. Product Manager (Security AI)
- CAD HW-SW Infrastructure Engineer
- Distinguished Engineer
- Lead Forward Deployed Engineer
- Senior Manager, Forward Deployed AI Engineer
- Senior Full-Stack Engineer, Enterprise GenAI
- Architect, Site Reliability Engineering
- Python Agentic AI/ML Engineer
- BMC Engineer
- Fullstack Senior Engineer with Python (Django) and React - Latin America, Remote position
- Project Manager (Google Cloud Platform)
- Senior Site Reliability Engineer
- ServiceNow Administrator II
- Site Reliability Engineer - USDS
- Senior Software Engineer, AI Platform & Generative Tech
- M365 Copilot Solution Engineer
- Staff Cloud Engineer - Lead DevOps & Cloud Initiatives
- Senior Loyalty Platform Engineer - AI-Driven, Global Scale
- Platform Engineer - Cloud Data Platform & GenAI
- Data Platform Engineer: Build Scalable Big Data Systems
- IT Security Engineer (Korean Bilingual)