Senior Cloud Support Engineer
Santa Clara, CA
April 1st, 2026
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI, without sacrificing scale, speed, or sustainability.

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.

About This Role:

Crusoe Cloud is revolutionizing high-performance computing by offering sustainable, low-cost GPU compute power. As a Senior Cloud Support Engineer, you'll play a crucial role in empowering our customers to leverage this technology for groundbreaking advancements in fields like AI/ML, physics simulations, and computational biology. You will be the primary point of contact for technical support, ensuring our customers can seamlessly utilize Crusoe Cloud to achieve their goals. This role directly impacts Crusoe's mission by enabling our customers to accelerate their research and development, contributing to a more sustainable future. You will be involved in exciting projects, working with cutting-edge technologies and collaborating with a talented team to solve complex challenges. The ideal candidate is a highly motivated and experienced technical professional with a passion for customer success, a deep understanding of cloud technologies, and a commitment to Crusoe's values.

This is a full-time position.

What You'll Be Working On:

- Customer Support: Provide exceptional technical support to customers via Zendesk, meeting SLAs and maintaining high CSAT (95%+).
- On-Call Rotation: Participate in a 24/7 on-call rotation to ensure timely resolution of critical issues.
- Troubleshooting: Diagnose and resolve issues related to VMs, hardware failures, and scaling tests using CLI and internal tools.
- Alert Triage and Maintenance: Manage alert triage, prepare for maintenance windows, and conduct node delivery testing.
- Collaboration: Work closely with SRE, Networking, and Storage teams from initial triage to root cause analysis (RCA) delivery.
- Global Teamwork: Adhere to global team collaboration and handoff processes for ticketing and on-call procedures.
- Knowledge Sharing: Develop onboarding/training materials, knowledge base documentation, and standard operating procedures (SOPs).

What You'll Bring to the Team:

- Education/Experience: Bachelor's degree in IT, Computer Science, Engineering, or a related field, or 4+ years of equivalent technical experience.
- Linux Proficiency: Strong command-line interface (CLI) skills in Linux environments.
- Version Control: Proficiency with Git for code management and collaboration.
- Customer Support Experience: 5+ years of experience in a customer support role, ideally within cloud, storage, or networking environments.
- Cloud Technologies: Experience with container orchestration (e.g., Kubernetes), workload management (e.g., Slurm, Terraform), and monitoring tools (e.g., Grafana).
- Public Cloud Knowledge: Familiarity with other public cloud platforms (e.g., AWS, Azure, GCP).
- Communication Skills: Excellent communication and customer service skills, including the ability to prioritize competing escalations.
- HPC Knowledge: Understanding of HPC technologies such as InfiniBand, RDMA, RoCE, and Software Defined Networking (SDN).

Bonus Points:

- Certifications: CKA, CKAD, CKS, KCNA, AWS Machine Learning - Specialty, Data Analytics - Specialty, Solutions Architect - Professional, Developer - Associate, NVIDIA AI Infrastructure and Operations, Generative AI and LLMs, Generative AI Multi-modal, InfiniBand, Linux Foundation IT Associate, System Administrator.
- Cloud Expertise: Deep understanding of specific cloud platforms and services.
- Automation Skills: Experience with automation tools and scripting languages.
- Problem-Solving Abilities: Demonstrated ability to analyze complex technical issues and develop effective solutions.
- Collaboration and Mentorship: Proven ability to mentor, train, and onboard colleagues.
- Passion for Sustainability: A strong interest in contributing to a more sustainable future through technology.

Benefits:

- Industry competitive pay
- Restricted Stock Units in a fast growing, well-funded technology company
- Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
- Employer contributions to HSA accounts
- Paid Parental Leave
- Paid life insurance, short-term and long-term disability
- Teladoc
- 401(k) with a 100% match up to 4% of salary
- Generous paid time off and holiday schedule
- Cell phone reimbursement
- Tuition reimbursement
- Subscription to the Calm app
- MetLife Legal
- Company paid commuter benefit; $300 per pay period

Compensation:

Compensation will be paid between $125,000 and $151,000 + Bonus. Restricted Stock Units are included in all offers. Salary will be determined by the applicant's education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.