CloudOps / SRE Engineer

About Autonomize AI

Autonomize AI is revolutionizing healthcare by streamlining knowledge workflows with AI. We reduce administrative burdens and elevate outcomes, empowering professionals to focus on what truly matters: improving lives. We're growing fast and looking for bold, driven teammates to join us.

The Opportunity

We’re looking for a CloudOps / Site Reliability Engineer to lead the charge in building a fully automated, secure, and scalable multi-cloud infrastructure for our AI-powered healthcare platform. Your mission: keep our deployments lightning-fast, reliable, and invisible. You’ll own the orchestration of services across AWS, Azure, and GCP, automating everything from infrastructure provisioning to rollbacks, with security and uptime built in. This is a builder role, ideal for someone who can go deep into CI/CD, lives for IaC, and thinks deployment velocity is just as important as resiliency.

Key Responsibilities

- Multi-Cloud Infra Management: Design and manage highly available, scalable, and secure infrastructure across AWS, Azure, and GCP
- End-to-End Automation: Build deployment workflows using Terraform, Ansible, Helm, ArgoCD, GitHub Actions, or equivalent
- CI/CD at Scale: Own automated delivery pipelines for infrastructure and applications across staging and production
- Reliability Engineering: Define and uphold SLAs/SLOs; own incident management, blameless postmortems, and error budgets
- Security & Compliance: Implement and continuously harden controls for HIPAA, SOC2, and zero-trust environments
- Monitoring & Observability: Deploy and maintain logs, metrics, and alerting systems using Prometheus, Grafana, Datadog, etc.
- Documentation & Process: Create robust runbooks, architectural diagrams, and continuous improvement loops
- Install and configure the AI Platform and Solutions at customer deployments
- Support IT and InfoSec discussions and reviews with customers
- Guide the offshore team as necessary and help automate deployments

Must-Have Qualifications

- 5+ years in SRE/CloudOps roles with production-grade infrastructure experience
- Expertise in AWS and solid hands-on experience with Azure and GCP
- Proven track record with Infrastructure as Code (Terraform preferred) and modern deployment frameworks
- Deep CI/CD experience, including automated rollbacks and blue/green or canary deployments
- Skilled in Kubernetes, Docker, and container orchestration
- Experience with secure cloud architectures, RBAC, IAM, and secrets management
- Bias for automation: scripting in Python, Bash, or Go
- Culture fit: you take full ownership, run toward complexity, and operate in the final mile

Bonus

- Prior experience supporting healthtech, life sciences, or other regulated domains
- Experience implementing policy-as-code tools like OPA/Gatekeeper
- Experience running GPU workloads, ML pipelines, or scalable microservices
- Contributions to open-source DevOps/SRE communities

What We Offer

- A chance to make a real impact in the future of healthcare
- Autonomy, ownership, and the ability to chart your own growth path
- Competitive compensation and benefits
- 100% employer-paid health, vision, and dental insurance
- Retirement plans (401k), disability insurance, employee assistance programs

How To Apply

Please submit your resume and a brief cover letter to careers@autonomize.ai explaining why you are the ideal candidate for this role. We are excited to meet someone who is eager to bring their skills, enthusiasm, and creativity to our team!