Senior DevOps Engineer
We are working with one of healthcare's fastest growing disruptors who are Forbes ranked as one of "America's Top Startup Employers 2026". They have raised over $200m dollars by top tier investors like a16z and Costanoa Ventures, and did a Series C of over $100m. They have also been certified as a "Great Place to Work" for the past 6 years in a row.They are providing Gen AI solutions for the healthcare revenue cycle, empowering health systems to streamline their operations, so they can focus on delivering quality patient car. Their AI native product launched in 2024 and since then revenue has grown by 20x and deployments have been recognized nationally as "one of the most comprehensive real-world uses of GenAI in healthcare finance to date".Their customer base represents more than $120B+ in net patient revenue and includes the most innovative health systems in the country, like Cleveland Clinic, Duke, Stanford, and Johns Hopkins.They are looking for a new DevOps/Infrastructure Engineer. You'll help build and maintain the foundational infrastructure that supports our SaaS applications, including Kubernetes, Terraform-managed cloud resources, and GitHub-based CI/CD pipelines. The primary focus is on proactive improvements: reducing operational toil, improving visibility into system behavior, and enabling product teams to move fast with confidence.What You'll DoInfrastructure Management: Build, manage, and optimize infrastructure using Terraform, GitHub CI/CD, and Kubernetes.Monitoring & Observability: Create visualizations and alerts that provide actionable insights using tools like Grafana, Prometheus/Mimir, OpenSearch, and Sentry.Automation & Reliability: Identify manual or error-prone processes and replace them with automated, repeatable systems.Production Troubleshooting: Diagnose and resolve production issues across application and infrastructure layers.Documentation: Capture knowledge in runbooks, setup guides, and architecture diagrams to support operational maturity.Collaboration: Partner with engineers across teams to drive adoption of DevOps and infrastructure best practices.Scalability Planning: Help scale infrastructure and monitoring systems to meet growing demands.Incident Participation: Participate in an on-call rotation and support incident response processes as needed.Skills & QualificationsObservability: Experience with metrics, logs, and traces using tools such as Grafana, Prometheus/Mimir, OpenSearch, Sentry, or similar.Infrastructure as Code: Proficient with Terraform, Kubernetes, and containerization tools.Programming Skills: 5+ years of experience with Python.Linux Systems: Comfortable working with Linux-based environments and writing shell scripts.Communication: Strong collaboration skills with a focus on asynchronous, written communication.Documentation: Commitment to clear, comprehensive documentation and process standardization.Initiative: Self-starter mindset with a proactive approach to solving operational challenges.Version Control: Skilled in Git/GitHub-based workflows.Nice to havesCloud Experience: AWS (preferred), GCP, or Azure cloud infrastructure management.Networking Fundamentals: Familiarity with TCP/IP, DNS, routing, and load balancing concepts.Security: Understanding of cloud and infrastructure security best practices.Performance Tuning: Experience tuning application or infrastructure performance in production environments.What They OfferFlexible paid time off (PTO)Expansive coverage for health, dental, and visionEmployer contribution to Health Savings Accounts (HSA)Generous parental leave policyFull employee coverage for life insurance#J-18808-Ljbffr