
Lead SRE

About the Role:

As a Lead SRE, you will be a hands-on technical leader responsible for the stability, scalability, and performance of Coupa’s production platform. You will work closely with engineering teams to identify and resolve infrastructure issues, improve system reliability, and guide automation and monitoring best practices.

This role offers a unique opportunity to contribute to a globally scaled platform, guide operational excellence, and collaborate with international teams.

Key Responsibilities:

- Lead efforts to troubleshoot and resolve complex infrastructure and application issues.
- Drive improvements in system performance, observability, and automation.
- Build and manage infrastructure using Terraform, Chef or Ansible, and AWS services.
- Administer Linux-based systems, including web servers, application servers, and databases.
- Participate in and enhance on-call practices (approx. once per quarter).
- Support and maintain Kubernetes-based deployments in production.
- Provide support for Ruby-based applications and contribute to long-term system improvements.
- Coordinate change management processes and post-incident reviews.
- Collaborate with DevOps, QA, and engineering teams to ensure resilient and scalable systems.

Requirements:

- 4+ years of hands-on Platform/System Engineering experience using Go, Python, Java, Ruby, or equivalent programming languages.
- 3+ years of experience in an engineering role, working with a diverse and distributed team located across the globe.
- Hands-on experience with containerization technologies (Docker, Kubernetes, GitHub Actions, ArgoCD, EKS, AKS, ECS).
- Exposure to Infrastructure as Code (IaC) with Multi-Cloud Deployments.
- Proven experience building and reliably running modern full-stack cloud applications using public cloud technologies (AWS, Azure, GCP) at scale.
- Effective written and verbal communication skills to properly articulate complex technical problems to all levels of the organization and customers.
- Confidence in the ability to own and deliver a roadmap tied to business priorities.
- A passion for excellence, a natural problem solver, and a critical thinker who enjoys digging deep to understand issues and solve hard problems.
- Degree in Computer Science, Computer Engineering, or a related field (or equivalent experience).
- Experience with modern infrastructure management systems (Chef, Ansible, Terraform).
- Expertise in building Platform-as-a-Service (PaaS) solutions.