Site Reliability Engineer

EngFlow
Austin, NY
April 9th, 2026

About EngFlow

At EngFlow, we help developers save time by accelerating software builds and tests. Our cloud-based, distributed service optimizes developer workflows through remote execution and caching, improving efficiency, productivity, and product quality.

Backed by top investors, EngFlow is redefining how companies build software and ship well-tested products. Our solutions speed up builds by a factor of 10 or more, while our observability platform provides actionable insights for optimization. Founded by key contributors to Bazel, we build tools that empower engineering teams, from startups to Fortune 500 companies, to enhance developer velocity and improve build performance.

Learn more about our mission, culture, and team: EngFlow | Video

We're looking for an experienced SRE to join our engineering team. You'll be at the intersection of software engineering and systems operations, ensuring our distributed infrastructure is highly available, performant, and scalable while enabling our engineers to move quickly and confidently.

Key Responsibilities

- Design, build, and maintain cloud infrastructure for our distributed build acceleration platform
- Automate everything: from deployment pipelines to monitoring and recovery
- Manage scalability and reliability for high-throughput, low-latency systems
- Implement and maintain observability: logging, metrics, tracing, and alerting
- Work closely with product and engineering teams to embed reliability into every feature
- Diagnose and resolve production incidents quickly, and feed learnings back into systems design
- Optimize cost, performance, and resilience across multi-cloud environments

Requirements

- 4+ years in SRE, DevOps, or Production Engineering roles
- Experience managing Kubernetes in production
- Strong background in cloud infrastructure (GCP or AWS) and IaC (Terraform preferred)
- Solid knowledge of networking, security, and distributed systems
- Track record of improving system availability and developer productivity
- A knack for debugging complex, cross-system issues under pressure

Benefits