<Back to Search
Site Reliability Engineer Lead
Houston, TXApril 5th, 2026
Brief Description:We are seeking an Site Reliability Engineer Lead to own and evolve the reliability, scalability, and operational excellence of cloud-native data platforms running primarily on Google Cloud Platform (GCP). This role supports data systems that ingest, process, and serve large volumes of operational data from oilfield and energy environments. The ideal candidate is a cloud-first SRE with deep GCP experience, strong Python engineering skills, and a track record of leading reliability initiatives for data-intensive systems.Detailed Description:* Lead SRE practices for GCP-based data platforms* Design and own SLIs, SLOs, error budgets, and reliability metrics* Build and maintain cloud-native observability (monitoring, logging, alerting)* Lead incident response for production cloud systems and drive postmortems* Partner with data engineering and platform teams to design reliable architectures* Automate operational workflows using Python* Drive improvements in CI/CD, infrastructure as code, and deployment safety* Mentor engineers and set SRE best practices across the teamRequired Knowledge, Skills, and Abilities:* 7+ years in SRE, Cloud Platform Engineering, or DevOps* Strong hands-on experience with Google Cloud Platform, including:* GCP: GKE, Compute Engine, Cloud Storage, Pub/Sub (or equivalents)* Cloud Monitoring & Logging* BigQuery* Dataflow* Datastream* IAM and networking* Composer/AIrflow* Kubernetes: deployment, scaling, reliability patterns* CI/CD: GitHub Actions, GitLab CI, or similar* Observability: GCP Cloud Monitoring, Logging* Experience supporting cloud-native data systems (batch and streaming)* Production experience with Python for automation, tooling, or services* Infrastructure as Code experience (Terraform strongly preferred)* Experience operating systems in 24/7 production environmentsMinimum Qualifications:* Bachelor's degree in Business, Information Technology, Computer Science, or a related field.* 5+ years experience in Site Reliability Engineering, Cloud Platform Engineering, or DevOps* 3+ years operating production workloads on Google Cloud Platform (GCP)* Prior technical leadership experience (lead engineer, tech lead, or ownership of reliability initiatives)* Ability to understand and speak English at a level of proficiency allowing employee to issue, receive and respond to both safety and operations-related directions in EnglishPreferred Qualifications:* Oil and Gas Industry knowledge* Technology/Digital Industry knowledge
475 matching similar jobs near Houston, TX
- Production Engineer II
- Software Engineer I (GBS)
- Front End/UI (Angular)
- Job Title: Intune Engineer/End Point Engineer
- Data Engineer
- SrZ/OS Systems programmer
- System Administrator
- Travel IT Technician
- C++ Software Developer
- Senior AI/ML Solution Architect
- Data Engineer
- Embedded Software Engineer
- Solace Developer
- API Monster United States Mixed Test Auto 192206
- API Monster United States Mixed Test Auto 140371
- AWS Cloud Administrator
- Operations System Engineer I
- Senior Systems Engineer
- Safety Superintendent
- Firmware Engineer
- Mac OS Storage Driver Developer – SCSI
- Salesforce Marketing Cloud Developer
- Enterprise Network Engineer
- Field Service Technology Delivery Consultant/Manager
- Assistant District Service Manager: Field Ops Lead
- Plant Manager/Maintenance Supervisor - Extrusion - Houston, TX
- Senior Technology Architect - .Net/Angular
- Marketing Systems & Analytics Coordinator
- Systems Engineer
- Sales & Design Professional - Security Integration
- Director of Telecom, Data Networks & Cloud Strategy
- Technical Business Systems Analyst
- Golf Course Maintenance Leader - Senior Assistant Superintendent
- Mobile Testing Lead - Strategy, Automation & Leadership
- Director of Software Engineering : CCB Risk Decisions
- Senior SAP PP System Architect (ECC/S4HANA)
- Lead Software Engineer - Python/Java - Trading application
- Android Sales Expert
- Information Security Manager
- Salesforce - DevOps Release Manager