Site Reliability Engineer
Job Title: Site Reliability Engineering
Location: Dallas, TX

Job Description:
Technical leader defining reliability strategy, platform architecture, and SRE maturity across teams and products.

Key Responsibilities:
Architect highly available, secure, scalable platforms
Define SRE standards, roadmap, and best practices
Own availability, resiliency, and DR strategies
Lead high-severity, cross-team incidents
Drive large-scale automation and platform improvements
Mentor senior engineers and influence leadership

Mandatory Skills (Skill → Experience):
Linux, networking & distributed systems design – 8–12 yrs
Cloud architecture (AWS / Azure / GCP at scale) – 7–10 yrs
Kubernetes, platform engineering & service mesh – 6–8 yrs
Infrastructure as Code & governance (Terraform) – 6–8 yrs
Automation & systems programming (Python / Go) – 6–8 yrs
Observability strategy (metrics, logs, tracing) – 6–8 yrs
SRE practices (SLOs, error budgets, toil reduction) – 6–8 yrs

Soft Skills:
Strategic and systems-level thinking
Strong technical leadership and influence
Executive-level communication skills
Coaching and mentoring senior engineers
Ownership of reliability vision and outcomes