JOBSEARCHER

Site Reliability Engineer

Site Reliability Engineer (Contract-to-Hire) (Onsite Interview)Location: Dallas, TX (Hybrid/Onsite – Local Only)Duration: 3-Month Contract to HireInterview: OnsiteSchedule: Mon-Fri, 8 AM–5 PM PSTJob Description:Seeking an experienced Site Reliability Engineer (SRE) with 7+ years of SRE experience and strong production engineering background. Candidate should have hands-on experience in incident management, on-call support, RCA, automation, observability, and infrastructure reliability.Required Skills:Strong experience with Azure, Kubernetes, DockerCI/CD using GitHub ActionsMonitoring/Observability tools (Dynatrace preferred)Automation using Ansible, Python, BashSupport of Java applications in productionLinux and Windows administrationStrong understanding of SLIs, SLOs, Error BudgetsExperience leading or contributing to major production incidentsResponsibilities:Manage and improve system reliability, scalability, and performanceSupport production environments and participate in on-call rotationDrive incident response, root cause analysis, and corrective actionsBuild automation to reduce operational toilEnhance observability, monitoring, and operational reportingCollaborate with engineering teams on reliability improvements