<Back to Search
Senior Site Reliability Engineer
Raleigh, NCMarch 20th, 2026
No sponsorship will be provided for this role.Location: On site at location listed in posting.Weekly Schedule: Monday-Friday, 9am-5pmWe are seeking a Senior Site Reliability Engineer who will be the guardian of our Azure infrastructure reliability. This role focuses on building comprehensive observability platforms, implementing intelligent monitoring systems, and proactively identifying issues before they impact production. You will create the tools and automation that predict, detect, and prevent problems rather than simply reacting to them. Your primary mission is ensuring our Azure infrastructure and applications never surprise us with failures.The ideal candidate has deep expertise in Azure Monitor, Application Insights, Log Analytics, and KQL, combined with strong scripting skills in Python or PowerShell. You should have 5-7+ years of experience implementing observability platforms and a proven track record of preventing incidents through proactive monitoring and automation. You'll work with technologies like Prometheus, Grafana, OpenTelemetry, and Azure services (AKS, App Services, Azure SQL, Cosmos DB) while building self-healing automation and predictive analytics tools that keep our systems healthy.Key Responsibilities:Design and implement comprehensive observability stack across all Azure resources and applicationsBuild intelligent alerting systems with anomaly detection and predictive capabilities to prevent incidentsCreate self-healing automation and auto-remediation tools that resolve issues without human interventionDevelop internal monitoring platforms, dashboards, and CLI tools for engineering teamsWrite KQL queries and analyze metrics/logs to identify optimization opportunities and predict failuresImplement continuous resource monitoring for Azure quotas, costs, security posture, and service healthBuild capacity forecasting and trend analysis tools to prevent resource exhaustionReduce alert noise while improving coverage and actionability of monitoring systemsParticipate in light on-call rotation (prevention-focused approach reduces reactive incidents)About UsFirst Horizon Corporation is a leading regional financial services company, dedicated to helping our clients, communities and associates unlock their full potential with capital and counsel. Headquartered in Memphis, TN, the banking subsidiary First Horizon Bank operates in 12 states across the southern U.S. The Company and its subsidiaries offer commercial, private banking, consumer, small business, wealth and trust management, retail brokerage, capital markets, fixed income, and mortgage banking services. First Horizon has been recognized as one of the nation's best employers by Fortune and Forbes magazines and a Top 10 Most Reputable U.S. Bank. More information is available at www.FirstHorizon.com.Benefit HighlightsMedical with wellness incentives, dental, and visionHSA with company matchMaternity and parental leaveTuition reimbursementMentor program401(k) with 6% matchMore -- FirstHorizon.com/First-Horizon-National-Corporation/Careers/Our-BenefitsFollow UsFacebookX formerly TwitterLinkedInInstagramYouTube
Showing 50 of 36,192 matching similar jobs
- Senior Systems Engineer - Openshift / AKS
- Site Reliability Engineer
- Senior Cloud Platform Engineer
- Site Reliability Engineer [Hybrid]
- Senior Technology Site Reliability Engineer
- Senior Technology Site Reliability Engineer
- Senior Technology Site Reliability Engineer
- Senior Technology Site Reliability Engineer
- Senior Platform Infra Engineer - Kubernetes & Cloud
- Staff Developer Relations Engineer, Cloud Platform Evaluations Team
- Security-Cleared Platform Engineer - AWS, Kubernetes, Hybrid
- Senior Technology Site Reliability Engineer
- Senior Platform Engineer - Kubernetes & Cloud-Native Scale
- Senior Site Reliability Engineer Cloud Platform
- Senior Site Reliability Engineer, Identity Platform
- Senior Cloud Engineer
- Platform Engineering Leader: Scale Cloud, Empower Devs
- Platform DevOps Engineer - AI/Cloud Automation & CI/CD
- Senior Cloud Platform Engineer - Remote
- Senior Platform Engineer
- AWS Cloud Engineer
- Senior Infrastructure Engineer - Scale & Secure Cloud (AWS)
- GCP Cloud Infrastructure Architect: Build Scalable Global Infra
- Sr. Cloud Engineer III (5780)
- Professional, Cloud Platform Engineer
- Cloud Platform Engineer - Azure & AWS, Hybrid Cloud
- Senior Cloud Java Engineer - AWS, Kubernetes, APIs
- Azure Cloud Engineer
- Site Reliability Engineer, Discovery
- Sr Observability Engineer
- Platform EngineerEnglewood, COMarch 25th, 2026
- Site Reliability Engineer - Video Infrastructure
- Lead Site Reliability Engineering - Network
- Site Reliability Developer 6
- Site Reliability Developer 6
- Site Reliability Developer 6
- ELH Site Reliability Engineer Lowell, SVL, Austin
- Site Reliability Developer 6
- Sr Infrastructure Engineer - Azure Cloud
- Senior Site Reliability Engineer (SRE)