JOBSEARCHER

Site Reliability Engineer

ARCHIVED

We can't find an active application page for this role right now. It may reopen or be listed elsewhere. Use Next Steps to search for an active apply link and similar live jobs.

job Description:We are seeking a Site Reliability Engineer (SRE) with 5 to 7 tears, with strong to expert-level knowledge of the AWS ecosystem to support and operate highly available, scalable cloud platforms. The role requires hands-on expertise across core AWS services, including Kafka, Redis, CloudWatch, Kubernetes (EKS), EC2, Secrets Manager, Route53, Lambda, RDS, DynamoDB, and AWS Transfer Family.The candidate will be responsible for ensuring system reliability, performance, and observability in a production environment, with a strong emphasis on monitoring, automation, and infrastructure scalability. Deep experience with Grafana for metrics and dashboards, Terraform for infrastructure as code, and Docker-based containerization is required.The ideal candidate is comfortable working in fast-paced production environments, proactively identifying reliability risks, and implementing automated solutions to improve uptime and operational efficiency. Experience with automated build and deployment pipelines (CI/CD) is highly desirable, enabling continuous delivery and operational consistency across environments.