JOBSEARCHER

SRE Centric Engineer

AmpcusWildwood, NJMay 17th, 2026
Rate: SRE Centric EngineerLocation: New Jersey, NJ (Onsite)Type: Contract Required Qualifications8+ years in Site Reliability Engineering, Production Engineering, or equivalent roles. Deep expertise in distributed systems, resilience engineering, and large‐scale production operations. Strong proficiency with observability stacks:Metrics, logs, tracesSplunk, ELK, New Relic, synthetic monitoring, APMAdvanced experience with service‐level objectives (SLOs), SLIs, error budgets, and reliability governance. Expertise in Kubernetes, container orchestration, and workload reliability patterns. Strong skills in incident management, on‐call response, war‐room leadership, and RCA methodologies. Proven ability to engineer automation/self‐healing systems (auto‐remediation, failure‐mode detection). Strong scripting/automation skills in Python, Bash, or similar languages. Solid understanding of traffic distribution, load balancing, session handling, and failure isolation. Expert debugging and performance troubleshooting across the full stack (network, compute, services). Experience with AWS (EKS/ECS, SQS/SNS, S3, CloudFront, etc.). Preferred QualificationsExperience implementing AIOps, alert correlation, noise reduction, or automated RCA frameworks. Background in building paved paths, golden templates, or policy‐as‐code reliability gates. Experience with reverse proxy troubleshooting, including rate limits, affinity, and routing logic. Prior experience in high‐throughput government or regulated environments. Performance/load testing experience (designing tests, analyzing throughput, identifying bottlenecks). Strong understanding of release reliability, risk recording, and continuous deployment safeguards. Familiarity with monitoring‐as‐code or dashboards‐as‐code practices. Hands‐on experience with infrastructure‐as‐code (Terraform preferred). Experience range : 10 to 12 years