Sr. Observability Engineer (Remote)
12-month contract to hire
Fully Remote

Optomi, in partnership with one of our premier clients, is seeking a Senior Observability Systems Engineer to help design, implement, and optimize enterprise observability and monitoring solutions across large-scale cloud environments. This role partners with infrastructure, application, and platform engineering teams to improve logging, monitoring, automation, and operational visibility for modern application and infrastructure services.

Requirements
- 7+ years of hands-on infrastructure, systems engineering, or platform engineering experience
- Strong experience with JavaScript and/or TypeScript
- 5+ years developing applications and services in AWS environments
- Experience supporting and developing Java-based applications
- Experience with deployment and automation tools such as Git, Terraform, Harness, and NPM
- Hands-on administration experience with observability platforms including Dynatrace, Splunk, or Cribl
- Experience building dashboards, alerts, and reporting using DQL and/or SPL
- Experience integrating observability platforms with enterprise IT operations tools (e.g., ServiceNow, BigPanda, ReadyAPI)
- Scripting experience with Python and/or PowerShell
- Experience building scalable observability pipelines for logs, metrics, and traces
- Familiarity with Kubernetes environments (EKS/ACK), EC2, DocumentDB, and cloud-native monitoring patterns
- Strong understanding of Infrastructure as Code, automation, and configuration management
- Experience working in fast-paced enterprise environments with strong communication and stakeholder management skills

Responsibilities
- Design, deploy, and optimize observability solutions across cloud and containerized environments
- Build and maintain monitoring, logging, alerting, and reporting capabilities for infrastructure and applications
- Automate observability platform operations through scripting and Infrastructure as Code
- Onboard and integrate infrastructure and application data sources into monitoring platforms
- Support governance, operational standards, and platform optimization initiatives
- Partner with engineering and operations teams to improve reliability, visibility, and incident response capabilities
- Evaluate and implement enhancements across observability and automation tooling
- Provide technical leadership, operational reporting, and continuous improvement recommendations across enterprise environments