<Back to Search
Sr Observability Engineer
Holmdel, NJMarch 26th, 2026
Software Guidance & Assistance, Inc., (SGA), is searching for an Sr Observability Engineer for a CONTRACT assignment with one of our premier Insurance services clients in Holmdel, NJ or Bethlehem, PA.Responsibilities:We are seeking a dedicated and detail oriented Senior Observability Engineer with expertise in Splunk, App Dynamics, Open Telemetry and Zenoss to join our Enterprise Observability Engineering team. The ideal candidate will be responsible for the administration, configuration, and maintenance of our observability tools to ensure optimal performance and reliability of our IT systems.Administer and configure Splunk, AppDynamics, OTEL and Zenoss platforms to meet organizational monitoring needs.Perform regular updates, patches, and upgrades to observability tools to ensure they are up-to-date and secure.Continuously monitor the health and performance of the Splunk, APPD and Zenoss systems.Ensure data integrity and availability within the observability platforms.Provide support to internal users, assisting with troubleshooting and resolving issues.Develop and deliver training sessions for users to effectively utilize the monitoring tools.Create and manage dashboards, reports, and alertsWork with stakeholders to define monitoring requirements and implement appropriate alerting mechanisms.Manage the onboarding, Alert creation.Optimize system performance by tuning configurations and managing resource utilization.Maintain comprehensive documentation of configurations, processes, and procedures.Develop and enforce best practices for monitoring and observability within the organization.Collaborate with IT and DevOps teams to ensure comprehensive monitoring coverage.Participate in incident response efforts, using observability data to assist in troubleshooting and resolution.Required Skills:Bachelor's degree in Computer Science, Information Technology, or a related field.Minimum of 5–7 years in Observability/Monitoring/Site reliability engineering with a focus on Splunk, AppDynamics and Zenoss.Proven experience in Implementing, Managing and Maintaining observability tools.Proficiency in Splunk and AppDynamics (including configuration, administration, and implementation).Proficiency in Zenoss (including setup, configuration, and maintenance).Strong in MELT, Metrics, Events, Logs and Traces; hands-on troubleshooting and supportOpenTelemetry: instrumentation patterns, context propagation, collectors, sampling etcMaintain platform reliability, upgrades, patching, and security hardeningExposure to Kubernetes observability (cluster/workload metrics, events, service discovery)Strong knowledge of IT infrastructure, applications, and networking.Experience with scripting and automation tools (e.g., Python, Bash).Familiarity with cloud environments (e.g., AWS, Azure) is required.Excellent problem-solving and analytical skills.Strong communication and collaboration abilities.Ability to work independently and in a team-oriented environment.Preferred Skills:Experience with other monitoring and observability tools (e.g., Prometheus, Grafana).Knowledge of DevOps practices and CI/CD pipelines.Hands-on Infrastructure-as-Code (Terraform/Ansible) and Git-based workflowsSGA is a technology and resource solutions provider driven to stand out. We are a women-owned business. Our mission: to solve big IT problems with a more personal, boutique approach. Each year, we match consultants like you to more than 1,000 engagements. When we say let's work better together, we mean it. You'll join a diverse team built on these core values: customer service, employee development, and quality and integrity in everything we do. Be yourself, love what you do and find your passion at work. Please find us at .SGA is an Equal Opportunity Employer and does not discriminate on the basis of Race, Color, Sex, Sexual Orientation, Gender Identity, Religion, National Origin, Disability, Veteran Status, Age, Marital Status, Pregnancy, Genetic Information, or Other Legally Protected Status. We are committed to providing access, equal opportunity, and reasonable accommodation for individuals with disabilities in employment, and our services, programs, and activities. Please visit our company to request an accommodation or assistance regarding our policy.#LI-AK1
Showing 100 of 35,641 matching similar jobs in Springbrook, ND
- Cloud Platform Engineer - Contract
- AWS Cloud Engineer
- Platform EngineerBlacksburg, VAMarch 26th, 2026
- Dynatrace Engineer
- Senior Platform Engineer
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Senior Platform Engineers
- Cloud Platform and DevOps Engineer
- Senior Full-Stack Engineer, Portal Platform
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Principal/Senior Principal Engineer DevOps*
- IT Platform Engineer Senior
- Senior Cloud DevOps Engineer - 100% Remote
- Site Reliability Engineer [Hybrid]
- Sr DevOps Cloud Engineer
- Lead Engineer
- Production/Application Support Engineer – Backend (Java/Python + SQL)
- Cloud AWS Engineer(Level IV)
- Geotechnical Engineer
- Cloud Engineer
- Sr. Staff Site Reliability (SRE) / DevOps Engineer
- Site Reliability Engineer (SRE) - AI Infrastructure
- Senior Technology Site Reliability Engineer
- Senior Data Platform Engineer — Scalable AWS Data Pipelines
- Middleware Engineer
- FULL TIME Lead Platform Engineer with Python Programming, AWS Cloud & Observability experience - HYBRID ONSITE (DIRECT HIRE)
- Senior Technology Site Reliability Engineer
- Senior Technology Site Reliability Engineer
- Senior Technology Site Reliability Engineer
- SRE
- SRE
- Vice President, Cloud Platform Engineer - Cloud Modernization
- Sr SRE
- Alibaba Cloud-SRE of Container Service-Bellevue