Incident Manager
Incident ManagerMount Laurel, NJ or West Chester PA Full-time Must Have Technical/Functional SkillsIncident Management, SRE and operations engineering, reliability architecture, Automation and observability, executive communicationRoles & ResponsibilitiesIncident Manager - Resources to provide technical leadership for enterprise wide, high severity incidents, problem investigations, and high risk changes, while shaping reliability strategy, governance, and operational standards across complex, distributed platforms.Drive Incident resolution management by directing cross functional teams through high impact outages, systemic problem resolution, and large scale change events.Creating scripts in ELK, Grafana, AppDynamics, COPAuto-executing predefined queries in ELK, Grafana, AppDynamics, COP for real-time issuesAttaching live query outputs (metrics, logs, traces) directly to alerts/incidentsEliminating manual tool navigation for IM and Alert teamsEnhancing alert systems with contextual intelligence, including metric deviations and anomaly trends, relevant log snippets and patterns, and identifying affected CIs and downstream impacts