<Back to Search
Site Reliability Engineering Manager
San Jose, CAMarch 20th, 2026
Job Description:Mandatory to have working experience as SRE manager especially in Retail domain application support ( NOT CLOUD /DevOps)Must have working knowledge on SRE principles such as Logs, metrics, availability metrics, uptime, ticket tracking, e-com services, ITIL framework specifically on Alerts, Incident, change management, CAB, Production deployments, Risk and mitigation plan, SLA, SLI, SLOHands on experience in Monitoring, Logging, Alerting, Dashboarding, and report generation in any observability tools Prefer DataDog or other tools such as Splunk/Dynatrace/ELK/Grafana). This engagement is a customer using Dynatrace,Splunk, PagerDuty hence it is good to have this expertiseMandatory to have work experience in leading Level 2/Level 3 application support team based out of IND who provide 24x7 coverage.Should know how to gather & communicate SRE requirement from customers and define SRE roadmap.Working experience on how to gather requirements on health of applications, services to monitor, setting service levels.Must have good knowledge on eCommerce platforms in microservice architecture, Sterling OMS , Retail Applications like XStore.Should be able to lead P1 calls, brief about the P1 to customer, proactive in gathering leads/ customers into the P1 calls till RCA, PIR etc.Should have knowledge on building process , framework by following ITSM principles, SOP, runbooks, handling any ITSM platforms (JIRA/ServiceNow/BMC Remedy)Must know how to work with the Dev team, cross functional teams.Should be able to generate WSR/MSR by extracting the tickets from ITSM platforms, present to customers and client leaders.Manage overall SRE delivery, customer focus mindset , closely work with customer leaderships.Preferred:Be a client face at customer site collaborating with client leadership.Ability to clearly communicate and understand a technical idea/concept.Ability to work in a professional environment while interacting with peers and stakeholders, collaborating with offshore teams.Excellent written and verbal communications skills.Motivated, goal driven, influential, innovative, curious, and open minded, fun to work with, collaborator.Capability to work with people in different time zones.Ability to operate in a fast-paced, evolving environment and appropriately prioritize tasks, and keep abreast of the latest technology.Collaborate with cloud architecture, infrastructure team, project management team, and technology services, management team.Create and maintain detailed documentation.
Showing 200 of 41,156 matching similar jobs in Springbrook, ND
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Infrastructure Manager
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Senior Platform Engineers
- Site Manager- Industrial Maintenance
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Site Reliability Engineering (SRE) Automation and Orchestration Engineer
- Regional Ops Manager - Houston
- Senior Manager, Maintenance
- VP, Security Architecture & Secure Systems (Hybrid)
- Senior Full-Stack Engineer, Portal Platform
- Cloud Platform and DevOps Engineer
- Senior AI/ML Cloud Delivery Lead
- Senior Backend Software Engineer (New York City, Los Angeles, or San Francisco)
- Head of Infrastructure
- Principal/Senior Principal Engineer DevOps*
- IT Platform Engineer Senior
- Lead Software Engineer (Global Payment Network)
- Senior Cloud DevOps Engineer - 100% Remote
- Grouting Superintendent
- Maintenance Manager
- Mgr Production
- Sr. Network Engineer L4
- Systems Engineer
- Site Reliability Engineer [Hybrid]
- Outside Plant Field Engineer (Ames, Iowa)
- Cloud Infrastructure Engineer
- Senior Software Engineer (Global Payment Network)
- AIML Engineer
- Maintenance Manager - Hiring Immediately
- Operations Maintenance Manager
- Field Performance Advisor
- Senior Gen AI Cloud Engineer
- Platform Engineer IV
- Sr DevOps Cloud Engineer
- Production Manager
- Chief Technology Officer Orange County 4 5x a week
- Maintenance Engineer
- Head of Infrastructure