- UpvoteDownvoteShare Job
- Suggest Revision
As a Site Reliability Engineer (SRE) you will employ software engineering to automate critical IT operations tasks, including production system management, change management, and incident response.
$137,800 a yearFull-timeRemoteExpandApply NowActive JobUpdated 4 days ago - UpvoteDownvoteShare Job
- Suggest Revision
The WEX Site Reliability Engineering (SRE) team is looking for individuals passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance.
RemoteExpandApply NowActive JobUpdated Yesterday - UpvoteDownvoteShare Job
- Suggest Revision
Secrets and configuration management · Monitoring systems and services, providing incident and emergency response to triage and resolve system or client issues · Management of the application ecosystem improving platform infrastructure and applications with high reliability, resiliency, performance, and quality · Supporting documentation, knowledge articles, and runbooks · Designing, building, and Implementing SRE patterns that adhere to our client’s security Top Qualifications: 1.
ExpandApply NowActive JobUpdated 14 days ago - UpvoteDownvoteShare Job
- Suggest Revision
Your responsibilities will span both Cloud SRE and on-premise systems administration, with a focus on managing the stability of advanced GPU services and optimizing monitoring, alerting, and incident response frameworks, guided by Service Level Indicators (SLIs) and Objectives (SLOs.
Full-timeExpandApply NowActive JobUpdated 1 month ago - UpvoteDownvoteShare Job
- Suggest Revision
You will have the opportunity to learn and contribute to various aspects of site reliability engineering including monitoring, automation, incident response, and infrastructure optimization while making contributions that will help make Client users experiences better.
$135,000 a yearFull-timeExpandApply NowActive JobUpdated 1 month ago - UpvoteDownvoteShare Job
- Suggest Revision
Develop and enforce SRE best practices, including incident response, post-mortem analysis and capacity planning. Use monitoring data to drive actionable insights and contribute to incident response strategies.
ExpandApply NowActive JobUpdated Yesterday - UpvoteDownvoteShare Job
- Suggest Revision
Participate in incident response and troubleshooting. 2+ years of hands-on experience as a Site Reliability Engineer or equivalent role. Participate in 24x7 Site Reliability rotations and escalation workflows.
RemoteExpandApply NowActive JobUpdated Yesterday - UpvoteDownvoteShare Job
- Suggest Revision
You will help ensure swift incident response and scalable emergency handling, fostering greater reliability and resilience in managing complex systems. System Reliability and Incident Management: Ensure the reliability, availability, and performance of services.
$137,800 a yearFull-timeRemoteExpandApply NowActive JobUpdated Yesterday - UpvoteDownvoteShare Job
- Suggest Revision
Non-sales roles are typically eligible for a quarterly or annual bonus based on their role and applicable plan. Experience with Cloud Computing platforms (AWS, Azure, GCP) Experience with one or more of the following languages: C#, Java, GoLang, Python.
$122,000 a yearFull-timeRemoteExpandApply NowActive JobUpdated Yesterday - UpvoteDownvoteShare Job
- Suggest Revision
Cyber Security and Incident Response: Develop and implement OT Cyber Security and Incident Response Plans. Our client is seeking a skilled SCADA Engineer to configure, maintain, and support their SCADA and Load Management infrastructure.
ExpandApply NowActive JobUpdated Today - UpvoteDownvoteShare Job
- Suggest Revision
As a Staff Site Reliability Engineer (SRE), you will be playing a pivotal role in ensuring the reliability, scalability, and performance of our cloud-based services. Minimum 12 years experience as a Site Reliability, DevOps, or Software Engineer with proficiency in one or more high-level languages (such as Python, GoLang, Ruby, Java, or JavaScript) required.
$220,000 a yearFull-timeExpandApply NowActive JobUpdated 1 month ago - UpvoteDownvoteShare Job
- Suggest Revision
Respond to incidents coordinated by SRE and Incident Response teams. As a Sr. Site Reliability Engineer II, you are instrumental in helping make our Petabyte scale Kubernetes-centric ProArchive application resilient.
$160,000 a yearFull-timeExpandApply NowActive JobUpdated Today - UpvoteDownvoteShare Job
- Suggest Revision
As a Sr. Site Reliability Engineer, you are instrumental in helping make our Petabyte scale Kubernetes-centric ProArchive application resilient. Site Reliability Engineer III - Kubernetes Administration.
$145,000 a yearFull-timeExpandApply NowActive JobUpdated 3 days ago - UpvoteDownvoteShare Job
- Suggest Revision
Expertise using Infrastructure as code (IaC) and deployment automation with tools such as Terraform, Helm, Gitlab or equivalent. Write and maintain infrastructure as code for core systems (terraform, terraform modules and kubernetes helm charts); build and maintain CI/CD pipelines.
$220,000 a yearFull-timeExpandApply NowActive JobUpdated 1 month ago - UpvoteDownvoteShare Job
- Suggest Revision
You will have the opportunity to learn and contribute to various aspects of site reliability engineering including monitoring, automation, incident response, and infrastructure optimization while making contributions that will help make Apple users experiences better.
ExpandApply NowActive JobUpdated 1 month ago
site incident response jobs Title: site engineer
FEATURED BLOG POSTS
10 Importancies of Setting Realistic Goals
We’ve all heard how important it is to set professional and personal goals. Developing and establishing goals keeps us motivated and moving forward in life. But not all goals are created equal. If you’re chasing goals that are too lofty, you’ll end up disappointed when you cannot reach them. Setting goals that are achievable and measurable is the key to success.
Email Etiquette Principles - Why is it Important
Why is email etiquette important? Let's imagine you're hiring for a new role, and you’ve just received the email below.
10 Reasons HR is Important to an Organization
"Nothing we do is more important than hiring and developing people."
7 Importances of Organizational Culture and How to Build It
The world of work has drastically changed in the past few years. Where a good salary and a nice office might have been enough to attract talent in the past, employees today expect flexibility, growth opportunities, and a healthy work environment. In fact, 77% of applicants say they’d consider a company’s culture before applying for a job.
Collaborative Recruiting: The Key to a Better Talent Acquisition Strategy
Talent acquisition is a multi-stage process where candidates undergo various application steps before getting hired. The unfortunate reality is that it is a labor-intense system, with the hiring manager and recruiter often handling all of the work on their own. Ask any one of them, and you will hear about the overabundance of applications and the demanding task of filtering through them to find the best candidates. The quality of talent suffers under the weight of all that work on one person's hands. It's not easy, but as many companies are starting to realize, there is a better way. The future of talent acquisition lies in collaborative recruiting!
4 Talent Acquisition Trends Going Into 2023
For better or worse, a side effect of the COVID-19 pandemic was a marked shift in talent acquisition practices worldwide. With the struggle to retain talent that began in 2020, companies have had to rethink recruitment strategies. The result has been new talent acquisition trends that are well on their way to becoming commonplace. These are the practices that are going to become even more widespread going into 2023.
Why is Professionalism Important & How to Be Professional
You might have heard the word professionalism thrown around in the workplace, but do you know what it means? And do you know how to maintain professionalism no matter the circumstances?