Site Reliability Engineer (Level 2)
The Site Reliability Engineer (Level 2) is responsible for operating and enhancing the performance, availability, and reliability of cloud and on-premises infrastructure. This individual consults on more complex observability scenarios, streamlines server and batch operations, and contributes to the efficient management of data center resources. The role also consults with cross-functional teams to identify opportunities for process improvements, implement best practices, and support critical business operations.

Essential Tasks/Major Duties:
- Develop, implement, and maintain observability tools to monitor cloud and on-premises systems.
- Create dashboards, alerts, and reports to track system health, performance, and availability.
- Proactively leverage observability tools to identify opportunities for improvement.
- Analyze metrics and logs to identify trends, prevent potential issues, and optimize system performance.
- Collaborate with FinOps teams to monitor resource utilization and ensure cost-effective operations across cloud environments.
- Support the lifecycle of cloud and on-premises servers, including provisioning, patching, configuration, and decommissioning.
- Troubleshoot and resolve server-related issues, ensuring minimal downtime.
- Implement and enforce server security policies and compliance requirements.
- Schedule, monitor, and manage batch processes to ensure timely execution of critical tasks.
- Identify and resolve batch failures or delays, coordinating with relevant teams to ensure smooth operations.
- Optimize batch jobs for improved performance and resource utilization.
- Manage on-site and remote data center operations, ensuring proper functioning of hardware, power, cooling, and network infrastructure.
- Coordinate with vendors and service providers for hardware maintenance, replacements, and upgrades.
- Maintain accurate inventory of data center assets and ensure compliance with organizational standards.
- Participate in on-call rotations to address system incidents and outages promptly.
- Conduct root cause analysis and implement solutions to prevent recurrence of issues.
- Document and communicate incident resolution processes to relevant stakeholders.
- Work closely with cross-functional teams, including DevOps, Networking, and Application Development, to implement and maintain system integrations.
- Create and maintain comprehensive documentation for configurations, processes, and incident resolutions.
- Provide training and support to team members and other departments.

Knowledge, Skills & Abilities:
- Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent experience.
- 3 years of experience working with monitoring and observability tools (e.g., Datadog, PagerDuty).
- Certified Datadog Fundamentals or equivalent experience required.
- Certified PagerDuty Administrator or equivalent experience required.
- 3 years of experience in cloud operations or server management roles.
- Certified AWS SysOps Administrator or equivalent experience required.
- 3 years of progressive server administration experience (Windows, Linux).
- 3 years of experience in designing, implementing, and managing IT workload automation solutions to optimize scheduling, orchestration, and execution of enterprise workflows across on-prem and cloud environments.
- Experience leveraging artificial intelligence to drive innovation and solve complex problems. Demonstrated ability to utilize AI-driven solutions that optimize processes, enhance decision-making, or create transformative business outcomes.
- 3 years working with cloud platforms (AWS, Azure, OCI).
- Certified AWS Cloud Practitioner or equivalent experience required.
- Strong experience with data center infrastructure and knowledge of best practices.
- Proficiency in scripting and automation tools (Python, Bash, PowerShell).
- Strong understanding of networking and identity management in cloud environments.
- Working knowledge of security best practices and compliance standards.
- Working knowledge of agile methodologies.
- Excellent troubleshooting, problem-solving, and communication skills.

Salary/Rate: $50-$55/hr (depending on experience level). This is a contract position; candidates are expected to work 40 hours per week.

About The Company
Peterson Technology Partners (PTP) is an Equal Opportunity Employer committed to creating a transparent, inclusive, and human-centered hiring experience.

For more than 28 years, PTP has operated as one of the top IT staffing and recruiting firms in the USA, built on trust, long-term partnerships, and technical excellence.

Based in the Chicago suburb of Park Ridge, IL, our team of more than 500 employees and consultants is dedicated to:
- Helping every client make the best hiring decisions possible
- Matching professionals with the right IT jobs and career opportunities

As part of that commitment, we believe in providing clear information about how our hiring technologies work and how your data is used. The following section outlines our AI-assisted interview process and your rights as a candidate.

AI-Assisted Interview Experience (Pete & Gabi – Rebecca)
To provide a consistent, fair, and flexible experience for all candidates, we use AI-assisted tools to support parts of the interview process.
This includes our proprietary AI platform Pete & Gabi, which includes AI recruiter Rebecca.

These AI hiring tools help us:
- Conduct recorded video interviews
- Transcribe interviews
- Summarize candidate responses
- Generate job-related insights
- Streamline communication and scheduling

Please note that:
- The AI does NOT make hiring decisions; all decisions are made by our human recruiters, hiring managers, or client partners.
- The AI does not evaluate facial expressions, emotions, or physical traits; it is used only to support fairness, consistency, and efficiency.
- If you prefer a non-AI interview format, we will gladly provide an alternative.

Technical or Case Interviews (Role-Dependent):
When applying for certain tech jobs, you may participate in:
- A technical interview
- A coding challenge
- A case study
- A client-specific assessment

We will always explain what to expect in advance so you can prepare with confidence.

Human Review & Selection:
Every candidate's profile, including interviews, conversations, and assessments, is reviewed by experienced recruiters and hiring leaders. AI insights may assist with organization and evaluation, but final decisions are always human-driven.

Your Rights as a Candidate:
At PTP, every candidate has the right to:
- Request a non-AI interview path
- Ask how your data is being used
- Request access to transcripts or interview recordings
- Request deletion of your AI-recorded interview
- Receive clear, timely communication

Our goal is to ensure you feel respected, informed, and supported throughout your experience.

Our Commitment:
For more than 28 years, PTP has focused on putting people first: candidates, consultants, employees, and clients.

We're committed to a hiring process that is:
- Transparent
- Compliant
- Equitable
- Powered by innovative technology that enhances, not replaces, human judgment

Welcome to the future of hiring at Peterson Technology Partners. We're excited to learn more about you.

Equal Employment Opportunity:
Peterson Technology Partners is an Equal Opportunity Employer. All qualified applicants will receive consideration without regard to race, color, religion, national origin, gender identity, sexual orientation, disability, veteran status, or any other protected characteristic.