
Site Reliability Engineer (SRE) - CA

Client: Financial Services
Team: TBA
Job Title: Software Engineer 4 / Site Reliability Engineer (SRE)
Location: Concord, CA - Hybrid (3 days onsite; Mon & Tues preferred)
Contract Length: 12 months (possible extension or conversion)
Pay Rate: $79 - $85

Top Requirements:
- 5+ years of experience with observability and monitoring tools (Grafana, Splunk, ThousandEyes, AppDynamics)
- Experience with Kubernetes/OpenShift (OCP) and containerized environments
- Strong understanding of databases (Postgres, MySQL) and system monitoring/analysis

Plusses:
- Experience with object storage (S3, NAS)
- Ability to analyze and monitor network traffic end-to-end
- Experience building monitoring strategies and alerting frameworks
- Exposure to Skan.AI or similar third-party platforms
- Experience in enterprise-grade environments with governance and reliability standards

Job Summary:
In this contingent resource assignment, you may:
- Consult on complex initiatives with broad impact and large-scale planning for Software Engineering.
- Review and analyze complex multi-faceted, larger scale or longer-term Software Engineering challenges that require in-depth evaluation of multiple factors including intangibles or unprecedented factors.
- Contribute to the resolution of complex and multi-faceted situations requiring solid understanding of the function, policies, procedures, and compliance requirements that meet deliverables.
- Strategically collaborate and consult with client personnel.

Day-to-Day Responsibilities:
- Design and implement end-to-end observability and monitoring strategies for enterprise systems
- Build dashboards, alerts, and monitoring solutions using tools like Grafana, Splunk, and AppDynamics
- Monitor and analyze system performance, latency, and data flow across platforms
- Identify bottlenecks, thresholds, and performance issues across distributed systems
- Work with Kubernetes/OpenShift environments to monitor containerized applications
- Analyze network traffic and collaborate with networking teams to improve visibility
- Monitor and support databases (Postgres, MySQL) and storage systems (S3, NAS)
- Integrate third-party systems (e.g., Skan.AI) into enterprise monitoring frameworks
- Ensure reliability, availability, and performance of production systems
- Collaborate with global teams (including India) to troubleshoot and resolve issues
- Recommend and implement improvements to enhance system resilience and operational efficiency

EEO Employer
Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law.
If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at [email protected] or 844-463-6178.

Everforth Apex is a world-class IT services company that serves thousands of clients across the globe. When you join Everforth Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico.

Everforth Apex uses a virtual recruiter as part of the application process.