JOBSEARCHER

GCP Cloud Site Reliability Engineer

Job Title: GCP Cloud Site Reliability Engineer (SRE)Location: Chicago, IL (Onsite Interview Mandatory)Duration: Long Term ContractInterview Mode: In-Person / OnsitePreference: Local Candidates OnlyJob DescriptionWe are seeking an experienced GCP Cloud Site Reliability Engineer (SRE) to join a high-performing infrastructure and cloud engineering team in Chicago, IL. The ideal candidate will have strong recent hands-on experience with Google Cloud Platform (GCP) and solid Python scripting/automation expertise. This role requires candidates who are comfortable attending an onsite interview and working in a hybrid/on-site environment.The SRE will be responsible for maintaining reliability, scalability, automation, monitoring, and operational excellence across cloud-native platforms and enterprise applications hosted on GCP.Required SkillsStrong recent experience with Google Cloud Platform (GCP)Hands-on experience in Site Reliability Engineering (SRE) or Production Support environmentsStrong programming/scripting experience using PythonExperience with:GCP Compute EngineKubernetes / GKECloud Monitoring & LoggingIAM & SecurityCloud NetworkingTerraform or Infrastructure as CodeExperience with CI/CD pipelines and deployment automationStrong understanding of:Incident managementRoot cause analysisMonitoring and alertingReliability engineering principlesHigh availability and scalabilityExperience with Linux/Unix administrationKnowledge of DevOps and cloud operational best practicesResponsibilitiesDesign, build, and maintain reliable and scalable cloud infrastructure on GCPAutomate operational and deployment tasks using Python and scripting toolsMonitor system health, performance, and availabilityTroubleshoot production issues and perform root cause analysisImprove platform reliability, observability, and operational efficiencyCollaborate with development, infrastructure, and security teamsSupport CI/CD and infrastructure automation initiativesImplement proactive monitoring and alerting solutionsParticipate in on-call rotation and incident response activitiesPreferred QualificationsExperience with Docker and Kubernetes orchestrationExposure to Terraform, Ansible, or similar automation toolsKnowledge of cloud security and compliance standardsExperience working in enterprise production environmentsGoogle Cloud certifications are a plusSoft SkillsExcellent communication and collaboration skillsStrong analytical and problem-solving abilitiesAbility to work in a fast-paced environmentSelf-motivated with strong ownership mindset