JOBSEARCHER

AIOps Engineer - INTL Mexico

Build AI agentic flows that automate incident response and operational tasks.Use LLMs to analyze alerts, logs, and SOPs, then decide the correct actions without human involvement.Replace repetitive, manual incident work with automation that follows existing processes.Improve system reliability through better alerting, observability, and automated remediation.Integrate AI-driven automation with monitoring, logging, and cloud services.Partner with SRE, DevOps, and platform teams to safely deploy and scale automation.Continuously improve automation based on real production signals and outcomes.$24-$28/hourWe are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.Required Skills & ExperienceStrong understanding of LLMs and how they can be used to make decisions, trigger actions, and automate workflows.Proven ability to turn manual SOPs and runbooks into automation, not just follow them.Strong experience with automation using Python (Go is also acceptable).Experience working in incident response, reliability, SRE, DevOps, or platform operations environments.Comfort working in cloud-native systems, especially GCP.Experience with production observability - knowing what signals matter, what's breaking, and why.Required Technical ExperienceGoogle Cloud Platform (GCP)Automation: Python (Go is acceptable)Observability:Google Managed Prometheus (GMP)Grafana EnterpriseLog configuration and analysisGoogle Cloud Services:Kubernetes (GKE)Cloud LoggingBigQueryPub/SubGoogle Cloud StorageGeneral understanding of Google networkingDeveloper Tools:GitHub CopilotGitHub Copilot for workflows