JOBSEARCHER

DevOps SRE

Key Responsibilities

- Enhance platform reliability, performance, and observability
- Build dashboards and alerts using APM tools (Splunk, ELK, Grafana, Prometheus, GCL)
- Proactively identify performance bottlenecks and system risks
- Support incident management and root cause analysis
- Collaborate with Engineering, Security, Networking, and Infrastructure teams
- Automate operational tasks using Shell scripting and DevOps tools
- Support CI/CD pipelines and release processes

Required Skills

- 8+ years of Software Engineering experience
- 4+ years in Site Reliability Engineering
- Strong experience with APM / monitoring tools (Splunk, ELK, Grafana, Prometheus)
- Experience with distributed systems, relational & NoSQL databases
- Knowledge of Redis, Memcache, MQ, Kafka
- Hands-on Shell scripting, Ansible (YAML)
- Experience with CI/CD tools (Git, Jenkins, UCD or similar)
- Experience with Kubernetes / OpenShift, PCF, AWS or Azure
- Tech stack: Java/J2EE, Spring Boot, Python, Kafka, Oracle, MongoDB