JOBSEARCHER

Digital Site Reliability Engineer- REMOTE

Ntt DataMemphis, TNMay 10th, 2026
Job DescriptionWe are seeking a highly skilled and experienced Reliability Engineer to join our team. The ideal candidate must have a strong background in technology, with specific expertise in Kubernetes, Gitlab, Dynatrace, GraphQL, Node, React, with a good understanding of CI/CD pipelines. The candidate must be comfortable with ambiguity, learning new things and have perseverance similar to "if at first I don't succeed, try and try again".ResponsibilitiesCollaborate with cross-functional teams to develop and maintain release architectures and monitor frameworks.Provide system design consulting and critical support to the development team prior to program launch.Identify and solve sophisticated performance and scaling issues, working with engineers to avoid bottlenecks and meet traffic demands.Mentor and guide team members, helping them grow in their roles.Identify and implement automation and monitoring tools to improve the efficiency and effectiveness of SRE processes.Take ownership of any critical incidents and work towards timely resolution and prevention of future occurrences.RequirementsFive (5) to Seven (7) years of professional experience in technology or a related field.Two (2) years of experience with Kubernetes/EKSTwo (2) years of experience with CI/CD pipelines.Two (2) years of experience with a sophisticated observability platform including RUM and APM.Good to Have RequirementsFamiliarity with reading and understanding JavaScript (Node.JS).Capabilities utilizing Dynatrace APM and RUM (other APM or RUM may be applicable) - Dynatrace Associate Certification is a plus.Intermediate to Advanced skills in BASH shell scripting, Python and DockerIntermediate skills with on-prem Gitlab CI pipeline creation, troubleshooting, and configuration of Gitlab CI.Preferred QualificationsSolve sophisticated performance and scaling issues, working with engineers to ensure that we avoid bottlenecks and meet traffic demands through organic growth and marketing events.Strong problem-solving skills and the ability to work in a fast-paced environment.Communicate effectively with stakeholders, including management, to provide updates, recommendations, and solutions for any SRE-related issues.Excellent communication and collaboration skills.Experience with Kubernetes/EKS and pod life cycle management including readiness and liveness checks.Experience with building and supporting CI/CD pipelines and production releases.Working knowledge of complex CDN cached website architectureBasic QualificationsMinimum 5 years Source Code Management (SCM) and DevOps-Containerization-EKS. Minimum 1 year Platform Administration-Monitoring-Dynatrace.NTT DATA is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status. For Pay Transparency information, please click here.This contact information is for accommodation requests only and cannot be used to inquire about the status of applications.J-18808-Ljbffr