JOBSEARCHER

Software Site Reliability Engineer II (Remote/Hybrid PST)

Italent DigitalRemoteMay 22nd, 2026
Job DescriptioniTalent Digital is a leading, woman- and minority-owned global technology consulting company. We are seeking two(2) Software Site Reliability Engineer II to join our diverse and dynamic global team. This is a long term, ongoing opportunity to assist our Fortune 500 tech client in the Silicon Valley. Client is in the global financial technology platform that powers prosperity for the people and communities we serve. With approximately 100 million customers worldwide using a broad product portfolio.The roles may be remote or hybrid in the PST zone. The individual selected will be instrumental in helping us continue to deliver excellence to our base of leading global accounts.You will also interact closely with iTalent's Communities of Practice, expand your network, and grow your career. This is a unique chance to meet others who think differently and are passionate about challenging the status quo!Job Title: Software Site Reliability Engineer IIJob OverviewJoin the FinTech Payments Platform as a Software Engineer II. You will be a part of a trusted financial expert empowering financial prosperity for businesses and consumers through a convenient, powerful, AI-native fintech platform providing fast and easy access to funds at the time of need. Assisting in the process of millions of transactions every day across various payment methods.ResponsibilitiesBe the first level of support and handle and investigate incidents, production issues, and alertsIdentify, design and build tools that are focused on tooling work and observability work, ensuring high availability, scalability, and performance of our production systemsEnsure the highest standards for engineering design, implementation, and testingAccurately scope effort, identify risks and clearly communicate trade-offs with team members and other stakeholdersInvestigate production issues and provide valuable insights to the core teamsPursue and resolve complex technical problems and share key learningsStay aware of industry trends and make technology choices and strategic decisionsCollaborate closely with peers, cross-functional teams and business units to define, prioritize, sequence and scope business and functional requirements and drive results forwardRequired qualifications and skills2+ years of related experience with SRE/NOC teamSix Sigma experience (Green or Black preferred)Expert in one of the following: Automation, Monitoring tools, Cloud OperationsSolid AWS experienceSolid and comfortable with backend or full stack coding and scripting: strong experience with Java/J2EE, Go, Python, REST, SOAP, JSONSkilled in software development lifecycle processes. Experience with SCRUM and Agile DevelopmentKnowledge of current trends and best practices in the modern SaaS technology landscapeExperience in leveraging Amazon Web Services for building scalable applicationsHigh adaptability and flexibilityWork well under pressureHave a passion for working on systems that are highly reliable, maintainable, scalable, and secureHigh energy, self-starter with a positive mindsetSkills:Operational Excellence:Proactively identifies and resolves product stability issues, thereby improving quality and availabilityExpertise in designing and implementing advanced CI/CD and automation/resiliency concepts such as Progressive Rollouts and Failure Modes and Effects Analysis (FMEA)Identifies and drives resiliency, cost optimization, and process improvementsManages and performs on-call duties to ensure operational excellence and quick resolution of production incidentsSoftware Fundamentals:Writes and reviews code to eliminate complexity while ensuring security, scalability, performance, testability, resiliency, and maintainabilityExpert at diagnosing and resolving cross capability issues, with a focus on tooling and observabilityEnhances test coverage including unit tests, end-to-end tests, and integration tests to maintain production system robustnessExperience with metrics, monitoring and alerting tools such as Splunk, Wavefront, AppDynamics, Prometheus, and PagerdutyDesign and Architecture:Promotes standard practices for tooling, monitoring, and observabilityDevelops tools that focus on improving system observability, including metrics, logging, and tracingCommunications:Ability to convince people of their design, especially for tooling and observability solutions that ensure system reliability and performanceIs receptive to feedback from peers and acts accordingly, particularly in high-pressure incident resolution scenariosCollaborates with other team members to solve problems more effectively, emphasizing cross-functional collaboration during production incidentsDemonstrated ability to explain complex technical issues to both technical and non-technical audiencesPreferred qualifications and skillsPreferred ExperienceExperience with large-scale payment systemsEducationBachelor's DegreeCompany descriptionAbout iTalent Digital:A woman- and minority-owned digital consulting company, we celebrate individuals and diversity, cultivating a culture where our people can excel and lead balanced lives. Recruitment at iTalent is guided by an unwavering principle: Only hire the best. Because we have the best people, we have the privilege of working with the best clients, doing the best work, and effecting transformative change at work and in our communities.What you get:You get the chance to work with some of the best brands and high-performance teams out there! iTalent offers our W2 consultants’ excellent benefits such as medical, dental, vision, life insurance, paid holidays and PTO, and 401K + matching. We are growing and we want to see you grow!Log onto iTalentdigital.com to learn more about what working at iTalent can mean for you