System Engineer
Role: System EngineerLocation: Irving, Texas(Hybrid)Responsibilities:Lead the resolution of complex, high-impact production issues, performing deep root cause analysis and implementing long-term fixes.Provide technical consultation and guidance to engineering teams on system design, architecture improvements, and operational readiness.Evaluate and implement system enhancements, platform optimizations, and design alternatives to improve performance and stability.Collaborate with cross-functional teams to ensure operational excellence, system resilience, and service reliability.Leverage AI-driven observability tools for anomaly detection, intelligent alerting, and noise reduction.Utilize automation frameworks and AI-assisted troubleshooting tools to streamline incident management and reduce MTTR.Incorporate predictive analytics and AIOps insights into capacity planning and proactive issue prevention.Ensure adherence to enterprise standards, governance frameworks, and risk/compliance requirements.Contribute to documentation, training materials, and knowledge sharing across teams.Required Skills:5+ years of experience in Systems Engineering, Site Reliability Engineering (SRE), or Production Support roles.Strong experience handling L2/L3 production support, including critical incident management and root cause analysis.Hands-on experience with:Databases: SQL Server, Oracle, MySQLOperating Systems: Unix / LinuxMonitoring Tools: AppDynamics, ITRS Geneos (or similar)Schedulers: Autosys or equivalentExperience with IT infrastructure controls, operational standards, and governance frameworks.Exposure to AI-driven monitoring / AIOps platforms in production environments.Familiarity with automation and scripting (e.g., Python, Shell).Strong analytical, problem-solving, and decision-making skills.Excellent communication and stakeholder management abilities.Preferred QualificationsExperience using Generative AI / LLM-based tools for diagnostics, log analysis, or documentation.Knowledge of test-driven development (TDD) and Agile practices.Experience designing or supporting large-scale distributed systems.Exposure to financial systems or accounting platforms.Industry certifications (e.g., ITIL, Cloud, or relevant technologies).RegardsPraveen Kumar Talent Acquisition Group – Strategic Recruitment Managerpraveen.r@themesoft.com| Themesoft Inc