Vice President AI/ML
Job Title: Vice President – AI/ML Engineer (Individual Contributor)Location: Jersey City, Dallas, Menlo Park, Seattle (Onsite)Employment Type: Full-TimeAbout the CompanyOur client is a leading global investment banking, securities, and investment management firm serving a diverse client base including corporations, financial institutions, governments, and high-net-worth individuals. Headquartered in New York, the firm has a strong global presence across major financial centers. The organization is known for its strong culture built on integrity, excellence, innovation, and teamwork, with a focus on delivering best-in-class solutions to its clients.Business Unit OverviewEnterprise Technology Operations (ETO) is part of the firm’s Core Engineering division, focused on delivering scalable production management services. The unit emphasizes operational excellence, automation, and risk reduction through advanced engineering and data-driven solutions.Within ETO, the Production Runtime Experience (PRX) team leverages software engineering, machine learning, and automation to improve monitoring, alerting, and operational workflows across large-scale systems.Team OverviewThe Machine Learning and AI team within PRX focuses on applying advanced Machine Learning and Generative AI (GenAI) to optimize large-scale infrastructure and application environments.By combining statistical modeling, anomaly detection, predictive analytics, and LLM-driven agentic systems, the team delivers scalable, reliable, and cost-efficient production management solutions.Role OverviewThis is a hands-on individual contributor role focused on designing and implementing GenAI-powered agentic solutions to improve production operations.You will work on building intelligent systems capable of diagnosing issues, reasoning through complex scenarios, and executing automated actions in large-scale production environments.Key ResponsibilitiesDesign and develop Agentic AI systems using LLMs, retrieval, and secure execution frameworksBuild and productionize LLM-based solutions, including RAG pipelines and evaluation frameworksIntegrate AI solutions with observability, incident management, and deployment systemsDevelop systems for automated diagnostics, remediation, and workflow optimizationCollaborate with engineering and operations teams to translate business needs into AI solutionsImplement governance, safety, and reliability mechanisms for AI systemsOptimize performance, scalability, and cost efficiency of AI applicationsDrive engineering best practices, design reviews, and continuous improvementQualificationsBachelor’s degree in Computer Science, Engineering, Applied Mathematics, or a related field (Master’s/PhD preferred)7+ years of experience in software engineering and/or machine learningRequired SkillsStrong programming experience in Python, Java, Go, or C/C++ (Python preferred)3+ years building and deploying production-grade machine learning systemsHands-on experience with Large Language Models (LLMs) including prompt engineering, fine-tuning, and RAG pipelinesExperience building agent-based AI systems and tool integrationsSolid understanding of machine learning algorithms, statistics, and data structuresExperience with REST APIs, distributed systems, and microservices architectureStrong problem-solving skills and ability to work in high-impact environmentsPreferred SkillsExperience with cloud platforms (AWS preferred)Familiarity with containerization and orchestration (Docker, Kubernetes)Experience with MLOps, model deployment, and monitoring frameworksExposure to financial services or large enterprise environmentsKey CompetenciesStrong analytical and problem-solving mindsetAbility to communicate complex concepts clearlyCollaborative and team-oriented approachHigh ownership and accountability