AI/ML Engineer
Senior AI/ML Engineer, Customer Data Platform

CDP MISSION: Our mission is to be the authoritative source of truth for customer data, delivering timely, high-quality data at scale to power the contextual experiences that drive the growth of this company. Every customer profile must be accurate, trusted, and available when it matters, across every touchpoint, for the entire US adult population.

Job Overview

We are seeking a Senior AI/ML Engineer to lead the design and development of the advanced AI systems that make our Customer Data Platform (CDP) the authoritative source of truth for customer data, covering the entire US adult population.

This role owns the intelligence layer of CDP: production-grade identity resolution at massive scale, and LLM-powered interfaces that make trusted customer data accessible to every stakeholder in the organization. You will architect systems that resolve billions of customer records into accurate, unified profiles, and build the natural language interfaces that let business users query and understand that data without writing SQL.

You will drive architecture decisions, define best practices, and lead the development of systems where accuracy, trust, and timeliness are non-negotiable.

Job Responsibilities: Identity Resolution

- Design and lead end-to-end identity resolution architecture, combining probabilistic models, ML, and embedding-based techniques to build the authoritative customer identity graph
- Build and optimize large-scale entity matching systems across billions of records and multiple data domains, ensuring every US adult is accurately represented in CDP
- Architect advanced candidate generation and blocking strategies (LSH, phonetic encoding, semantic similarity) that balance precision with computational feasibility at population scale
- Design high-precision matching pipelines using ensemble approaches (rules + ML + LLM-based validation) to maximize accuracy of golden customer profiles
- Develop scalable clustering and graph-based approaches for unified customer identity resolution with clear confidence scoring and auditability
- Lead implementation of embedding pipelines and similarity search systems using transformer models for semantic-level identity matching

Job Responsibilities: AI/LLM

- Architect and build LLM-powered systems for entity resolution, including zero-shot and few-shot classification workflows that handle edge cases traditional models miss
- Design and implement RAG-based architectures for enriching and contextualizing customer data from unstructured sources
- Lead development of NLQ-to-SQL platforms, enabling business users to query CDP, the authoritative source of truth, using natural language
- Translate ambiguous business questions into structured queries with schema awareness, semantic layers, and guardrails that protect data integrity
- Define best practices for prompt engineering, evaluation, and LLM observability, ensuring AI outputs meet the trust standards CDP demands
- Design and optimize vector search architectures (Pinecone, Qdrant, pgvector) for large-scale retrieval across customer data
- Evaluate and integrate emerging frameworks such as LangChain, LangGraph, and agentic workflows where they strengthen CDP capabilities

Education and Work Experience

- Bachelor's or Master's degree in Computer Science, Data Science, or related field
- 6+ years of experience in ML/AI engineering
- Proven experience building production-grade entity resolution or identity graph systems at scale
- Experience designing LLM-based applications in enterprise environments with high accuracy and trust requirements

Technical Skills

- Advanced programming: Python
- Deep expertise in ML algorithms for similarity, classification, and clustering, particularly in identity resolution contexts
- Strong experience with transformer models, embeddings, and semantic search at population scale
- Hands-on experience with LLM APIs and orchestration frameworks
- Strong SQL and experience with distributed data processing (Spark, Dask)
- Experience with vector databases and ANN search systems (FAISS, Pinecone, etc.)
- Expertise in ML lifecycle management (MLflow or equivalent)
- Understanding of data governance, privacy, and security requirements for customer identity data

Knowledge, Skills, and Abilities

- Strong system design and architectural thinking for AI/ML systems at population scale
- Ability to balance precision, recall, and scalability in identity resolution systems, understanding that accuracy directly impacts CDP's authority as the source of truth
- Strong understanding of data semantics and customer domain modeling across diverse data sources
- Leadership in driving AI engineering best practices, standards, and quality benchmarks
- Ability to collaborate across data engineering, product, security, and business teams to deliver trusted customer intelligence

Licenses and Certifications

- At least 18 years of age
- Legally authorized to work in the United States

Travel

Travel Required: No

Location

Atlanta/Frisco