<Back to Search
Senior Research Engineer, Training Data Infrastructure in Foundation Models
Cupertino, CAApril 2nd, 2026
Our team is dedicated to solving the high-quality training data problem at the scale required to train advanced Foundation Models. We believe that the advanced model performance (including reasoning, coding, and agentic planning) fundamentally depends on a data-centric approach to Machine Learning. Our objective is to engineer a large-scale system that acquires, processes, and curates the data required to advance the state of the art in Artificial Intelligence. We are seeking a Senior Research Engineer who possesses a deep understanding of distributed systems and a strong intuition for Machine Learning. You will join a culture that values engineering craftsmanship, privacy, and rigorous scientific inquiry, utilizing advanced cloud technologies to build the data systems that powers our most capable models.This position operates at the convergence of Software Engineering and Machine Learning Research. Unlike traditional backend roles, this position requires you to design systems where the outcome is the statistical distribution and quality of data itself. You will work alongside Research Scientists to transform theoretical observations into concrete, scalable engineering solutions. Your core focus will be the architecture of our Data Acquisition, Processing, and Repository Management systems for Large Model training. You will lead technical efforts to enable active, quality-driven data curation, including filtering, deduping, synthetic data generation and data mixing, ensuring our models are trained on the highest-quality information available.Research Collaboration: Experience working within or closely with ML research organizations (e.g., as a Research Engineer), with an ability to translate research results into engineering implementations. Domain Knowledge: Familiarity with lifecycle of modern LLM training, end-to-end workflows, and underlying system architecture. Complex Data Types: Experience in processing complex data modalities beyond plain text, such as source code repositories, images, videos, and audios.Education: Bachelor's degree in Computer Science, Electrical Engineering, or Mathematics. Technical Expertise: 4+ years of software engineering experience with a specific focus on Data Infrastructure, Distributed Systems, or AI/ML Engineering. Language Proficiency: Expert fluency in Python, and strong competence in system languages such as C++. Cloud Architecture: Extensive experience architecting solutions on major public cloud platforms (e.g. GCP) to build scalable data systems (e.g. with Apache Beam, GCS) Performance Engineering: Deep experience profiling and optimizing high-throughput data systems. Demonstrated ability to debug distributed bottlenecks (e.g., stragglers, I/O saturation), optimize data formats and provide efficient data storage solutions.
1,286 matching similar jobs near Cupertino, CA
- Manager, AI & Automation
- Applied Sensing u0026 Health Software Engineer, Sensing u0026 Connectivity
- Machine Learning Engineer - Health AIML
- Senior Software Engineer - AI Agentic Product Dev
- AI Cluster Validation Engineer
- LLM & Agentic AI R&D Intern
- Knowledge Engineer / Semantic Expert for AI
- Forward Deployed Engineer Manager
- Sales Engineer
- Software Engineer II - AI/ML, AWS Neuron, LLM Inference, AI/ML, AWS Neuron, Model Inference
- Senior Perception Engineer
- Applied Scientist, Prime Video - Personalization and Discovery Science
- Software Dev Engineer III, IVS Real Time Video
- Sr. Worldwide Specialist - GenAI/ML, Data & AI GTM
- Senior Pre-Silicon SoC Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWS
- Applied Scientist II, Amazon Stores Economics and Science (SEAS)
- Pre-Silicon SoC Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWS
- Principal Software Engineer, Debug Tools
- Sr. Formal Verification Engineer, Annapurna Labs
- SoC Modeling & Simulation Sr. Manager, Annapurna Labs Machine Learning Accelerators, AWS
- AR/MR Graphics / Display Systems Engineer
- Software Engineer - Traffic (ASE)
- CAD Engineer, Silicon Learning and Static Timing Analysis
- Haptics Software Engineer
- Machine Learning/Generative AI Engineering Manager - Maps Search Query Understanding
- GenAI Research Engineer
- Machine Learning Validation Automation Engineer
- Apple Silicon GPU Driver Engineer - Performance, Graphics, Games, & ML
- Sharing Experiences and Frameworks Engineer
- Software Engineer - Traffic, JVM Frameworks
- Senior Staff ML Architect
- Swift Engineer, Find My
- Research Scientist / Engineer, Foundation Model Evaluation
- AI Solutions Engineer - Innovation Lab
- Senior Applied Scientist, UI Control Models
- Staff Machine Learning Rendering Engineer - Simulation, Special Projects
- Software Engineering Manager, ASE Storage Infrastructure
- Machine Learning Engineer - Crowdsourced Sensing (Sensing u0026 Connectivity)
- Senior Software Engineer - AI, Search u0026 Knowledge Platform - Traffic Infrastructure
- On Device ML Engineer, PyTorch Interoperability, Graphics, Games u0026 ML