AI Data Engineer
We have partnered with a leading technology research organization to hire an AI Data Engineer. In this role, you will build scalable data pipelines, partner closely with Data Scientists and ML Engineers, and ensure the organization's AI/ML models are fueled by high-quality, well-structured data. This is a fully remote opportunity to contribute to impactful AI initiatives that support scientific innovation, clinical solutions, and operational excellence.

About the Role

As an AI Data Engineer, you will support data science model validation, analytics workloads, and machine learning operations by building high-quality feature tables, analytical datasets, and automated workflows. You'll collaborate with senior data staff, product owners, and AI/ML scientists to deliver reliable data assets that enhance model performance and accelerate R&D innovation.

You will work across core data streams, including discovery, imaging, clinical, and operational, and contribute to the pipelines that power next-generation AI products in veterinary and animal health.

Top Required Skills

- SQL (advanced)
- Python
- R

Nice-to-Have Skills

- dbt Core
- Databricks
- Data analysis experience

What You'll Do

- Build scalable, reliable, distributed data pipelines to support machine learning operations and analytics workloads.
- Partner with data scientists, ML engineers, analysts, and data product owners to understand requirements and deliver high-quality solutions.
- Work with modern cloud and ML stacks, including Databricks, Snowflake, AWS, and Azure.
- Use Databricks (pipelines, workflows, asset bundles) to streamline engineering processes.
- Apply dbt Core for transformations, documentation, testing, and semantic consistency.
- Maintain code quality using SQL/YAML linters (SQLFluff) and enforce standards through GitHub Actions CI/CD.
- Develop solutions for data quality issues such as missing, duplicate, and inconsistent data.
- Contribute to data warehouse, data lake, data lakehouse, and data mesh architectural patterns.
- Build pipelines in Python to integrate diverse data types: structured tables, text documents, images, and more.
- Implement CI/CD systems and IaC tools like Terraform or AWS CloudFormation.
- Support data systems across the full lifecycle: exploration, production, monitoring, disaster recovery, and optimization.
- Stay current on advanced data engineering practices, including emerging technologies like Generative AI.

What You Bring

You have a relevant technical degree and at least four (4) years of Data Engineering experience.

You are experienced with:

- Cloud platforms (preferably AWS)
- Git and Git-based workflows
- dbt Core and modern data modeling
- SQL and NoSQL databases
- Cloud object storage (e.g., S3)
- Building, testing, and maintaining fault-tolerant data pipelines
- Understanding data architecture concepts: warehouse, lake, lakehouse, mesh

You're also eager to deepen your knowledge of AI/ML techniques, and it's a plus if you have:

- Certifications in data engineering or AI/ML

Leveling Guide (Intermediate)

- Build metadata and schemas based on logical models
- Write scripts for physical data layout and load test data
- Design and validate schemas
- Use ER modeling tools for intermediate tasks
- Adhere to data governance, naming conventions, testing principles
- Resolve moderately complex data problems
- Provide SQL and Python scripts for tuning and validation
- Contribute independently to team projects and semantic layer enhancements
- Suggest improvements to standards and processes
- Take new perspectives on solving moderately complex problems

Why This Role Matters

- The performance of AI/ML models
- The accuracy, reliability, and timeliness of analytics
- The innovation of new data streams from R&D pipelines
- The quality and discoverability of curated datasets
- How the organization advances clinical AI technologies

Join Us

If you are an analytical, collaborative, and forward-thinking AI Data Engineer looking for a remote opportunity that combines modern data engineering with applied machine learning, we encourage you to apply.
Your expertise will help shape the next generation of AI-driven products and scientific innovation.

Seniority level: Associate
Employment type: Contract
Job function: Information Technology