JOBSEARCHER

Data Engineer || Python + SQL + AWS/Databricks (Only W2)

A sustainability-focused AI research lab is adding a Data Engineer to build the data systems behind its forecasting, analytics, and LLM-driven research. You will turn fragmented sources into reliable, scalable datasets that power predictive modeling and scenario analysis.๐—–๐—ข๐— ๐—ฃ๐—”๐—ก๐—ฌPave Talent is hiring on behalf of our client, a well-funded research institute working at the intersection of AI, energy systems, and decarbonization strategy. The team blends machine learning, energy modeling, and real-world data to inform decisions on electrification, EV adoption, grid interaction, and circularity. Based in the U.S. with a global scope.๐—ง๐—›๐—˜ ๐—ข๐—ฃ๐—ฃ๐—ข๐—ฅ๐—ง๐—จ๐—ก๐—œ๐—ง๐—ฌYou will define how data flows across the team and partner directly with ML researchers, technical leads, and research scientists. This is a hands-on build role in a fast-moving research environment where rapid experimentation matters as much as clean engineering.- Design, build, and maintain scalable pipelines for structured, semi-structured, and unstructured data- Develop data models and datasets for predictive modeling, scenario analysis, and LLM-based workflows- Improve data tooling and automation to speed up prototyping and research iteration- Integrate third-party APIs, external datasets, and domain-specific global data sources- Set standards for data quality, lineage, governance, and reproducibility- Support exploratory data analysis to validate assumptions, find data gaps, and improve model inputs- Partner with internal and external stakeholders on secure data access and governance๐—ค๐—จ๐—”๐—Ÿ๐—œ๐—™๐—œ๐—–๐—”๐—ง๐—œ๐—ข๐—ก๐—ฆ๐—ฅ๐—ฒ๐—พ๐˜‚๐—ถ๐—ฟ๐—ฒ๐—ฑ:- Bachelor's degree in a quantitative field (engineering, computer science, data science, or related)- 3 to 5 years in data engineering or software engineering with a strong data focus- Strong proficiency in Python, SQL, Unix tooling, and Git-based workflows- Proven track record building pipelines across heterogeneous data sources- Experience with cloud services, preferably AWS and Databricks- Experience integrating external APIs and third-party datasets- Experience with enterprise big data, ETL frameworks, and data warehousing concepts- A background in analytics, experimentation, or statistical analysis๐—•๐—ผ๐—ป๐˜‚๐˜€ ๐—ฃ๐—ผ๐—ถ๐—ป๐˜๐˜€:- Experience with automotive, manufacturing, mobility, or energy systems data- Experience with AI/ML model development, including LLM-driven or generative/agentic AI workflows๐—–๐—ข๐— ๐—ฃ๐—˜๐—ก๐—ฆ๐—”๐—ง๐—œ๐—ข๐—ก ๐—”๐—ก๐—— ๐—•๐—˜๐—ก๐—˜๐—™๐—œ๐—ง๐—ฆ๐—ฅ๐—ฎ๐˜๐—ฒ: $70 to $95 per hour, depending on experience๐—Ÿ๐—ผ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป: On-site in Los Altos, CA. Must be local or able to commute.๐—ช๐—ผ๐—ฟ๐—ธ ๐—ฎ๐˜‚๐˜๐—ต๐—ผ๐—ฟ๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป: Must be authorized to work in the U.S. No visa sponsorship or corp-to-corp.Interested? Apply via LinkedIn and we'll be in touch. Confidential search; your application is fully private.๐—ฃ๐—ฎ๐˜ƒ๐—ฒ ๐—ง๐—ฎ๐—น๐—ฒ๐—ป๐˜ | ๐—›๐—ถ๐—ฟ๐—ถ๐—ป๐—ด ๐—ฅ๐—ฒ๐—ถ๐—บ๐—ฎ๐—ด๐—ถ๐—ป๐—ฒ๐—ฑ