JOBSEARCHER

Data Engineer || Python + SQL + AWS/Databricks (Only W2)

ARCHIVED


A sustainability-focused AI research lab is adding a Data Engineer to build the data systems behind its forecasting, analytics, and LLM-driven research. You will turn fragmented sources into reliable, scalable datasets that power predictive modeling and scenario analysis.

COMPANY

Pave Talent is hiring on behalf of our client, a well-funded research institute working at the intersection of AI, energy systems, and decarbonization strategy. The team blends machine learning, energy modeling, and real-world data to inform decisions on electrification, EV adoption, grid interaction, and circularity. Based in the U.S. with a global scope.

THE OPPORTUNITY

You will define how data flows across the team and partner directly with ML researchers, technical leads, and research scientists. This is a hands-on build role in a fast-moving research environment where rapid experimentation matters as much as clean engineering.

- Design, build, and maintain scalable pipelines for structured, semi-structured, and unstructured data
- Develop data models and datasets for predictive modeling, scenario analysis, and LLM-based workflows
- Improve data tooling and automation to speed up prototyping and research iteration
- Integrate third-party APIs, external datasets, and domain-specific global data sources
- Set standards for data quality, lineage, governance, and reproducibility
- Support exploratory data analysis to validate assumptions, find data gaps, and improve model inputs
- Partner with internal and external stakeholders on secure data access and governance

QUALIFICATIONS

Required:

- Bachelor's degree in a quantitative field (engineering, computer science, data science, or related)
- 3 to 5 years in data engineering or software engineering with a strong data focus
- Strong proficiency in Python, SQL, Unix tooling, and Git-based workflows
- Proven track record building pipelines across heterogeneous data sources
- Experience with cloud services, preferably AWS and Databricks
- Experience integrating external APIs and third-party datasets
- Experience with enterprise big data, ETL frameworks, and data warehousing concepts
- A background in analytics, experimentation, or statistical analysis

Bonus Points:

- Experience with automotive, manufacturing, mobility, or energy systems data
- Experience with AI/ML model development, including LLM-driven or generative/agentic AI workflows

COMPENSATION AND BENEFITS

Rate: $70 to $95 per hour, depending on experience
Location: On-site in Los Altos, CA. Must be local or able to commute.
Work authorization: Must be authorized to work in the U.S. No visa sponsorship or corp-to-corp.

Interested? Apply via LinkedIn and we'll be in touch. Confidential search; your application is fully private.

Pave Talent | Hiring Reimagined