Data Engineer- GC/Citizens -Fulltime - US Remote
๐ผ Position OverviewWe are seeking Data Engineers (all levels) to design, build, and manage robust data pipelines supporting analytics use cases within a Datalake or Lakehouse architecture.You will play a critical role in developing scalable solutions using Apache Airflow or Dagster to efficiently ingest, transform, and manage large volumes of data. As an early team member, youโll take ownership of backend components, establish engineering best practices, and deliver innovative solutions that create outstanding value for users.๐ฏ Key Responsibilities๐น Design & Implement Pipelines โ Build scalable and reliable pipelines using Apache Airflow or Dagster. ๐น Collaborate โ Work closely with Platform & Product teams on ingestion, transformation, and storage strategies. ๐น Data Modeling โ Develop and optimize schemas for analytics and reporting. ๐น Quality Assurance โ Ensure integrity, consistency, and governance across Data Warehouse, Data Lake, and Lakehouse. ๐น Optimize Workflows โ Troubleshoot and improve performance bottlenecks. ๐น Best Practices โ Contribute to standards, reviews, and engineering processes. ๐น Innovation โ Identify opportunities to improve scalability, reliability, and efficiency.โ
Success in This Role Requires๐ 3+ years in Data Engineering (focus on pipelines) ๐ Strong Python skills with Apache Airflow / Dagster ๐ Hands-on experience in Data Warehouse, Data Lake, Lakehouse ๐ Deep knowledge of ETL/ELT processes & orchestration ๐ Experience with AWS / Azure / GCP cloud data platforms ๐ Strong data modeling & schema design skills ๐ Excellent problem-solving & communication abilities ๐ US Citizen or Green Card holder๐ Ways to Stand Out๐ Experience with batch + streaming pipelines ๐ Advanced database schema design & scaling expertise ๐ Familiarity with Infrastructure as Code (Terraform, Pulumi, AWS CDK) ๐ Proven ability to align data engineering with business strategy