<Back to Search
Jr Data Engineer - Onsite - W2
Houston, TXApril 1st, 2026
Job Title: Data Engineer (Python / Spark / AWS) on W2 Location: Richmond, VA / McLean, VA / Dallas, TX Onsite (LOCALS only) Duration: Long Term Contract Interview Process:: Internal Screening Round followed by an In-Person (Face-to-Face) at VA or Dallas Tx Job Summary We are seeking talented and experienced Data Engineers with strong expertise in Python, Spark (PySpark), and AWS to contribute to large-scale data modernization and analytics initiatives. The selected candidates will design, develop, and optimize data pipelines and cloud-based data platforms that power enterprise reporting, analytics, and machine learning solutions. This role provides an excellent opportunity to work in a fast-paced, cloud-first environment leveraging modern AWS data technologies. Key Responsibilities Design, build, and maintain ETL / ELT pipelines and data ingestion workflows using Python and Spark (PySpark). Develop and manage data solutions using AWS services such as S3, Glue, EMR, Redshift, Lambda, and Athena. Implement efficient data modeling, schema design, and partitioning strategies for data lakes and warehouses. Optimize Spark jobs for performance, scalability, and cost efficiency. Collaborate with data science, analytics, and application teams to deliver reliable and clean data. Establish data quality checks, validation frameworks, and observability mechanisms. Ensure adherence to data governance, lineage, and security standards. Participate in code reviews, documentation, and continuous improvement initiatives. Required Skills Strong programming skills in Python (including Pandas and PySpark). Hands-on experience with Apache Spark / PySpark for distributed data processing. Proficiency with AWS data services S3, Glue, EMR, Lambda, Redshift, and Athena. Strong SQL skills and understanding of data modeling and schema design. Experience with workflow orchestration tools such as Airflow or AWS Step Functions. Proven ability in ETL optimization, performance tuning, and pipeline monitoring. Knowledge of data governance, lineage, and enterprise data management best practices.
481 matching similar jobs near Houston, TX
- Senior GCP Data Engineer - PySpark & BigQuery Expert
- Oracle Cloud Analytics (FDI/FAW) - Manager (Houston)
- Senior Palantir Engineer
- Data Engineer
- Azure Data Engineering and BI Lead
- AI Engineer - Manager (Houston)
- AI&Data MDM Senior Consultant – Life Sciences
- Software Engineer - Edge & IoT Platform
- Senior Data Architect
- AI Data Engineering Manager (Houston)
- Senior Architect, Data and GenAI
- AWS Sales Engineer - Data & AI
- MLOps Engineer
- Senior Fabric Developer
- Senior AI/ML Solution Architect
- Data Analyst - Onsite -W2
- ETL Report Developer
- Lead Data Engineer
- 54257: ETL(Informatica) developer ///Inperson interview
- Senior Data Warehouse Consultant (SQL, Power BI, BO)
- Data Architect
- AI Data Engineering Manager
- AI Data Engineer, Manager - Tax Transformation
- Technical Data Architect
- AI Engineer - Manager
- Software Engineer - Data Engineering IV
- Jr Data Analyst - Onsite - W2
- Senior Informatica ETL Developer
- DW/BI Project Manager for Oil & Gas Analytics
- Senior Data Ops Engineer: Build & Scale Data Platforms
- Power BI Developer
- Forward Deployed Engineer
- Lead GenAI & Data Architect - Hybrid Role
- Associate Data Engineer - Cloud, AI & Data Pipelines
- Principal Data Platform Engineer
- Principal Data Analytics Developer (Supply Chain / Manufacturing)
- Obstetrics & Gynecology Physician
- Oracle MDM/CDM Solution Lead - Manager (Houston)
- Oracle - OFSAA Solution Architect - Manager (Houston)
- Engineer, Senior Operational Analytics