Jr Data Engineer - Onsite - W2
Houston, TX
April 1st, 2026
Job Title: Data Engineer (Python / Spark / AWS) on W2
Location: Richmond, VA / McLean, VA / Dallas, TX (onsite, locals only)
Duration: Long-term contract
Interview Process: Internal screening round followed by an in-person (face-to-face) interview in VA or Dallas, TX

Job Summary
We are seeking talented and experienced Data Engineers with strong expertise in Python, Spark (PySpark), and AWS to contribute to large-scale data modernization and analytics initiatives. The selected candidates will design, develop, and optimize data pipelines and cloud-based data platforms that power enterprise reporting, analytics, and machine learning solutions. This role provides an excellent opportunity to work in a fast-paced, cloud-first environment leveraging modern AWS data technologies.

Key Responsibilities
- Design, build, and maintain ETL / ELT pipelines and data ingestion workflows using Python and Spark (PySpark).
- Develop and manage data solutions using AWS services such as S3, Glue, EMR, Redshift, Lambda, and Athena.
- Implement efficient data modeling, schema design, and partitioning strategies for data lakes and warehouses.
- Optimize Spark jobs for performance, scalability, and cost efficiency.
- Collaborate with data science, analytics, and application teams to deliver reliable and clean data.
- Establish data quality checks, validation frameworks, and observability mechanisms.
- Ensure adherence to data governance, lineage, and security standards.
- Participate in code reviews, documentation, and continuous improvement initiatives.

Required Skills
- Strong programming skills in Python (including Pandas and PySpark).
- Hands-on experience with Apache Spark / PySpark for distributed data processing.
- Proficiency with AWS data services: S3, Glue, EMR, Lambda, Redshift, and Athena.
- Strong SQL skills and understanding of data modeling and schema design.
- Experience with workflow orchestration tools such as Airflow or AWS Step Functions.
- Proven ability in ETL optimization, performance tuning, and pipeline monitoring.
- Knowledge of data governance, lineage, and enterprise data management best practices.