
Big Data / Databricks Engineer

Role: Big Data / Databricks Engineer
Location: Texas City, TX (Onsite) or Chicago, IL
Must have: Databricks, Spark, Hive, AWS QuickSight, Python, Django

JD: Data Engineering & Big Data Development

Design and develop scalable, high-performance data pipelines using:
1. Databricks (PySpark/SQL)
2. Apache Spark (batch & streaming)
3. Hive (query optimization, partitioning, bucketing)
4. AWS EMR (PySpark jobs for large-scale data processing)
5. Azure Data Factory (ADF) for ingestion and pipeline orchestration

Build data processing frameworks to handle structured, semi-structured, and unstructured datasets.
Develop highly optimized ETL/ELT workflows using Spark, SQL, and Python.
Create curated data models (Bronze/Silver/Gold) using Databricks Delta Lake.

Optimize Spark transformations through:
1. Caching and checkpointing
2. Partition pruning
3. Adaptive query execution (AQE)

Build DBT models for:
1. SQL-based transformations
2. Automated testing
3. Lineage graphs
4. Data documentation to provide transparency across pipelines
