Big Data Developer (Python/Pyspark) - NJ
Job#: 3031052Job Description: Big Data Developer (Python/Pyspark)Location: Edison, NJ (Hybrid, 3 days onsite per week)Employment Type: Contract to hireRole OverviewWe are seeking an experienced Big Data Developer with strong Python expertise and hands-on experience building scalable data processing pipelines. The ideal candidate will have deep knowledge of distributed compute engines such as Apache Spark, modern lakehouse architectures, and real-time streaming platforms. This role is part of a new AI project with the potential for team growth.Key ResponsibilitiesDesign, develop, and optimize data processing pipelines using Apache Spark and PySpark.Build and maintain batch and streaming data loads for large-scale data platforms.Implement robust data models and storage layers using Lakehouse/Object Store technologies.Work with modern table formats such as Apache Iceberg or Delta Lake to support scalable data operations.Develop SQL-based transformations and ensure data quality, consistency, and performance.Integrate and manage Kafka for real-time data ingestion and event streaming.Collaborate with cross-functional teams including data engineering, analytics, and platform engineering.Participate in architectural discussions, code reviews, and performance tuning.Required QualificationsExperience: 6+ years of experience as a Big Data Developer or Data Engineer.Technical SkillsStrong hands-on experience with Python for data processing and automation.Expertise with Apache Spark and PySpark for creating data loads and building scalable ETL/ELT pipelines.Experience with Data Layer technologies including Lakehouse/Objectstore, Iceberg/Delta, and SQL.Practical experience with Kafka for distributed messaging and streaming.A solid understanding of distributed systems, data partitioning, and performance optimization.Preferred QualificationsExposure to Agentic AI frameworks such as LangGraph, LangChain, or A2A (Agent-to-Agent).Experience with cloud platforms (Azure, AWS, or GCP).Familiarity with data orchestration tools (Airflow, Dagster, Prefect).Knowledge of CI/CD practices and containerized environments.Compensation & BenefitsThe pay rate for this position is $63.00 per hour on Apex W2. Please note that the pay rate may be negotiable based on experience and skills. A benefits package is available to eligible employees.The employer is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability. The employer will not discriminate on the basis of disability and will consider qualified applicants with criminal histories in a manner consistent with applicable law.J-18808-Ljbffr