Python and Pyspark Developer
Healthcare Payer – Python and Pyspark Developer EXL Health is seeking a Python and Pyspark DeveloperThis is a full-time, Hartford, CT opportunity. Remote work is available. At EXL Health, we look and go deeper. What others may see as impossible; we welcome as a challenge. And we don’t rest until we find a better way. EXL Health uses Human Ingenuity as a catalyst to look and go deeper for improved outcomes. We combine deep domain expertise with analytic insights and technology-enabled services to transform how care is delivered, managed, and paid. Leveraging Human Ingenuity, we collaborate with our clients to solve complex problems and enhance their performance with nimble, scalable solutions. SummaryWe are seeking a highly motivated Python & PySpark Developer (3–6 years of experience) to design, develop, and optimize large-scale data processing solutions. The ideal candidate will have hands-on experience building ETL/ELT pipelines, working with distributed data processing frameworks, and delivering reliable, high-performance data solutions in cloud and on-prem environments.Key ResponsibilitiesData Engineering & Development: Design, develop, and maintain scalable ETL/ELT pipelines using Python and PySpark for batch and where applicable, streaming workloads.Big Data Processing: Work with large, complex datasets using Apache Spark to perform efficient transformations, aggregations, and data validations.Requirements Analysis: Collaborate with product owners, data analysts, and business stakeholders to translate business requirements into technical data solutions.Cloud & Platform Integration: Build and deploy data pipelines on cloud platforms such as AWS, Azure, or GCP, leveraging cloud-native services where appropriate.Performance Optimization: Tune Spark jobs and Python code for performance, scalability, and cost efficiency.Data Quality & Validation: Implement data quality checks, reconciliation logic, and error-handling mechanisms to ensure data accuracy and reliability.Documentation: Create and maintain technical documentation, including data flow diagrams, transformation logic, and operational runbooks.Agile Collaboration: Participate in Agile ceremonies, contribute to sprint planning, estimations, and code reviews, and ensure timely delivery of features.Required experience and skills· Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field · 3–6 years of hands-on experience in Python-based data engineering · Strong expertise in PySpark and Apache Spark concepts (RDDs, Data Frames, Spark SQL) · Solid understanding of ETL/ELT frameworks, data warehousing, and data modeling concepts · Proficiency in SQL and experience working with relational and analytical databases · Experience with Unix/Linux, shell scripting, and version control systems (Git) · Familiarity with cloud platforms (AWS, Azure, or GCP) and big data ecosystems · Understanding of Agile/Scrum methodologies and DevOps practices· Knowledge of Gen Ai is plus.Location and Work Timings- Hartfort – remote work available - Full time employmentEducation:Bachelor’s or master’s degree in computer science or similar relevant field What we offer: · EXL Health offers an exciting, fast-paced and innovative environment, which brings together a group of sharp and entrepreneurial professionals who are eager to influence business decisions. From your very first day, you get an opportunity to work closely with highly experienced, world-class Healthcare consultants.· You can expect to learn many aspects of businesses that our clients engage in. You will also learn effective teamwork and time-management skills - key aspects for personal and professional growth.· We provide guidance/ coaching to every employee through our mentoring program wherein every junior level employee is assigned a senior level professional as advisors.· Sky is the limit for our team members. The unique experiences gathered at EXL Health sets the stage for further growth and development in our company and beyond.