Data Engineer (Remote)
About The Company

The Phia Group is a service-oriented organization dedicated to assisting employee health plans across the nation. Our mission is to provide high-quality, affordable healthcare solutions to American employees and their families. We are committed to delivering innovative, cost-cutting strategies and expanding our service offerings to meet the evolving needs of our clients. Recognized for our exceptional workplace culture, The Phia Group was named one of USA Today’s Top Workplaces for 2026. Additionally, regional accolades from The Boston Globe and Louisville Business First highlight our unwavering commitment to fostering an inclusive, enjoyable, and empathetic work environment. Our talented and dedicated team is our most valuable resource, and we continue to experience growth driven by our commitment to excellence and innovation.

About The Role

The Data Engineer at The Phia Group plays a critical role in supporting the development, maintenance, and optimization of data pipelines and analytics-ready datasets. This position involves collaborating closely with multiple teams and stakeholders to address complex data challenges and facilitate data-driven initiatives. The successful candidate will be responsible for building, maintaining, and optimizing data pipelines using Azure Data Factory, ensuring reliable data ingestion, transformation, and delivery to Snowflake for analytics purposes. They will implement monitoring systems, alerts, and testing protocols to ensure data quality and performance, troubleshoot issues, and perform root cause analysis to resolve operational problems proactively. Additionally, the Data Engineer will document data structures, processes, and architectural decisions, support the development of curated datasets for analytics and machine learning, and enable data for AI/ML use cases by preparing feature-rich datasets and supporting model deployment workflows.
This role is vital to improving ongoing reporting, automating processes, and supporting the organization’s data-driven growth and innovation efforts.

Qualifications

- Bachelor's degree in Computer Science, Computer Engineering, Information Technology, or a related field; or equivalent professional experience.
- 5+ years of experience in data engineering or business intelligence roles, with expertise in ETL, data modeling, data architecture, and developing pipelines for analytics.
- Proficiency in advanced SQL, Python, or other programming languages used for data processing and automation.
- Experience supporting or working with AI/ML workflows, including data preparation, feature engineering, and integration with ML frameworks such as scikit-learn, TensorFlow, or PyTorch.
- Strong understanding of model lifecycle concepts, including training, validation, deployment, and monitoring.
- Expertise in working with Snowflake for data warehousing, including schema design, performance tuning, and optimization.
- Proficiency with version control tools such as Git and Azure DevOps, adhering to collaborative development practices.
- Experience designing, developing, and deploying end-to-end data pipelines using Azure Data Factory.

Responsibilities

- Build, maintain, and optimize data pipelines utilizing Azure Data Factory to ensure reliable data ingestion, transformation, and delivery to Snowflake.
- Implement monitoring, alerts, and testing procedures to maintain high data quality, performance, and lineage tracking.
- Troubleshoot data issues and perform root cause analysis to proactively resolve operational challenges.
- Document data structures, processes, architectural decisions, and best practices for knowledge sharing across teams.
- Develop, maintain, and optimize Snowflake objects including schemas, tables, and views to produce analytics-ready datasets.
- Collaborate with analysts, stakeholders, and product teams to translate business requirements into technical data solutions.
- Prepare feature-rich datasets to support AI/ML initiatives, including feature engineering and ensuring data consistency for training and inference.
- Support the deployment and operationalization of machine learning models by integrating pipelines with ML workflows.
- Continuously improve reporting and analytics processes, automating or simplifying manual tasks and enabling self-service capabilities.
- Implement version control practices for all data engineering code and related documentation.

Benefits

- Competitive salary package aligned with experience and qualifications.
- Comprehensive health, dental, and vision insurance plans.
- Generous paid time off and holiday leave policies.
- Opportunities for professional development and continuous learning.
- A collaborative and inclusive work environment recognized as one of the Top Workplaces.
- Flexible work arrangements to promote work-life balance.
- Employee wellness programs and resources.

Equal Opportunity

The Phia Group is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, age, disability, sexual orientation, gender identity, or any other protected status in accordance with applicable laws.