Data Scientist
Only W2 PositionJob Title: Data ScientistRemote - EST/CSTExp: 3-4yrs Must have: 2-4 years of experience with proficiency in Pyspark, Databricks, Python, and Time Series Forecasting. They’re not looking for a GenAi expert as they understand it’s only been around for a couple of years, but needs someone with experience in Natural Language Processing and machine learning models. Submittal Requirements3-5 Must Haves (need to be highlighted in sizzle & present on resume)· Required· 2-4 years of experience· Experience with Machine Learning and Natural Language Processing (NLP)· ML coding experience with Azure DataBricks Workspace· PySpark, Python, and DataBricks.· Experience with ML models using both supervised and unsupervised learning methods.· Proficiency in Time Series Forecasting.· Preferred· Masters Degree· Generative AI experience· Familiarity with AI models such as Llama models or ChatGPT models.· Should have a strong interest in GenAI if not exposed professionally.· Most candidates experienced with NLP and ML will meet this requirement. Position Summary:We are looking for a Data Scientist that can grow, solve, drive, and create data products to support the Connected solutions portfolio. This role is a part of the Innovation, Strategy and Data science team within the Digital Platform and Innovation group responsible for bringing data science and machine learning ideas to life. Responsibilities: · Responsible for the development and implementation of predictive modeling algorithms and techniques to solve unmet needs, customer/business problems and optimize user experiences· Conduct in-depth research to stay at the forefront of AI advancements, exploring opportunities to integrate predictive and generative AI models into our products and services.· Predictive AI Modeling:· Formulate problem statements and hypotheses for diverse business challenges (clinical, operational and business process optimization problems).· Create Spark & Python code in Databricks to retrieve data from across disparate data sources and create new innovative actionable insights.· Prepare data for effective model training.· Develop, train, and evaluate predictive AI models using various tailored to specific problems.· Continuously refine and optimize models for performance, scalability, and efficiency.· Deploy models into production environments and monitor their performance.· Generative AI Modeling:· Identify opportunities where generative AI models can add value· Explore and experiment with generative models (e.g., GPT) suitable for the chosen application.· Train and evaluate generative models, fine-tuning parameters for desired outputs.· Integrate generative models into production workflows or applications.· Work with data-sets of varying degrees of size and complexity including both structured and unstructured data· Implements batch and real-time model scoring to drive actions· Develop sophisticated visualization of analysis output for various users· Determines the continuous improvement opportunities of current predictive modeling algorithms· Innovate and engage with key technology stakeholders to create a compelling vision of a data-driven enterprise environment and the impact it will have on their teams, their projects and their outcomes. Requirements: · Bachelor’s Degree in STEM (science, technology, engineering, math) related field or a similar quantitative analytics field, Master’s Degree preferred.· Minimum 2 - 4 years of professional experience with a variety of data products / data science model / algorithm development and implementing in production.· Experience with healthcare data and working in a HIPAA regulated environment preferred.· Experience with varying database structures and large datasets preferred· Experience with modern data science tools; their primary stack is Spark, Scala, Python, Databricks, and more. Experience in Microsoft Azure cloud environment is preferred· Proficiency with developing data visualization technology and capabilities (i.e., Power BI, Tableau)· Have a passion for creatively applying pragmatic and scalable approaches to Machine Learning to tackle difficult problems affecting patients and providers.· Are comfortable working on a high-performance team toward a multi-year vision with incremental deliverables.