Senior Data Engineer
At EMC, we’re all about working together to make an impact. As part of our team, you’ll have the opportunity to grow, contribute, and gain experience that matters. We strive to be caring leaders, close partners, and responsive experts—always supporting each other to do our best work. Join us, and let’s improve lives together.This position is eligible to work a hybrid schedule within the Des Moines, Iowa Metro Area**Essential FunctionsDevelops, test, and maintains prioritized engineering (design/build/test) work as a member of a data analytics product pipeline architecture and infrastructure team:Ensures delivery of solutions for organization of data, such as transformation, conformation, integration, and aggregation of dataPartners with the Data Platform Operations (DPOs), Information Security, Network, and Operations teams to design and implement new data pipelines for new business initiatives as related to data and analyticsServes as a technical expert for the Data Engineering teamEnsures best practices and internal processes are followed when solutions are designed and implementedEnsures the delivery of solutions for acquisition of data, such as landing, staging, rapid ingestion, and historical data extractionCreates processes framework, design and implementation of large-scale data conversion and data ingestion on various cloud and on-premise platforms using most optimal processing and programming techniquesInvestigates and analyzes feasibility of data integration prototypes and program requirementsDevelops specifications and plans the full range of programming actions to produce data integration componentsDevelops and maintains highly complex automated ETL/ELT pipeline architecture that cleans, transforms, and aggregates data from disparate sources using architecturally approved tools, programming languages, and services to deliver a data mesh approachDevelops and maintains highly complex automated ETL/ELT monitoring and alarming solutions using architecturally approved languages and servicesMonitors, troubleshoots, and optimizes data pipelines for quality, performance, and reliability creating a single source of truth providing business customers with highly reliable and quality dataBuilds highly complex unit tests to enable test driven development and automation to drive code reliabilityDevelops, maintains, tests, and deploys code with architecturally approved toolsets using Dev Ops practices, ETL/ELT pipelines, data catalogs and data queriesServes as a technical resource and supports team members on EMC’s data integration projects and helps them learn new technologiesLeads evaluation of new technologies including performing POCs and presenting results to others, with a goal of providing technical recommendationsBalances and designs data analytics product solutions based on cost and performance to meet business requirementsGuides business customers to develop, troubleshoot, and optimize complex SQL and ETL/ELT solutions to solve reporting, metrics, and analytics problems.Data analysis: Collaborates with cross-functional teams to understand data requirements for data analytics products and translate them into scalable and efficient data solutions:Leads complex projects and provides subject matter expertise, support and advice to other projects being handled by the teamPartners with users and stakeholders to understand data needs and business requirementsLeads the team on conceptualizing and developing new data solutions to meet the business data needs and requirementsResearches, performs analysis, and proposes effective solutions related to system developments and enhancements through collaboration with Data ArchitectureAnalyzes and reviews potential adjustments or modifications for impacts on other programsDrives best practices in data governance, data security, and privacy principles to support data lineage and reference data needs.Collaborates with cross-functional teams to understand business and downstream application (BI reporting, Data Science, Data Governance, other System integrations) and consumer requirements to deliver data-driven solutions that implement best practices for data management, security, and governanceData ModelingDevelops highly complex data objects for business analytics using data modeling techniquesModels front-end and back-end data sources to help draw a more comprehensive picture of user flows throughout the system and to enable powerful data analysisWorks with stakeholders to understand highly complex business needs and translates them into a conceptual data model, defining entities, attributes, and relationshipsBuilds highly complex detailed logical models based on the conceptual model, specifying data types, constraints, and keys to ensure data integrityImplements highly complex logical models into appropriate database management system, considering database features, storage optimization, and performance considerationsDesigns highly complex data pipelines to extract data from source systems, transform it as needed based on the data model, and load it into the target databaseDocumentationParticipates in creating and maintaining documentation related to the design of applications and interfaces to enterprise and third-party systems in accordance with departmental and team standardsCollaborates with data governance on ensuring what is being produced in data analytics products enables data governance to publish a data assetCollaborates with data governance to develop and maintain data analytic products, artifacts and user documentation for business customersWorks with business customers and software development teams to gather and document requirements for data publishing and data consumption via data mesh architecture.Maintains systems documentationEducation & ExperienceBachelor’s degree, preferably in a computer related field or equivalent relevant experienceEight years of data integration development experience, or related experienceKnowledge, Skills, & AbiltiesExcellent SQL Knowledge and experience working with relational databasesExcellent analytics skills to work with unstructured datasets preferredExcellent knowledge of applicable programming languages such as PythonStrong knowledge of cloud services such as RDS, S3, SNS, CloudWatch and LambdaAdvanced knowledge of DBT/Informatica or other ETL/ELT tools preferredExcellent knowledge of SnowflakeExcellent knowledge of data modelingExcellent ability to build processes supporting data transformation, data structures, metadata, dependency and workload managementExcellent knowledge of orchestration or scheduling toolsExcellent ability to manipulate, process and extract value from large, disconnected datasetsAdvanced knowledge of DevOps and/or DataOps practices, CI/CD pipelines, and containerization technologies preferredExcellent verbal and written communication skillsStrong attention to detail, organizational and multi-tasking skills required with the ability to adapt to changing prioritiesAbility to maintain confidentialityThe hiring salary range for this position will vary based on geographic location, falling within either of the following:$103,954 - $149,372 or $114,895 - $164,309A hiring range represents a subset of the full salary range. The actual salary will depend on several factors, including relevant education, skills, and experience of an applicant, geographic location, and business needs.For information relating to the benefits EMC Team Members receive as part of a comprehensive rewards package, please visit www.emcins.com/careers.Our employment practices are in accordance with the laws that prohibit discrimination due to race, color, creed, sex, sexual orientation, gender identity, genetic information, religion, age, national origin or ancestry, physical or mental disability, medical condition, veteran status, active military status, citizenship status, marital status or any other consideration made unlawful by federal, state, or local laws.All of our locations are tobacco free including in company vehicles.