Data Engineers (Analytics Data Platform)
ARCHIVED
We can't find an active application page for this role right now. It may reopen or be listed elsewhere. Use Next Steps to search for an active apply link and similar live jobs.
Data Engineers (Analytics Data Platform) Location: Scottsdale AZ (Onsite from Day 1)Duration: FTE or C2HMust have skill set: Spark, S3, Glue, AWS Redshift and infrastructure, AWS Data Lake Formation and Glue components, data security, SQL, and Python6-8 years of IT experience focusing on enterprise data architecture and management.Experience in Conceptual/Logical/Physical Data Modelling & expertise in Relational and Dimensional Data ModellingExperience with Databricks & on Prem, Structured Streaming, Delta Lake concepts, and Delta Live Tables requiredExperience with Spark scalaData Lake concepts such as time travel and schema evolution and optimizationStructured Streaming and Delta Live Tables with Databricks a bonusExperience leading and architecting enterprise-wide initiatives specifically system integration, data migration, transformation, data warehouse build, data mart build, and data lakes implementation / supportAdvanced level understanding of streaming data pipelines and how they differ from batch systemsFormalize concepts of how to handle late data, defining windows, and data freshnessAdvanced understanding of ETL and ELT and ETL/ELT tools such as Data Migration Service etcUnderstanding of concepts and implementation strategies for different incremental data loads such as tumbling window, sliding window, high watermark, etc.Familiarity and/or expertise with Great Expectations or other data quality/data validation frameworks a bonusFamiliarity with concepts such as late data, defining windows, and how window definitions impact data freshnessAdvanced level SQL experience (Joins, Aggregation, Windowing functions, Common Table Expressions, RDBMS schema design performance optimization)Indexing and partitioning strategy experienceDebug, troubleshoot, design and implement solutions to complex technical issuesExperience with large-scale, high-performance enterprise big data application deployment and solutionArchitecture experience in AWS environment a bonusFamiliarity working with Lambda specifically with how to push and pull data, how to use AWS tools to view data for processing massive data at scale a bonusExperience with Gitlabs and CloudWatch and ability to write and maintain gitlabs for supporting CI/CD pipelinesExperience working with AWS Lambdas for configuration and optimization and experience with S3Familiarity with Schema Registry, message formats such as Avro, ORC, etc.