GCP Data Engineer + Java
KANINI is seeking a highly skilled Data Engineer with deep expertise in Google Cloud Platform (GCP) and modern data architecture. The ideal candidate will have hands-on experience designing scalable data pipelines, implementing Medallion Architecture, and building robust enterprise-grade data solutions.This role requires strong technical proficiency in BigQuery, PySpark, Dataflow, and Airflow, along with a solid understanding of cloud data governance, performance optimization, and CI/CD practices.Key ResponsibilitiesDesign, develop, and maintain scalable batch and real-time data pipelines on GCPImplement and manage Medallion Architecture (Bronze, Silver, Gold layers) for data processingBuild high-performance data transformations using Python and PySparkDevelop and optimize complex SQL queries for analytical workloadsWork extensively with BigQuery for large-scale data processing and performance tuningDevelop and deploy pipelines using Cloud DataflowOrchestrate workflows using Cloud Composer (Apache Airflow)Manage data storage and lifecycle using Google Cloud Storage (GCS)Implement version control and CI/CD pipelines using Git-based toolsEnsure data security, governance, and access control using GCP IAMOptimize data solutions for performance, scalability, reliability, and cost-efficiencyRequired Skills & ExperienceStrong hands-on experience with Google Cloud Platform (GCP)Expertise in BigQuery (partitioning, clustering, query optimization)Proven experience implementing Medallion Data ArchitectureStrong programming skills in Python and PySparkHands-on exposure on JavaAdvanced proficiency in SQL (complex joins, window functions, performance tuning)Hands-on experience with Cloud DataflowExperience with Cloud Composer (Airflow) for orchestrationExperience working with Google Cloud Storage (GCS)Knowledge of version control systems (Git) and CI/CD practicesStrong understanding of GCP IAM and cloud security best practicesPreferred QualificationsExperience working with large-scale enterprise data platformsKnowledge of data warehousing and data lake conceptsFamiliarity with real-time streaming frameworksExperience in data governance and data quality frameworksExposure to Agile/Scrum methodologies