Application developer
About the Role
We are seeking a skilled professional to lead L2/L3 support for enterprise Astro environments, specifically focusing on Astronomer-managed Apache Airflow. This role involves managing Incident, Problem, and Change Management for the Data Orchestration Platform, performing advanced root cause analysis, and improving the reliability and efficiency of data workflows.

Responsibilities
- Lead L2/L3 support for enterprise Astro environments.
- Own Incident, Problem, and Change Management for the Data Orchestration Platform.
- Conduct advanced root cause analysis for pipeline failures, scheduler issues, and infrastructure bottlenecks.
- Enhance DAG reliability, ensure SLA adherence, and reduce Mean Time to Recovery (MTTR).
- Establish support playbooks, runbooks, and operational standards.
- Design and maintain dynamic DAG frameworks for scalable pipeline onboarding.
- Enable automated onboarding of new data domains, sources, and transformation workloads.
- Support complex ETL/ELT workflows across Data Lakes and Data Warehouses.

Education Qualification
- Background in Computer Science, Information Technology, or a related field.

Required Skills
- Expertise in Data Engineering & Orchestration.
- Experience with Data Lakes (S3-based architectures) and Data Warehouses (e.g., Snowflake, Redshift).
- Background Check Clearance (BGC) required.

Nice to Have Skills
Python Development & Automation:
- Develop production-grade Python-based DAGs, operators, and plugins.
- Implement configuration-driven, reusable DAG generation frameworks.
- Automate deployment, environment provisioning, and pipeline lifecycle management.
- Enforce coding standards, version control, and CI/CD best practices.
- Optimize task parallelism, retries, and failure handling strategies.

Cloud & Kubernetes Operations (AWS Focus):
- Manage Astro deployments on AWS (EKS, S3, RDS, IAM, CloudWatch, VPC).
- Troubleshoot and optimize Kubernetes-based Airflow clusters.
- Perform capacity planning and resource tuning for workers and scheduler nodes.
- Implement Infrastructure-as-Code (Terraform/CloudFormation preferred).
- Ensure high availability, disaster recovery, and cloud resilience.

Description
The role involves developing, supporting, and maintaining applications using open source development platforms such as C, C++, Perl, Python, Node JS, and Django. The candidate will utilize their expertise in open source technologies to analyze and resolve issues, collaborate with cross-functional teams to design and implement solutions, and provide technical guidance and support to clients on open source platform implementation, customization, and integration.

The Company offers the following benefits for this position, subject to applicable eligibility requirements: medical insurance, dental insurance, vision insurance, 401(k) retirement plan, life insurance, long-term disability insurance, short-term disability insurance, paid parking/public transportation, paid time off, paid sick and safe time, hours of paid vacation time, weeks of paid parental leave, and paid holidays annually, as applicable.