JOBSEARCHER

Data Engineer, Human Cohorts

Who We AreCalico (Calico Life Sciences LLC) is an Alphabet-founded research and development company whose mission is to harness advanced technologies and model systems to increase our understanding of the biology that controls human aging. Calico will use that knowledge to devise interventions that enable people to lead longer and healthier lives. Calico’s highly innovative technology labs, its commitment to curiosity-driven discovery science, and, with academic and industry partners, its vibrant drug-development pipeline, together create an inspiring and exciting place to catalyze and enable medical breakthroughs.Position DescriptionCalico is seeking a Data Engineer to join our highly collaborative Engineering team and focus on developing high-performance research data infrastructure for large human cohorts. To succeed, you will need to be an enthusiastic team player, detail-oriented, extremely organized, and comfortable working on complex data, software, and scientific problems.In this position, you will be the engineering lead for data infrastructure to support our human biology teams. You will drive projects from requirements-gathering to production deployment, engineering high-performance data systems that integrate with our internal data systems and our internally-developed AI platform.Position ResponsibilitiesEnd-to-End Project Ownership: Collaborate with data scientists and bench scientists to gather requirements, architect solutions, and deploy production-grade software that facilitates data movement, transformation, analysis, and visualizationData Flow Architecture: Define and optimize data flows across the organizationFull-Stack Tool Development: Develop data systems and internal web applications (using React and Python) that allow stakeholders to review, visualize, and communicate complex scientific dataMentorship & Leadership: Serve as a strong technical voice within a larger Engineering team; provide mentorship to junior engineers across Calico and help onboard future hiresEngineering Excellence: Champion best practices for infrastructure-as-code, CI/CD, and containerization while helping to set standards for data engineering at CalicoPosition RequirementsBS/MS/PhD in Computer Science, Data Science, or a related technical field, or equivalent practical experience4+ years (for BS/MS) or 1-2 years (for PhD) of professional software or data engineering experience developing robust, production-grade, and high-performance R&D-focused information systemsExperience working with large-scale biological datasetsFluency in Python and SQL with a strong grasp of software and data engineering principles (testing, modularity, design patterns, data modeling)Demonstrated experience developing and deploying cloud-based applications on Google Cloud Platform (GCP) (preferred), AWS, or AzureStrong experience with modern web frameworks and infrastructure, specifically FastAPI, React, Kubernetes, and TerraformProven ability to lead complex projects involving diverse stakeholders (e.g., ML engineers, computational biologists, bench scientists) from concept to productionExperience enforcing robust data governance policies and compliance with internal information security standards and best practicesMust be willing to work onsite at least four days per weekThe estimated base salary range for this role is $191,000 - $195,000. Actual pay will be based on a number of factors including experience and qualifications. This position is also eligible for two annual cash bonuses.