JOBSEARCHER

Data Application Engineer

The MissionDrug discovery has a translation problem: more than 95% of drugs that succeed in animal models fail in humans. We're building the alternative: human-first drug discovery, powered by organoids and AI, running on real human biology from the very first experiment.Our platform is 87% concordant with clinical patient data - a vast improvement over the 3% translational success rate of animals. We’ve demonstrated the ability to model immunotoxicology, immunogen stimulation, and two autoimmune diseases with more on the way. Numerous pharma partners including 3 Fortune 500 companies are already using the platform. We've raised ~$30M from AIX Ventures, Marc Benioff, Jeff Dean, and Y Combinator. With the FDA Modernization Act 3.0 and the FDA's March 2026 validation framework, the regulatory tailwinds only continue to get stronger.The opportunity ahead of the company is generational: build the first scaled engine for generating real human biological data, and use it to fundamentally change how medicines are discovered.The RoleThis role is a strategic and operational extension of leadership within the Data & Infrastructure team. You can carry full context across the department's workstreams and act as a trusted proxy, making decisions, unblocking teams, and driving execution with minimal oversight. The right candidate has fluency in modern data warehousing platforms (such as Palantir Foundry, Snowflake, Databricks, etc.), genuine curiosity about the science, and enough operational instinct to self-organize around the highest-value work without waiting to be told what to do.This is not a pure individual contributor role. It sits at the intersection of technical execution, project management, and strategic planning, and is designed for someone who can operate across those modes fluidly depending on what the department needs.What You Would OwnData Systems & Platform InfrastructureDrive the buildout of experimental data pipelines, storage architecture, and analytical toolingDefine and enforce data standards, schemas, and governance as dataset volume and complexity grow; partner with Automation and Science teams to ensure those standards reflect real experimental workflows, preventing data debt before it startsBuild and evolve the ontology (actions, objects, links) that represents our biological workflows in Foundry, and develop bespoke React applications that scientists and customers want to useDevelop a working understanding of what data we have, what it is worth, and where the gaps are, then build workflows that unlock that valueChampion data-driven discovery across the company, raising quantitative literacy and helping scientists move from raw data to insight with increasing autonomyIdentify technical debt and infrastructure gaps; scope and prioritize remediationScience Team PartnershipDevelop a genuine, working-level understanding of science teams' priorities, experimental roadmaps, and active book of work by being in the room, not relying on secondhand summariesEnsure Data & Infrastructure builds toward what science actually needs, not what looks logical from a systems perspective in isolationIdentify where data capture or pipeline gaps are creating friction for researchers and treat those with the same urgency as internal engineering prioritiesBuild enough trust with science leadership to anticipate needs and scope work proactivelyAutomation Team CoordinationStay current on the automation team's roadmap so that data infrastructure remains compatible with the physical platform as it evolvesWhere the two intersect (instrument integration, data ingestion, metadata standards), manage sequencing and dependencies sensibly without gatekeepingEnsure the data layer keeps pace with expanding automation capabilities so increased experimental volume produces well-structured datasets, not cleanup backlogsKeeping Things MovingSelf-organize around the department's highest-value work; seek out, sequence, and prioritize what needs doing rather than waiting for a task listOwn the operating rhythm: sprint planning, roadmap reviews, cross-functional syncs, dependency trackingSurface risks and tradeoffs early on infrastructure delivery timelinesTranslate technical constraints into business terms for BD, finance, and partnership discussions, where data infrastructure or security posture is relevantMandatory ExperienceWhat we are looking forExperience working with major data warehousing solutions (such as Palantir Foundry, Databricks, Snowflake, etc.) and strong fundamentals in database designProficiency with React frameworks for user-facing tools and visualizations Familiarity with cloud infrastructure (AWS) and modern data engineering practicesStartup or scale-up experience where scope is fluid and resourcefulness mattersExposure to life sciences data (assay data, LIMS, genomics, or similar) is desirable; you should be comfortable following a science team discussion and translating it into data and infrastructure implicationsWorking StyleHigh agency; you default to action with incomplete informationClear communicator who can move between engineering architecture discussions and leadership briefings in the same afternoonBuilds systems, closes loops, creates structure where none existsGenuinely curious about the science, not someone who treats scientific context as overheadWould Stand OutPrior "glue" role at a startup spanning technical and organizational domainsFamiliarity with data governance, regulatory data requirements, or GxP-adjacent environmentsTrack record of earning trust with wet-lab or scientific teams as a non-scientistExperience with distributed teams across US and European time zonesParallel Bio is an equal opportunity employer committed to fostering an inclusive and respectful workplace. We encourage applications from individuals of all backgrounds, regardless of age, gender, ethnicity, religion, disability, or sexual orientation.Compensation Range: $165K - $190K