Data Engineer - Austin
About The CompanyBiorce is a pioneering Healthtech company dedicated to revolutionizing drug development through the power of AI. We are passionate about accelerating medical advancements and improving patient outcomes.Our team comprises seasoned clinical research professionals, data scientists, and AI experts, working collaboratively to bridge the gap between cutting-edge technology and real-world clinical needs.With an unwavering commitment to revolutionize healthcare, we envision a world where all patients benefit from accelerated and cost-effective access to treatments. Biorce is poised to redefine the landscape of healthcare, shaping a future where innovation and accessibility converge for the betterment of humanity.About The RoleFollowing our successful expansion into the U.S. and continued growth across Europe, we are seeking a Data Engineer to help drive our AI and Data engineering from our Austin hub.Reporting directly to our Data / AI leadership, this person will play a critical role in driving the development of scalable, reliable, and efficient data pipelines in the Google Cloud Platform ecosystem.This is an exciting opportunity to build and optimize the data backbone of Biorce’s next-generation platform, using modern GCP-native tools such as Data Fusion, BigQuery, and Cloud Storage, in a high-impact, fast-iterating environment.Who We’re Looking ForWe are looking for a skilled Data Engineer to join our growing AI and data team.Someone who can work closely with data scientists, AI engineers, and DevOps to design and operationalize robust data flows that fuel advanced analytics, machine learning, and regulatory-grade insights. This person should be able to shape the evolution of Biorce’s data and AI architecture while ensuring scalable, reliable, compliant, and cost-efficient data operations.Key ResponsibilitiesDesign, develop, and maintain scalable ETL/ELT pipelines using Google Cloud Data Fusion, Dataflow, Pub/Sub, and BigQuery.Build and orchestrate complex data ingestion workflows from diverse clinical, research, and third-party sources.Collaborate with data scientists to enable seamless model training, feature generation, and inference data flows.Ensure data quality, integrity, and lineage across all systems through rigorous validation and monitoring.Develop and optimize SQL and Python-based transformations to ensure high performance and maintainability.Manage data storage, partitioning, and lifecycle strategies for efficiency and cost control.Ensure compliance with SOC2, ISO 27001, HIPAA, GDPR, and clinical data governance standards in all data operations.Continuously improve internal frameworks for ingestion, metadata management, and data documentation.Contribute to cross-functional discussions to shape the evolution of Biorce’s data and AI architecture.Requirements✅ Must-haves3+ years of professional experience in Data Engineering or related roles.Proven hands-on experience with GCP data tools: BigQuery, Cloud Storage, Pub/Sub, Data Fusion, Dataflow, Composer, and Cloud Functions.Strong proficiency in SQL and Python for data transformation and automation.Experience designing batch and streaming data pipelines with scalable and fault-tolerant architectures.Familiarity with data modeling, schema design, and data warehouse optimization.Understanding of API-based ingestion, data normalization, and pipeline monitoring.Exposure to version-controlled, modular pipeline development, such as Terraform or GitOps.Experience working collaboratively with data scientists and MLOps teams.Bachelor’s or Master’s degree in Computer Science, Engineering, or a related quantitative field.✨ Nice-to-HavesExperience with clinical, biomedical, or healthcare datasets.Familiarity with Vertex AI, AI Platform Pipelines, or ML metadata tracking.Understanding of data governance and cataloging, such as Data Catalog, Looker, or similar.Knowledge of Apache Beam, Spark, or dbt for complex transformations.Exposure to infrastructure-as-code, Terraform, and containerized workflows, Kubernetes or Docker.Experience implementing data validation frameworks, such as Great Expectations or TFX Data Validation.Strong focus on reliability, observability, and continuous improvement of data systems.Why Join Us?A dynamic work environment with an international team, where collaboration and diversity thrive.Work alongside top talent, united by a shared purpose and committed to making a real impact.Comprehensive private health coverage to ensure your physical and mental well-being.Hybrid work model offering flexibility to balance your professional and personal life.Company events to celebrate achievements and enjoy time together.Get equipped with a MacBook to enhance your productivity and work experience.Our office is pet-friendly! You’ll likely be greeted by a few wagging tails upon arrival.--By submitting this application, I agree that my personal data will be collected, processed, and retained by the company solely for the purposes of managing and assessing my candidacy.