<Back to Search
Founding ML Engineer | Evaluating Frontier Medical AI | $150k–$200k | SF
Sunnyvale, CAApril 5th, 2026
About the job
This role is being recruited by CoffeeSpace on behalf ofTessel , an SF-based startup working at the intersection of ML evaluation, clinical validation, and FDA regulatory strategy.We’re identifying a small number of exceptional ML researchers from our network.
If there’s a strong fit, we’ll introduce you directly to the founding team.Founding ML Engineer
Location:San Francisco (on-site)
Compensation:$150k–$200k base + 1-3% equity
Start timeline:ASAP
Employment type:Full-timeAbout Tessel
The next generation of diagnostic AI will detect cancer earlier, catch disease before symptoms appear, and change outcomes for millions of patients. But only if the AI actually works.Tessel builds the evidence infrastructure that proves it does.They partner with leading diagnostic AI companies and hospitals to rigorously measure, explain, and continuously monitor model performance.At Tessel, evaluation isn’t a compliance checkbox – it’s how you build AI worth trusting.Backed by leading investors and part of StartX (Stanford’s accelerator).The Founder
Founded by Lucas Tao (Stanford MS CS, former Stanford ML Group researcher at SAIL, ex-AWS engineer), with deep experience across ML systems, interpretability, and large-scale infrastructure.VC-backed, founder-led AI company already trusted by hospitals and diagnostic AI companies navigating regulatory approval.The Role
You’ll work directly with medical imaging companies ahead of FDA 510(k) or De Novo submissions, owning engagements end-to-end – from defining evaluation questions to delivering evidence that drives go / no-go decisions.This is not about building models. It’s about understanding them.Where do they generalize? Where do they break? What trade-offs are being made? What uncertainty remains?Your output is defensible, decision-grade evidence – clear enough to inform internal decisions, build customer confidence, and withstand regulatory scrutiny.You combine strong ML instincts with customer-facing judgment and consistently deliver under time pressure.Required Qualifications
Demonstrated history of non-trivial machine learning or analytical work: meaningful projects, publications, systems built, or difficult problems solved
Strong empirical ML instincts: comfortable designing experiments, analyzing failure cases, and debugging model behavior using statistical or representation-level analysis
Able to design investigations, detect spurious patterns, reason about distribution shift and uncertainty, and distinguish signal from artifact
Comfortable working with messy real-world data, imperfect ground truth, and ambiguity
High analytical ownership in Python (data to analysis to defensible conclusions)
Clear and confident communicator of technical findings to customers and non-technical stakeholdersPreferred Qualifications
3 to 5 years of experience or a strong research track record, such as published work around model evaluation, building medical imaging models, or equivalent depth
Experience evaluating, validating, or debugging real-world ML systems
Familiarity with robustness, interpretability, or safety-critical evaluation
Exposure to medical imaging, healthcare ML, or other safety-critical domains
Experience working directly with customers or cross-functional stakeholdersThis Role Is NOT For You If
You would rather optimize a metric than investigate why a model breaks on a specific patient subpopulation
You need clearly defined tasks and stable scope to be effective
You are uncomfortable presenting findings that still contain uncertainty
You want pure technical work without customer relationship ownershipWhy This Role
This is a high-impact, high-ownership role. Your evidence directly affects whether a model is submitted to the FDA, whether a hospital adopts or walks away, and whether patients get the outcome they deserve.Dozens of startups are building another model. The company that proves rigorous, continuous evaluation works in medical AI will not just set the standard here. It will define how we build and govern high-stakes AI across every sector.If you are motivated by ownership, accountability, and real-world impact, not incremental optimization or hype, this role is for you.Next steps
Apply via this LinkedIn job post
We’ll review and reach out if there’s a strong match
If aligned, we’ll introduce you directly to the Tessel teamIf this role isn’t the right fit, we may suggest and make introductions to other high-signal startup roles we’re recruiting for – always with your permission.A quick note on authenticity
This is a real, active role that CoffeeSpace is recruiting for on behalf of Tessel. We don’t post speculative roles and work directly with hiring teams.
938 matching similar jobs near Sunnyvale, CA
- Staff Backend Engineer (Typescript)
- Applied AI Engineer
- Senior ML & CV Engineer - 3D Vision & AI
- Senior ML Compiler Engineer - Autonomous Driving
- Senior ML Engineer: Real-Time Marketplace Pricing
- AI Agents (Agentic AI)
- Senior ML Engineer, Earner Growth & Equity Incentives
- Senior Software Engineer
- Applied Scientist II, Prime Video - Personalization and Discovery Science, Personalization and Discovery Science
- Applied Scientist II, Prime Video Recommendation/Search Science
- Sr. Applied Scientist, Amazon Ads
- Senior AI Leader - Generative & Agentic Systems
- Software Engineer, Android
- Applied Scientist II, Prime Video - Personalization and Discovery Science
- Fireblocks Implementation Developer
- Senior Staff Architect: AI/ML Silicon & TPU Innovation
- Senior Health Tools Engineer - AI-Powered Debugging
- Senior ML Infra Engineer - Cloud GPU Training
- Applied Scientist (AI for Healthcare)
- Applied AI Engineer - AI Agent (PhD New Graduate)
- Machine Learning Engineer
- Machine Learning Engineer
- Junior Applied AI Engineer- HRIS
- Staff Software Development Engineer
- Matterport - Senior ML Ops Engineer
- Staff Engineer
- Media Software Engineer, Speech (All Levels)
- Senior Staff Engineer (Backend) - Road Safety/Insurance
- Principal Software Engineer/ Product Architect
- Full Stack AI Engineer
- Senior Site Reliability Engineer, AI/ML
- AI Systems Engineer (Multiple Positions) (REF277851U)
- Agentic AI Developer (Python & AI)
- Customer Engineer III, Applied AI, Google Cloud
- Backend Software Engineer (Bilingual in Mandarin )
- Full Stack Engineer
- Machine Learning Research Engineer
- Founding Engineer (Confidential B2C Healthtech Startup in San Francisco)
- Head of Machine Learning
- Staff Machine Learning Infrastructure Engineer