Founding Cloud Inference Engineer (Low-Latency AI Serving)

ARCHIVED

SupportFinityMillbrae, CAJune 25th, 2026

We can't find an active application page for this role right now. It may reopen or be listed elsewhere. Use Next Steps to search for an active apply link and similar live jobs.

Computer Systems Engineers/ArchitectsSoftware Publishers

A pioneering AI technology firm in San Francisco is seeking a founding member to optimize and serve models on Luminal Cloud. The role involves deploying models with advanced optimization techniques, conducting performance reviews, and enhancing scheduling processes. Ideal candidates are experienced in CUDA and GPU optimization, with hands-on knowledge of vLLM, SGLang, or TensorRT-LLM. A degree is not required, reflecting a modern approach to tech recruitment. J-18808-Ljbffr

matching similar jobs near Millbrae, CA

Founding GPU Compiler Engineer: Lead AI Compiler Stack
San Francisco Tensor CompanyMillbrae, CAJune 25th, 2026
Software DevelopersSoftware Publishers
Founding GPU Kernel Engineer
San Francisco Tensor CompanyMillbrae, CAJune 25th, 2026
Computer Systems Engineers/ArchitectsSoftware Publishers
Senior ML Infra Engineer - Monetization Systems
Ai Chopping BlockMillbrae, CAJune 18th, 2026
Software DevelopersSoftware Publishers
Performance Kernel Engineer for High-Speed GPU Inference
InferactMillbrae, CAJune 26th, 2026
Computer Systems Engineers/ArchitectsSoftware Publishers
Senior GPU Inference Performance Engineer
Solana FoundationMillbrae, CAJune 28th, 2026
Computer Systems Engineers/ArchitectsComputing Infrastructure Providers, Data Processing, Web Hosting, and Related Services