<Back to Search
Gen AI Architect
Sonoma, CAMarch 20th, 2026
Looking for a Gen AI architect with 15+ years experience and 8+years experience focusing on Model Optimization, Fine-Tuning & Strategic AI in San Francisco, CA.Role Summary:You represent the pinnacle of Applied AI engineering. You are not just using APIs; you are optimizing the models themselves. You understand the mathematics behind the attention mechanism, you know how to squeeze performance out of GPUs, and you can customize models for specific domains. You provide the high-level technical vision and handle the most difficult edge cases. .Key Responsibilities:Model Fine-Tuning: Implement PEFT (Parameter-Efficient Fine-Tuning), LoRA, and QLoRA to adapt open-source models (Llama 3, Mistral) to specific client domains.Optimization & Quantization: Perform model quantization to reduce inference costs and latency without sacrificing quality. Manage Dense Vectors and embedding optimizations.State-of-the-Art Exploration: Continuously research and implement the latest advancements (e.g., State Space Models, Long-Context optimizations) into client deliverables.Strategic Consulting: Act as a trusted advisor to C-level client executives, defining the "Art of the Possible" and guiding long-term AI roadmaps.Technical Requirements:Deep Learning: PyTorch/TensorFlow, Transformers architecture internals, Attention mechanisms.Model Ops: Serving custom models (vLLM, TGI), GPU memory management, Quantization techniques (GGUF, AWQ).Advanced Data: Training data curation, synthetic data generation, RLHF concepts.Tech Leadership: Ability to define the technical culture and set standards for the entire FDE organization.Soft Skills:Executive communication and ability to influence C-level leaders.Thought leadership and industry presence (conferences, playbooks, forums).Cross-org leadership and conflict resolution.Ability to define long-term AI vision and cultural standards.Strategic decision-making balancing cost, risk, and performance.
Showing 50 of 19,335 matching similar jobs
- Gen AI Architect
- Gen AI Architect
- Gen AI Architect
- Gen AI Architect
- Gen AI Architect
- Gen AI Architect
- Gen AI Architect
- Gen AI Architect
- Sr. Engineering Manager, AI/ML Serving Platform
- Generative AI Engineer
- Applied Machine Learning Engineer – ML & AI Systems
- Senior Software Engineer, Applied AI
- AI/ML Architect for HPC & Cloud Solutions
- Autonomy AI/ML Engineer - Edge, Multi-Agent C2
- Research Intern AI/ML & DH
- Applied Machine Learning Engineer – ML & AI Systems
- AI for Science Engineer: Transform Medicines with ML
- Senior AI/ML Systems Engineer - Production & Infra
- 1.12 Senior AI Software Engineer - Edge Model Optimization & DeploymentIrvine, CAMarch 20th, 2026
- Applied AI / ML Engineer
- Applied AI Systems Engineer
- AI Engineer Intern (USPS) - Summer 2026
- Senior AI/ML Scientist - RemoteMinneapolis, MNMarch 20th, 2026
- Android Kernel Engineer - TS/SCI (Chantilly)
- Applied AI Engineer
- Agentic AI Engineer for Enterprise Deployments
- Principal AI Software Engineer - Full-Stack Dev
- Senior Machine Learning Engineer - Ranking & Recommendations (Generative AI)
- Applied AI Engineer - Build Scalable AI Infrastructure
- AI Frameworks Engineer (OpenVINO, GenAI)
- AI Evaluation Engineer, Siri Core Modeling
- Senior GenAI Engineer
- AIML - Senior ML Researcher in Foundation Models, Responsible AISeattle, WAMarch 23rd, 2026
- Partner 20, Applied ML, Engineer, ASG
- AIML - ML Researcher in Foundation Models, Responsible AISeattle, WAMarch 23rd, 2026
- AI Engineer
- Senior Software Engineer, Applied AI
- Engineer
- Engineer
- Deep Learning Engineer II