Senior Software Development Engineer, Stores Foundational AI - Rufus
Palo Alto, CA
April 4th, 2026
We're enhancing the shopping experience on Amazon through the conversational capabilities of large language models, and we're looking for innovative professionals who are passionate about technology and customer experience. You'll have the opportunity to drive breakthrough innovations in LLM inference and post-training efficiency while working alongside talented scientists, engineers, and technical program managers (TPMs) to create solutions that serve our customers.
If you're excited about optimizing the computational heart of AI systems, collaborating with a dynamic team, and contributing to this evolving field, we'd love to have you join our mission to unlock unprecedented LLM performance!
Key job responsibilities
We're looking for an experienced Software Development Engineer with deep expertise in GPU and custom-chip kernel optimization and ML acceleration to lead projects in architecting, designing, developing, and optimizing high-performance kernel implementations for large language models. You'll guide your team in creating and optimizing innovative kernels, custom operators, and low-level optimizations that maximize hardware utilization and minimize computational overhead.
In this role, you'll establish best practices for kernel development, memory management, and parallel computing that dramatically reduce inference latency and boost throughput for transformer-based models. You'll work with your team to develop kernel fusion techniques, attention mechanism optimizations, and matrix multiplication accelerations at scale, partnering with engineers and scientists in a fast-paced environment to deliver measurable performance gains. You'll also drive the technical roadmap, performance benchmarking, and optimizations focused on kernel-level improvements.
BASIC QUALIFICATIONS
- 5+ years of non-internship professional software development experience
- 5+ years of programming experience with at least one software programming language
- 5+ years of experience leading the design or architecture (design patterns, reliability, and scaling) of new and existing systems
- Experience as a mentor or tech lead, or leading an engineering team
- Experience with vLLM, SGLang, TensorRT or similar platforms in production environments
- Experience with CUDA kernels or ML/low-level kernels
PREFERRED QUALIFICATIONS
- 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Bachelor's degree in computer science or equivalent
- Experience with Machine Learning and Large Language Model fundamentals, including architecture, training/inference lifecycles, and optimization of model execution
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers.