<Back to Search
Embedded AI Engineer
Sunnyvale, CAApril 6th, 2026
MatchPoint Solutions is a fast-growing, young, energetic globalIT-Engineering services company with clients across the US . We provide technology solutions to various clients likeUber, Robinhood, Netflix, Airbnb, Google, Sephora, and more!More recently, we have expanded to working internationally inCanada, China, Ireland, UK, Brazil, and India . Through our culture of innovation, we inspire, build, and deliver business results, from idea to outcome. We keep our clients on the cutting edge of the latest technologies and provide solutions by using industry-specific best practices and expertise.
We are excited to be continuously expanding our team. If you are interested in this position, please send over your updated resume. We look forward to hearing from you!If your skills, experience, and qualifications match those in this job overview, do not delay your application.Job Title: Embedded AI Engineer
Location: Sunnyvale, CA
Employment Type: 6+ Month Extendable Contract
Pay Range: USD 70-80/HR
- Role Overview/Job Responsibilities
About this opportunity – Embedded AI Engineer We are seeking an experienced Embedded AI Engineer to join our team in validating PyTorch-based Large Language Models (LLMs) using CUDA SDK APIs. The successful candidate will be responsible for debugging, extending, and replacing the underlying CUDA code to ensure seamless functionality on our company-specific AI processors.
Key Responsibilities:
● Validate PyTorch-based LLMs on company-specific AI processors using CUDA SDK APIs
● Debug and troubleshoot issues related to CUDA code integration with PyTorch models
● Extend and modify CUDA code to optimize performance on company-specific AI processors
● Replace existing CUDA code with custom implementations to meet specific requirements
● Collaborate with cross-functional teams to ensure successful integration of LLMs with company-specific AI processors
● Develop and maintain validation frameworks and tools for PyTorch-based LLMs
● Analyze and optimize the performance of LLMs on company-specific AI processors Requirements
● Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related fields
● Strong experience with CUDA programming and PyTorch framework
● In-depth knowledge of deep learning models, particularly Large Language Models (LLMs)
● Proficiency in C++ and Python programming languages
● Experience with debugging and troubleshooting complex software issues
● Excellent problem-solving skills and attention to detail
● Strong communication and collaboration skills
Nice to Have:
● Experience with AI processor architecture and design
● Knowledge of other deep learning frameworks, such as TensorFlow
MatchPoint Solutions provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. xywuqvp
This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.
1,009 matching similar jobs near Sunnyvale, CA
- Founding AI Engineer
- Staff Software Development Engineer (LLM)
- Senior Technical Lead- End-to-End AI Training Framework
- Senior Principal Engineer- End-to-End AI Training Framework
- LLM & Agentic AI R&D Intern
- Sr Software Engineer - AML, AI & Data Platforms (AiDP)
- Senior ML Engineer: Real-Time Marketplace Pricing
- Staff AI Software Engineer, Siri Core Modeling
- ML Runtime Optimization Engineer - Lead
- Staff Software Development Engineer (C++/go)
- Product Manager (AI/ML)
- Software Engineer (Quality), Retail and Marcom Engineering
- Senior ML Infra Engineer - Cloud GPU Training
- Full Stack AI Engineer
- Sr. Applied Scientist, Prime Video - Personalization and Discovery Science
- Sr. Applied Scientist, Amazon Ads
- Full Stack Engineer
- Applied Scientist II, Prime Video - Personalization and Discovery Science
- Principal Software Engineer/ Product Architect
- Firmware Engineer - All Levels
- Senior Software Engineer, Cloud/Backend - SCP (Hybrid)
- Embedded 5G/4G Cellular RF Software/Firmware Engineer
- AI Systems Engineer (Multiple Positions) (REF277851U)
- Sr. AR/VR/AI Experience Prototyper- Vision Products Software
- Sr. Applied AI Health Software Engineer
- Deep Learning Engineer - Perception Algorithms
- Staff Machine Learning Engineer - DashPass
- Staff Software Engineer
- Lead Research Scientist - GenAI for 3D Computer Vision
- Multimodal LLMs Research Engineer
- Media Software Engineer, Speech (All Levels)
- Matterport - Senior ML Ops Engineer
- Senior AI Research Scientist- Time-Series Foundational Models
- AI Research Scientist - Large Language Models (LLM) & Agentic AI
- Applied Research Engineer - Multimodal LLMs for Human Interaction
- Senior Java Developer
- Full Stack developer with ANSIBLE
- Spark Developer
- Sr. Full Stack Engineer
- DevOps Engineer (FortiAppSec)