Upvote
Downvote
Sr. Software Engineer - AI / ML, AWS Neuron Distributed Training
Share Job
- Suggest Revision
$150
- This role is for a senior machine learning engineer in the Distribute Training team for AWS Neuron, responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive-scale Large Language Models (LLM) such as GPT and Llama, as well as Stable Diffusion, Vision Transformers (ViT) and many more.
- You will help lead the efforts building distributed training support into Pytorch, Tensorflow using XLA and the Neuron compiler and runtime stacks.
- Annapurna Labs was a startup company acquired by AWS in 2015, and is now fully integrated.
- If AWS is an infrastructure company, then think Annapurna Labs as the infrastructure provider of AWS. Our org covers multiple disciplines including silicon engineering, hardware design and verification, software, and operations.
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
Active Job
Updated TodaySimilar Job
Relevance
Active