Software Development Engineer - Silicon Development Infrastructure
Description

We're seeking a Software Development Engineer to help architect, build, and operate the infrastructure that accelerates silicon development at Annapurna Labs. In this role, you'll contribute to the platforms, tooling, and automation that enable our chip design teams to iterate faster, validate more thoroughly, and bring transformative silicon to market. You'll work at the intersection of cloud infrastructure, high-performance computing, and electronic design automation, building systems that directly impact AWS's ability to innovate in custom silicon. This is a unique opportunity to grow your skills in infrastructure that supports chip development while working with world-class engineers across hardware, software, and operations disciplines.

Key job responsibilities

Customer-Focused Infrastructure Development
- Partner with silicon design, verification, emulation, and software teams to understand their development workflows, pain points, and iteration cycles.
- Build tooling and automation that eliminates manual toil and reduces time-to-results.
- Gather continuous feedback from internal customers and rapidly iterate on solutions.
- Benchmark infrastructure based on silicon development workflows to provide internal customers with the optimal resources for silicon development.

Own Platform Delivery and Operations
- Design, implement, and operate cloud infrastructure and high-performance computing clusters using schedulers like Slurm.
- Build and maintain CI/CD pipelines for infrastructure-as-code and service deployments with comprehensive testing and safe rollback mechanisms.
- Take ownership of platform reliability, performance, and cost efficiency from initial design through production operation.

Drive Results Through Automation and Observability
- Develop monitoring, diagnostics, and alerting systems that surface actionable insights on efficiency, utilization, reliability, and cost trends.
- Establish incident response processes, runbooks, and documentation that enable operational excellence.
- Proactively anticipate system failures and implement preventive measures, reducing operational toil and improving system resilience.

A day in the life

Each day you will work with some of the best engineers in the industry to develop Machine Learning Accelerators. On-site in Austin, Texas, you will be part of the team that develops custom silicon and contribute to the infrastructure that enables this innovation.

You might start your day investigating anomalies in job completion rates or resource utilization patterns. You could spend your morning collaborating with a design verification team to optimize their regression workflows, identifying bottlenecks and proposing improvements. In the afternoon, you might be building new tooling that simplifies infrastructure access for emulation teams, or contributing to monitoring dashboards that give teams real-time visibility into their development velocity.

You'll participate in design reviews, contribute to postmortems when incidents occur, and continuously refine the systems that accelerate the path from RTL to silicon. Throughout the day, you'll balance immediate customer needs (such as unblocking a team waiting for compute capacity) with longer-term platform investments. You'll write code, review infrastructure-as-code changes, and collaborate across teams who depend on the systems you build.

Take a look inside our labs to see what you will learn at Annapurna Labs:
https://www.aboutamazon.com/news/aws/take-a-look-inside-the-lab-where-aws-makes-custom-chips
https://youtu.be/rViVFrQg4Hk

About The Team

At Annapurna Labs, your infrastructure work directly enables breakthrough innovations in custom silicon that power AWS and transform industries. You'll collaborate with world-class chip designers, verification engineers, and software developers who are pushing the boundaries of what's possible.
We offer the resources and scale of AWS with the innovation culture and technical depth of a focused silicon team.

If you're passionate about building infrastructure that accelerates innovation, thrive on customer obsession and ownership, and want to see your work enable the next generation of AWS silicon, we want to hear from you.

Basic Qualifications
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
- 2+ years of non-internship professional software development experience
- 2+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
- 3+ years of administrative experience in networking, storage systems, operating systems and hands-on systems engineering experience
- Knowledge of systems engineering fundamentals (networking, storage, operating systems)
- Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby
- Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

Preferred Qualifications
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, Rust, with demonstrated ability to write production-quality, maintainable code
- Experience utilizing AWS cloud solutions in a DevOps environment with infrastructure as code (CloudFormation, Terraform, CDK)
- Experience with Linux/Unix
- Experience in automating, deploying, and supporting large-scale infrastructure
- Experience with high-performance computing (HPC) clusters using workload schedulers like Slurm
- Familiarity with semiconductor development workflows or electronic design automation (EDA) environments
- Experience building services using AWS products
- Experience with CI/CD pipelines and build processes
- Experience with monitoring, observability, and incident management at scale

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, TX, Austin - 143,700.00 - 194,400.00 USD annually

Company - Annapurna Labs (U.S.) Inc.
Job ID: A10409008