JOBSEARCHER

Senior Software Engineer, Infrastructure Software for AI (Sunnyvale)

About Infrinia.ai, powered by SoftBank: SoftBank is making significant investments ininfrastructure for AI. Through its wholly owned US subsidiary, SoftBank Corp. has establishedInfrinia team in Silicon Valley, focused on infrastructure software for AI and AI foundations formobile networks. Our goals are to challenge the norms and create products making use of ourSOTA infrastructure (like Nvidia GB200, MGX and DGX Grace & Hopper platforms) and cloud-native software. These products are geared towards centralized AI data centers as well asdistributed AI Radio Access Network (AI RAN) data centers. We are looking for experiencedpractitioners who are inspired to bring innovation and build transformative products.Minimum Qualifications:Bachelor's degree in Computer Science, Electrical Engineering, or related field.5+ years in software, hardware, engineering, including platforms and distributed systems.2 years in lead roles, leading high-impact projects, teams.Experience in building systems & systems SW, AI frameworks, and applied AI.Preferred Qualifications:Master's in a relevant field.Hands-on experience with Kubernetes and container orchestration.Experience with GPU systems and high-performance computing environments.Expertise in building scalable infrastructure to support AI workloads.Experience with AI developer frameworks, tools, and automation systems.Role: Be a key member of the infrastructure team responsible for building foundational software on top of GPU systems supporting AI workloads (training, fine-tuning and serving). Own and develop major chunks of the new AI infrastructure SW with a focus on Kubernetes and GPU systems. Drive innovation in systems software architecture and automation for maximizing resource utilization. As a Senior Software Engineer responsible for major engineering tasks, work with Engineering Leads, Product and Program Management to drive execution towards commercialization.Responsibilities:Develop and build systems software for supporting AI workloads on large-scale GPU systems.Deliver control plane for workloads including scheduling and orchestration. Deliver management plane for underlying platforms.Provide northbound APIs for customer portals to interact with the infrastructure.Contribute to Product Definition (PRD) and program execution (sprint) planning.Attract and help build engineering talent.Role model and foster a culture of humility and innovation for product delivery.Salary: The base salary for this position ranges from ($150,000-$250,000), with additional attractive biannual bonus and benefits.