<Back to Search
Senior Software Engineer - AI, Search u0026 Knowledge Platform - Traffic Infrastructure
Cupertino, CAApril 5th, 2026
Are you an expert in large-scale networking and traffic infrastructure with a passion for building next-generation platforms for machine learning systems? We're seeking a hands-on technical leader with deep expertise in Envoy, Istio service mesh, L4/L7 load balancing, and modern internet protocols (HTTP/2, gRPC, HTTP/3) to design and scale traffic platforms that power Apple's Search and ML ecosystems. If you've contributed to CNCF or networking projects such as Envoy, Istio, Kubernetes networking, or related data-plane technologies, and you're excited about building capacity-aware, metrics-driven traffic systems for ML inference and training, this role offers the opportunity to architect at Apple scale-delivering highly performant, resilient, and intelligent traffic infrastructure supporting billions of requests.The AI, Search u0026 Knowledge Platform - Traffic Infrastructure Team within Apple's Services organization builds the foundational networking and traffic management platforms that power Search and large-scale ML workloads. Our focus is on designing modern L4/L7 traffic systems that intelligently route, balance, and optimize requests across heterogeneous compute environments-including GPU-backed inference services and multi-cloud deployments. We are reimagining traffic infrastructure as a programmable, metrics-driven, and capacity-aware platform, leveraging Envoy-based data planes, Istio service mesh, and dynamic control planes to support low-latency, high-throughput ML workloads. You'll work closely with ML engineers, SREs, and platform teams to enable secure, observable, and adaptive request routing for both server-to-server and client-to-server use cases.9+ years in networking, traffic infrastructure, or large-scale distributed systems roles. Contributions to CNCF or networking open-source projects (Envoy, Istio, Kubernetes networking, eBPF, etc.). Experience with HTTP/3, QUIC, or next-generation transport protocols. Strong understanding of capacity-based routing, adaptive load balancing, and feedback-driven traffic systems. Experience supporting ML inference platforms, GPU-backed services, or latency-sensitive ML workloads. Familiarity with observability stacks (OpenTelemetry, Prometheus, Grafana) for traffic and networking telemetry. Experience operating traffic systems across multi-region, multi-cloud, or hybrid environments. Excellent communication, technical writing, and cross-functional leadership skills. B.S., M.S., or Ph.D. in Computer Science, Computer Engineering, or equivalent practical experience.BS/MS in Computer Science or equivalent practical experience. 5+ years of experience in distributed systems, networking, or traffic infrastructure engineering. Strong programming experience in Golang and Python, especially for control-plane or data-plane systems. Deep expertise in L4/L7 networking concepts, including load balancing, connection management, retries, timeouts, and congestion control. Hands-on experience with Envoy, Istio, or similar service mesh / proxy technologies. Strong understanding of HTTP/1.1, HTTP/2, gRPC, and modern transport protocols. Experience designing and operating high-throughput, low-latency systems in production. Proven ability to lead complex technical initiatives and mentor engineers.
933 matching similar jobs near Cupertino, CA
- Apple Silicon GPU Driver Engineer, Graphics, Game and ML
- Lead Engineer, ML Network Stack - Annapurna Labs
- Software Engineer II - AI/ML, AWS Neuron, LLM Inference, AI/ML, AWS Neuron, Model Inference
- Sharing Experiences and Frameworks Engineer
- Senior Site Reliability Engineer (SRE), Data - Apple Ads
- SwiftUI Previews Engineer
- AI/ML Research Engineer - Camera & Photos
- Memory Tools Engineer
- Emoji & Stickers Software Engineer
- Software Engineer - Trust & Safety Solutions Engineer
- AI/ML Engineer, Applied Data Science
- Engineering Manager - Cloud Compute Software
- Senior Quality Engineer - Apple Maps Performance
- Engineering Program Manager, Private Cloud Compute - SRE, Apple Services Engineering
- Systems Engineer - Platform Architecture
- Software Developer - Apple Pay, Wireless Technologies u0026 Ecosystems
- MacOS Engineer
- Sr. Engineering Program Manager, ML Compute Infrastructure, Apple Services Engineering
- Site Reliability Engineering (SRE) Manager, Private Cloud Compute
- SDET Engineer - Maps Core Framework QE
- Embedded Quality Engineer - Camera
- AIML - Senior ML Researcher in Foundation Models, Responsible AI
- Operations Automation Engineer, Watch Software
- On-Device ML Compiler Engineer, Model Compilation, Graphics, Games u0026 ML
- Manager, Software Development (Hands-On Technical), ML Network Stack - Annapurna Labs
- Sr. Systems Software Engineer - Video Technologies
- Senior Site Reliability Engineer, Apple Data Platform Infra SRE
- Lead Engineer, ML Network Stack - Annapurna Labs
- NAND Qu0026R Engineer
- GPU Compiler Engineer, Graphics, Game and ML
- Software Engineer II - AI/ML, AWS Neuron, LLM Inference, AI/ML, AWS Neuron, Model Inference
- Manager, Software Development (Hands-On Technical), ML Network Stack - Annapurna Labs
- Sr. Formal Verification Engineer, Annapurna Labs
- Manager, Software Development, Network Product Development
- Software Quality Automation Engineer - Phone, FaceTime and Contacts
- Senior GenAI Engineer
- On-Device ML Compiler Engineer, Model Compilation, Graphics, Games & ML
- Sr. Software Engineer, Siri Speech
- Site Reliability Engineer - Apple Maps
- GPU Performance Engineer, Platform Architecture