<Back to Search
Platform Engineer
Millbrae, CAMarch 31st, 2026
Platform Engineer At SpeakAs a Platform Engineer at Speak, you'll be the driving force behind the reliability and resilience of the systems that power our global language learning experience. You'll lead efforts to scale our infrastructure, harden our platform, and ensure that our services are fast, available, and reliable for millions of users around the world.
You'll work across our stackfrom Kubernetes on GCP to our Node.js APIs, Postgres, and Redisbuilding robust infrastructure and operational tooling. You'll own incident response, observability, and SLOs while embedding a culture of reliability throughout the engineering org.
Speak is growing rapidly, and we're pushing our systems harder every day. This is a unique opportunity to shape the future of our platform as we scale to the next 10x of users.
What you'll be doing
Own the reliability of Speak's infrastructure across GCP, Kubernetes, and our Node.js/Postgres stack
Lead response for P0/P1 incidents, drive postmortems, and ensure we're learning from every outage
Improve observability, alerting, and on-call processes so we catch issues before users do
Define and drive adoption of SLOs/SLAs for core systems and services
Build tools and frameworks to make reliability easier for product engineersthink safer deploys and infrastructure automation
Collaborate cross-functionally with Product, Engineering, and ML teams to ensure reliability is baked into everything we build
Set short term and long term roadmaps to ensure stability for our growing user base.
Be a thought leader and coach around platform and infrastructure engineering principlesblameless culture, operational maturity, and continuous improvement
What we're looking for
7+ years of experience in SRE, DevOps, Platform, and/or infrastructure-focused engineering roles, ideally with experience leading or mentoring others
Strong experience with GCP, Kubernetes, Terraform, Node.js, Python, PostgreSQL, Redis, and observability tooling like Prometheus and Sentry
Proven track record of improving reliability, scaling systems, and reducing incident frequency and severity with high traffic systems
Strong incident management and root cause analysis skillsyou know how to lead under pressure
Experience building and maintaining CI/CD pipelines and deployment safety tooling
Strong systems thinking, with the ability to identify failure points and proactively harden services
Deep sense of ownership and a desire to make infrastructure a force multiplier for the rest of the org
Bonus
Familiarity with cost optimization strategies in cloud-native environments
Background in security, chaos engineering, or disaster recovery planning
Contributions to internal tooling, automation, or developer productivity initiatives
Why work at Speak
Join a fantastic, tight-knit team at the right time: we're growing very quickly, we've most recently raised our Series C from some of the top investors in the valley, and we've achieved product-market fit in our initial markets. You'd join at a magical time when a single person could significantly change the course of the company.
Do your life's work with people you'll love working with: we care strongly about our craft and want every person at Speak to feel like they're growing every day. We believe in the idea that working with people you both enjoy and have respect for makes everything better. We hire thoughtfully and only work with people we admire deeply.
Global in nature: We're live in over 40 countries and launching in a number of new markets soon. We have dedicated offices in San Francisco, Ljubljana, Seoul, and Tokyo, and you'll have the opportunity to talk to users in each of these regions on a regular basis as well as travel.
Impact people's lives in a major way: Learning a language is one of the single most life-changing skills one can learn, and right now 99% of people never achieve their goal because the process is broken. We're helping millions of people achieve their goals and improve their lives.
Speak does not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.
Showing all 24,318 matching similar jobs
- Platform Engineer - Site Reliability Engineering
- Staff Site Reliability Engineer
- Site Reliability Engineer
- Senior Site Reliability Engineer (Remote)RemoteMarch 31st, 2026
- Senior Site Reliability Engineer – Platform Discovery
- Site Reliability Engineer Staff
- Site Reliability Engineer Principal (Software Engineering)
- Senior Site Reliability Engineer (Platform Focus)
- Site Reliability Engineer, Compute - USDS
- Site Reliability Engineer (AWS)
- Senior Site Reliability Engineer
- Site Reliability Engineer Principal (Software Engineering)
- Site Reliability Developer 6
- Site Reliability Developer 6
- Site Reliability Developer 6
- Site Reliability Developer 6
- Lead Site Reliability Engineer
- Senior Site Reliability Engineer
- ELH Site Reliability Engineer Lowell, SVL, Austin
- Senior Lead Site Reliability Engineer
- Site Reliability Engineer (AWS)
- Senior Site Reliability Engineer
- Senior Site Reliability Engineer I
- Engineer Lead, Site Reliability
- Engineer Lead, Site Reliability
- Site Reliability Engineer
- Site Reliability Developer 6
- Site Reliability Developer 6
- Site Reliability Developer 6
- Site Reliability Developer 6
- Site Reliability Engineer - Trading
- Site Reliability Engineer
- Sr/Staff Site Reliability Engineer, Consumer Apps Chicago, IL; Redwood City, CA
- Platform Engineer - DoD AECC Kubernetes ExpertRadford, VAMarch 27th, 2026
- Site Reliability Engineer_Pipeline
- Senior Site Reliability Engineer I
- Site Reliability Developer 6
- Site Reliability EngineerCharlotte, NCMarch 31st, 2026
- Principal Cloud Site Reliability Engineer, Actimize
- Senior Site Reliability Engineer