Site Reliability Engineer
3 x SRE – SF (AI Infra Start Up) - Up to $240K + Equity + Benefits
TogetherWeTech is hiring a Site Reliability Engineer for a high-growth AI infrastructure startup, backed by top-tier investors and building the developer platform for the next wave of AI-first engineering teams.
As part of the core infrastructure team, you’ll own platform reliability and scalability, designing systems that support containerised, sandboxed development environments, automating observability, and managing incident response across critical services.
Key Benefits:
Competitive Equity
Private Health Insurance
Flexible Time Off
Daily Lunch & Onsite Perks
Key Requirements & Responsibilities:
5+ years of experience in SRE, DevOps, or Infrastructure Engineering
Strong skills in Python or Go
Deep experience with Docker, Kubernetes, and distributed systems
Proven ability to manage cloud infrastructure with IaC tools like Terraform or Pulumi
Familiarity with observability tooling (Prometheus, Grafana, etc.)
Confident in leading incident response, root cause analysis, and on-call practices
Strong architectural judgement for scalability, fault tolerance, and reliability
If you’re a systems engineer who thrives in infrastructure and developer experience, and you want to work in a product-led, fast-moving environment, let's connect.
Better, Together