Sr. Software Engineer, Cloud Infrastructure - Slack
ARCHIVED
We can't find an active application page for this role right now. It may reopen or be listed elsewhere. Use Next Steps to search for an active apply link and similar live jobs.
DescriptionAbout the TeamAt Slack, the Cloud Engineering team is the backbone of our infrastructure — a dynamic group of Cloud Engineers, Financial Analysts, and AWS Subject Matter Experts united by a single mission: keeping Slack fast, reliable, and cost-efficient for millions of users worldwide.We tackle unique, large-scale challenges that very few engineers ever get to work on. From designing the systems that power global real-time communication to writing software that brings deep visibility into our cloud infrastructure, our work has direct impact every single day. We partner with teams across Slack to maximize cloud value, champion cost-conscious engineering, and build a platform that scales with our ambitions. If you're energized by high availability, resilience, and the right technology choices — this is the team for you.Slack has a positive, diverse, and supportive culture — we look for people who are curious, inventive, and work to be a little better every single day. In our work together we aim to be smart, humble, hardworking and, above all, collaborative. If this sounds like a good fit for you, read on ahead!What You Will Be DoingLead software projects end-to-end — from scoping and architecture through delivery, iteration, and long-term ownershipArchitect and build a next-generation internal platform that gives engineering teams a powerful foundation to innovate quicklyDeliver cutting-edge solutions leveraging containerization, virtualization, and a broad suite of AWS servicesAuthor, extend, and improve Terraform modules that power infrastructure-as-code across SlackDesign and implement an in-house system to deploy, manage, and scale applications for service ownersPartner directly with development teams to identify performance bottlenecks and drive cloud efficiency improvementsBuild strong, trusted relationships with service owners — serving as a go-to advisor on cloud architecture and best practicesChampion a culture of platform efficiency by sharing knowledge, writing runbooks, and leading internal enablement sessionsMentor and grow junior engineers, scaling the impact of the team through thoughtful technical leadershipMake a measurable financial impact — driving millions of dollars in cloud cost savings annuallyParticipate in on-call rotation and collaborate with our operations team to triage and resolve production incidents with urgency and precisionBuild observability and introspection tooling that gives engineers deep, real-time visibility into system health and bottlenecksWhat You Should HaveU.S. Citizenship or Permanent Residency (Green Card holder). We are unable to provide visa sponsorship for this role.Genuine curiosity about how cloud infrastructure works — and a passion for sharing that knowledge with your teamProven ability to analyze, optimize, and improve reliability in high-traffic, production internet applicationsA strong mentoring instinct and commitment to engineering excellence: you lead by example in code reviews, testing, design docs, and debuggingDeep, hands-on AWS experience — broad familiarity across many services with deep expertise in at least a fewDemonstrated experience deploying cloud applications and managing infrastructure-as-code using Terraform and/or CloudFormationStrong ability to troubleshoot and debug complex issues across infrastructure, applications, and distributed systemsA track record of professional software development you're proud of — you can point to real-world systems you've built, scaled, and improvedExperience working with Kubernetes (K8s)Qualifications7+ years of professional experience in cloud engineering or a closely related discipline, working in a collaborative team environmentStrong computer science fundamentals: data structures, algorithms, distributed systems, programming languages, and information retrievalBachelor's degree in Computer Science, Engineering, or a related field — or equivalent training, fellowship, or work experienceProficiency in one or more functional or imperative programming languages — Python, Go, or PHP preferredHands-on experience with software engineering, scripting, automation, and orchestration tools (e.g., Bash, Chef, Jenkins, Terraform)Extensive, production-grade experience provisioning, configuring, and maintaining AWS environmentsExperience managing large-scale Kubernetes systems (EKS or Bare Metal)Bonus PointsDeep expertise in core AWS services such as EKS, EC2, IAM, Fargate, S3, or LambdaAWS Professional or Specialty certification(s)Experience designing or operating large-scale, high-volume distributed systemsA proven history of driving significant cloud cost reductions at scaleFamiliarity with observability tooling (e.g., Datadog, Prometheus, OpenTelemetry) and SRE practices