JOBSEARCHER

DevOps Engineer - AWS

About TensorWaveOur mission is simple: deliver seamless, secure, reliable, and resilient AI compute at scale. We've built a versatile cloud platform that eliminates infrastructure barriers, empowering builders to focus on innovation instead of fighting their stack. Because breakthrough AI should move at the speed of ideas, not infrastructure.About The RoleWe are hiring an AWS Cloud Engineer to design, provision, optimize, and support the AWS infrastructure powering our AMD GPU AI/HPC platform. This is a hands-on execution role — you'll work closely with Rust backend engineers, TypeScript developers, SREs, and platform teams to keep cloud infrastructure reliable, cost-efficient, and scalable. The goal is simple: reduce cloud bottlenecks and give our engineering teams a solid foundation to build on.What You’ll DoOwn the full lifecycle of AWS infrastructure across dev, staging, production, and customer-facing environments — provisioning, scaling, monitoring, security, cost optimization, and decommissioningBuild and maintain Infrastructure-as-Code (Terraform, Pulumi, AWS CDK, CloudFormation)Implement cloud patterns for high availability, auto-scaling, secure service communication, and customer environment provisioningBuild and maintain CI/CD workflows for cloud infrastructure and hosted servicesImprove observability through metrics, logging, alerting, dashboards, and runbooksTroubleshoot AWS networking, compute, storage, IAM, and deployment issuesParticipate in incident response, post-incident reviews, and root cause analysisDocument architecture, operational processes, and best practicesWho You AreRequired Qualifications5+ years in cloud infrastructure, DevOps, SRE, or platform operationsHands-on AWS experience: VPCs, EC2, S3, IAM, CloudWatch, Route 53, load balancers, security groups, private networkingProficiency with IaC tooling (Terraform strongly preferred)Strong Linux fundamentals — networking, process management, storage, troubleshootingExperience with CI/CD, Git-based workflows, and monitoring/alerting platformsClear communicator who can document infrastructure and collaborate across engineering teamsPreferred QualificationsExperience with AI/ML, GPU, or HPC workloadsKubernetes on AWS (EKS or self-managed)Observability platforms: Prometheus, Grafana, Loki, OpenTelemetry, DatadogAWS cost optimization: right-sizing, savings plans, lifecycle policies, taggingStartup or high-growth infrastructure environment backgroundWhat we offerStock Options 100% paid Medical, Dental, and Vision insurance for Employees Company Health Savings Account Contributions 100% paid Short Term and Long Term Disability Insurance for Employees Life and Voluntary Supplemental Insurance Options Other Insurance Options, such as Pet & Legal Insurance Various Supplementary Health Benefits, such as discounted Virtual Healthcare Appointments and Serious Illness Support Flexible Spending Account 401(k) Employee Assistance Program Flexible PTO Paid Holidays Parental Leave Other In-Office Perks Equal Employment OpportunityTensorWave is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate on the basis of any protected status under applicable law.Reasonable AccommodationsTensorWave provides reasonable accommodations in accordance with applicable laws. If you require accommodation during the hiring process, please contact accomodations@tensorwave.com.Employment EligibilityAll offers of employment are contingent upon verification of identity and authorization to work in the United States, as required by law.Background ChecksWhere permitted by law, employment may be contingent upon the successful completion of a job-related background check.Data Privacy NoticeBy submitting an application, you acknowledge that TensorWave may collect, use, and retain your personal information for recruiting and employment-related purposes in accordance with applicable data privacy laws.