Platform & Production Operations Engineer
Platform & Production Operations EngineerAbout Smart AccessSmart Access is the AI execution layer for supply chains. We're a Frontline Execution Platform that helps the world's largest warehouses, manufacturers, and logistics operations close the gap between defined standards and actual frontline behavior. Our system connects standards, observations, coaching, and intelligence into a single execution loop that drives consistent, measurable performance on the floor.We work with operational leaders, frontline supervisors, safety teams, and continuous improvement leaders to systematically close the execution gap — the distance between what standards say should happen and what actually happens. As supply chains adopt AI, Smart Access is positioned to become the operational intelligence layer that AI agents rely on to drive frontline action.The Company just closed a Series A funding round, marking a significant milestone in its growth journey. This is a pivotal moment to join; the team is lean, the trajectory is steep, and the roles being hired now will be foundational to how Smart Access scales. If you thrive in a high-ownership, high-impact environment and want to help build something from the ground up, this is the opportunity.The RoleSmart Access is seeking a Senior Platform & Production Operations Engineer to help build, operate, and scale the infrastructure that powers our AI-enabled frontline execution platform.This is a hands-on role that combines elements of cloud infrastructure, DevOps, platform engineering, and production operations. You will be responsible for maintaining reliable production systems, improving deployment and operational processes, strengthening observability and monitoring, and partnering closely with engineering teams to ensure our platform scales securely and efficiently.As an early member of a growing engineering organization, you will have significant ownership and influence over how our infrastructure, deployment practices, operational tooling, and production processes evolve.This role is ideal for an engineer who enjoys both building systems and operating them in production.What You'll DoOwn and improve production operations across Smart Access cloud environmentsMonitor, troubleshoot, and resolve production incidents and performance issuesImprove reliability, availability, scalability, and operational excellence across the platformBuild and maintain cloud infrastructure and deployment automationPartner with application engineers to support development, testing, deployment, and production readinessDevelop monitoring, alerting, observability, and operational dashboardsParticipate in incident response, root cause analysis, and operational reviewsImprove security, access controls, secrets management, and operational compliance practicesOptimize cloud infrastructure utilization and cost efficiencyCreate operational runbooks, documentation, and repeatable support processesBecome deeply familiar with the Smart Access application stack and operate independently within 6–9 monthsRequired Qualifications8+ years of experience operating and supporting production SaaS applicationsExperience managing cloud-based applications in production environmentsStrong experience with Google Cloud Platform (GCP)Experience supporting highly available, multi-tenant SaaS platformsExperience with CI/CD pipelines and deployment automationExperience with infrastructure-as-code tools, preferably TerraformStrong troubleshooting and production incident management skillsExperience implementing and operating monitoring, observability, alerting, and incident response processesExperience working closely with software engineering teams in agile environmentsStrong written and verbal communication skillsAbility to operate independently and take ownership in a fast-moving startup environmentPreferred QualificationsExperience operating applications built on Google Cloud RunExperience supporting applications built with FastAPI and DjangoExperience with containerized application deploymentsExperience supporting AI-enabled or data-intensive applicationsExperience with observability platforms such as New Relic, Datadog, Grafana, Prometheus, OpenTelemetry, or Google Cloud MonitoringExperience with security, compliance, and operational best practicesExperience working within high-growth startup environmentsTechnical EnvironmentCurrent technologies include:Google Cloud Platform (GCP)Google Compute Engine (GCE)Cloud RunFastAPIDjangoPostgreSQL Cloud SQLRedis MemorystoreNew RelicCloud MonitoringCloud LoggingSentryBitbucket PipelinesTerraformGCP Secret ManagementBitbucket Secrets ManagementAdditional Environment DetailsPrimary runtime environment: Cloud RunInfrastructure as Code: Terraform (existing implementation with opportunities for modernization and improvement)CI/CD Platform: Bitbucket PipelinesDatabase Technologies: PostgreSQL Cloud SQL, Redis MemorystoreSecrets Management: GCP and BitbucketAreas We'd Like You to Help ShapeWe're looking for an engineer who can bring operational expertise and recommendations as we continue to mature our platform. Experience evaluating and improving infrastructure automation, observability, deployment workflows, and operational tooling is highly valued.What Success Looks LikeWithin your first 6–9 months, you will:Become self-sufficient in operating the Smart Access production environmentDevelop a deep understanding of the application architecture and operational workflowsImprove visibility into system health, reliability, and performanceStrengthen deployment and operational processesReduce operational friction for engineering teamsHelp ensure the platform scales reliably as Smart Access continues to growWhy Join Smart Access?Join immediately following recent, successful Series A financingHelp shape the infrastructure foundation of a rapidly growing companyWork directly with experienced engineering and product leadersTake ownership of meaningful technical challenges with visible business impactPlay a foundational role in building the operational intelligence layer for the future of supply chain execution