JOBSEARCHER

Principal Engineer

A2zDenver, COMay 6th, 2026
Why This Role ExistsWe operate a multi-tenant automotive SaaS platform serving thousands of dealer groups across the United States. Our backend — event-driven serverless on AWS (Lambda, EventBridge, DynamoDB, S3, Step Functions) — orchestrates everything from dealer onboarding to inventory management to real-time transaction processing. That platform works. Now we need to make it think.We are building agentic AI systems: autonomous, tool-using agents that observe platform state, reason over dealer context, take action through production APIs, and learn from outcomes. These are not chatbots bolted onto a dashboard. They are first-class platform services — backed by AWS Bedrock, connected to production systems via MCP servers — that make decisions, execute workflows, and close loops without human intervention unless guardrails say otherwise.This Principal Engineer owns that entire surface. You are not advising on AI strategy from a whiteboard. You are writing agent code, defining tool interfaces, building evaluation harnesses, setting cost and latency budgets, and shipping production AI workflows that touch real dealers and real money. You set the engineering patterns the team follows, you help make the build-vs-buy calls, and when an agent misbehaves at 2 AM, your architecture is what determines whether it fails safe or fails loud.Scope & Scale5000+ destination dealer tenants, each with isolated databases and per-tenant configurationBillions in annual Gross Merchandise Value (GMV) flowing through platform transactionsTens of thousands of API requests per minute across REST, SOAP, and event-driven integration surfacesData pipelines spanning 6 integration domains with multi-protocol vendor connectivityWhat You Will OwnOwnership and core development of agentic AI systems — designing, building, and operating the AI agent infrastructure (AWS Bedrock, MCP servers) that powers intelligent automation across the platform. You are not advising on AI strategy; you are writing the agent code, defining the tool interfaces, building the evaluation harnesses, and shipping production AI workflowsAI agent lifecycle end to end — from prompt engineering and tool-use design through guardrails, evaluation, cost optimization, and production observability. You own the patterns the team uses to build with AI: how agents connect to production systems, how we evaluate output quality, how we manage model costs at scale, and how we roll back when an agent misbehavesSystem design and technical decision-making for migration waves — from identity/tenant services through core domain extraction and frontend decompositionThe dual-write framework, API Gateway traffic-splitting, and per-tenant feature flag rollout that make every migration step reversibleCross-cutting concerns: observability (OpenTelemetry, CloudWatch), security posture (Auth0 consolidation, IAM), and data architecture (DynamoDB single-table design, Aurora consolidation)Mentoring and force-multiplying senior ICs — establishing patterns, reviewing designs, and raising the technical bar across 5 engineering teamsConsolidate and strategize 30+ different integrations and make the future integrations easierTechnical EnvironmentCloud Services: High-availability AWS stack including Lambda, EventBridge, DynamoDB, S3, ECS Fargate, Aurora, API Gateway, CloudWatch, and Secrets ManagerDevelopment Languages: Modern Python and Java (Spring Boot) alongside TypeScript/React (Next.js 16) frontends, with legacy domain coverage in PHP/LaravelAI & Agentic Systems: Advanced agentic workflow orchestration utilizing lean AWS Bedrock AgentCore, MCP servers, or LangChain/LangGraph frameworksData Engineering: Complex data architectures featuring DynamoDB single-table design, MySQL/Aurora, S3 data lakes, Glue Data Catalog, Athena, and Data pipelinesInfrastructure & Security: Enterprise-grade CI/CD and observability via CloudFormation, Auth0 consolidation, OpenTelemetry, and CircleCIIntegration Surfaces: Multi-protocol connectivity spanning REST, SOAP/XML, EventBridge event-bus patterns, SES processing, and Playwright browser automationFirst 12 MonthsMonths 1-3: Immerse in the codebase. Audit the current architecture across all stacks. Publish the first Architecture Decision Record (ADR) for the next migration wave. Establish your design review cadence with the teamMonths 4-6: Drive the AI/agentic integration layer — Bedrock-powered automation in at least one production workflow. Establish the patterns for how the team builds with AI going forward; both agentic insight retrieval agentic workflow automationMonths 7-9: Own and deliver the first migration wave end-to-end — from design doc through production cutover with dual-write validation. Stand up the observability baseline (OpenTelemetry instrumentation, dashboards, SLOs)Months 10-12: Second migration wave in production. Architecture runway documented for the next 12 months. The team operates at a higher technical bar because of patterns you setRequirementsYou Should Have8+ years of software engineering experience with at least 3 years in a Staff / Principal / Architect roleBuild Cloud native solutions with emphasis on speed to marketDeep hands-on experience (About 40-50% in the code, 40-50% in Design) with AWS serverless (Lambda, EventBridge, DynamoDB, StepFunctions) and traditional service architectures (ECS, RDS, API Gateway)Experience spending 10-20% of your time in mentoring, cross-team alignment and operating mechanismsTrack record of leading monolith-to-services migrations — strangler fig, dual-write validation, traffic-splitting, canary rolloutsFluency across multiple languages: you can review PHP, architect & prototype Python, architect Java services, and reason about TypeScript frontendsExperience with event-driven architectures, configuration-driven workflow engines, and DynamoDB single-table designStrong opinions on observability and the discipline to instrument before you migrateThe ability to write a design doc that a senior IC can implement without ambiguity, and the judgment to know when to write code yourself insteadPreferred QualificationsAutomotive, fintech, or multi-tenant marketplace platform experienceExperience with data pipelines (NiFi, Glue/Athena) or ETL/data lake toolingFamiliarity with Auth0 Organization model, M2M apps, and Actions for JWT enrichmentExperience with browser automation (Playwright) in production integration flowsBenefitsAbout A2Z SyncA2Z Sync is a fast-paced and innovative automotive SaaS company seeking to make life better for our customers. We offer you a fun, casual, and collaborative culture, while fostering an environment where you work hard, see your results, and feel your impact. We are committed to our employees, and this starts with providing benefits that allow you to care for you and your family.MissionAt A2Z Sync, we replace the friction of disconnected systems with the velocity of a single platform. We integrate digital insights with in-store operations to deliver transparent transactions that bring clarity to the car buyer and increased profitability to the dealer.Our Values: We Are DRIVENDealership Obsessed: We measure our success by the dealer's wins and the trust of their buyers, not just our own codeRelentless Ownership: No lone wolves, but no pass-backs either. We don't say "that's not my job."Invent with Purpose: We don't chase "shiny" tech. We replace guesswork with intelligence, building the "data backbone" that turns raw information into a competitive advantageValue Every Perspective: We are Better Together. We check egos at the doorEvolve or Evaporate: Change is our constant. We stay ahead by learning faster than the competitionNow Over Next: Perfection is the enemy of progress. We prefer action over endless analysisHere's how we are doing it:A2Z Sync offers comprehensive medical, dental, and vision benefitsEmployer provided STD/LTD and life insuranceMatching 401k planUnlimited paid time off, including 10 paid holidaysReal ownership of a high-stakes AI surface — your roadmap, your architecture decisions, your metrics