Data Engineer
ARCHIVED
We can't find an active application page for this role right now. It may reopen or be listed elsewhere. Use Next Steps to search for an active apply link and similar live jobs.
Senior Data Architect – Data EngineeringLocation: San Francisco, CAReports To: VP of EngineeringFLSA Status: ExemptEmployment Type: Full-TimeCompensation: $140,000 – $160,000 annually (based on experience)About CargomaticCargomatic is transforming the local trucking industry with cutting-edge technology that connects shippers and carriers in real time. Every product that humans build, grow, or sell has spent time on a truck. Local trucking is the lifeblood of every regional economy, yet this $82 billion industry still relies heavily on outdated systems. Cargomatic is bringing transparency, efficiency, and intelligence to local freight through modern technology and data-driven solutions.We are solving complex, real-world logistics problems every day. If you thrive in a fast-paced environment, enjoy building scalable systems, and want to help shape the future of AI-powered logistics, we’d love to meet you.Position SummaryCargomatic is seeking a Senior Data Architect – Data Engineering to design and build scalable, cloud-native data infrastructure that powers analytics, machine learning, and AI-driven applications. This role combines deep data architecture expertise with hands-on experience in modern data platforms and LLM-enabled application development.You will lead the design of enterprise-grade data models, architect RAG systems, implement agentic workflows, and integrate secure, production-ready LLM capabilities into our ecosystem. This is a high-impact role with significant ownership, visibility, and opportunity to shape the future of intelligent logistics technology.Key ResponsibilitiesData Architecture & EngineeringDesign and build scalable, cloud-native data pipelines (batch and streaming) supporting analytics, ML, and AI-powered applicationsArchitect enterprise-grade data models across data lakes, warehouses, and real-time systems (Snowflake, Databricks, Kafka, DBT)Define standards for data governance, reliability, performance, and cost optimizationOptimize storage formats and distributed data systems (Parquet, Delta Lake, Iceberg)AI & LLM-Enabled SystemsDevelop Retrieval-Augmented Generation (RAG) systems integrating structured and unstructured enterprise dataDesign and implement agentic workflows using frameworks such as LangChain, LangGraph, LlamaIndex, n8n, or similarIntegrate LLM APIs (OpenAI, Anthropic, or similar) into secure, production-ready applicationsImplement guardrails, validation layers, monitoring, and evaluation frameworks to mitigate hallucination, prompt injection, and data security risksBackend & API DevelopmentBuild secure backend APIs (Python/FastAPI) to expose AI-powered capabilitiesEnsure observability, monitoring, and cost controls across AI and data servicesContribute to microservices architecture and distributed system designCollaboration & LeadershipPartner cross-functionally with Product, Engineering, and Operations to translate business requirements into scalable technical solutionsMentor junior engineers and contribute to architectural standards and best practicesDrive innovation in data engineering and AI-powered logistics systemsQualificationsBachelor’s degree in Computer Science or equivalent practical experience8+ years of software or data engineering experience in production environmentsStrong expertise in data modeling, distributed systems, and scalable cloud architecturesHands-on experience with ETL/ELT frameworks and streaming technologies (Kafka, Spark, HEVO, Snowflake, DBT, etc.)Advanced SQL skills and deep understanding of modern storage formatsProficiency in Python and RESTful API developmentExperience integrating LLM APIs into production applicationsStrong understanding of system reliability, observability, and cost management in cloud environmentsDesired ExperienceExperience building RAG pipelines including embeddings, vector search, chunking strategies, and hybrid retrievalExperience designing multi-agent or agentic AI workflows with orchestration frameworksKnowledge of LLM evaluation, monitoring, and tracing tools (LangSmith or similar)Experience with microservices architecture and distributed system designExposure to transportation, logistics, or supply chain domainsActive GitHub contributions or demonstrated passion for emerging AI and data technologiesWhy Join Cargomatic?We offer competitive compensation and a comprehensive benefits package, including:Medical, Dental, and Vision insurance401(k) with company matchFlexible Spending Accounts (FSA)Company-paid Life and Disability insuranceFlexible Paid Time Off (PTO) and company holidaysPaid Parental LeaveEmployee Assistance Program (EAP)Opportunity to build cutting-edge AI solutions in a high-growth logistics technology companyCollaborative, high-impact team environmentCargomatic is proud to be an Equal Opportunity Employer. We are committed to creating a diverse and inclusive workplace where all employees feel valued and empowered to succeed.