<Back to Search
Senior ML Infrastructure Engineer
Millbrae, CAMarch 31st, 2026
Senior ML Infrastructure EngineerGridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid. We pioneered a groundbreaking new class of grid management called active grid response (AGR), focused on monitoring the electrical, physical, and environmental aspects of the grid that affect reliability and safety. Gridware's advanced Active Grid Response platform uses high-precision sensors to detect potential issues early, enabling proactive maintenance and fault mitigation. This comprehensive approach helps improve safety, reduce outages, and ensure the grid operates efficiently. The company is backed by climate-tech and Silicon Valley investors.
As a Senior ML Infrastructure Engineer, you will work directly in the Automation org with the core ML, Ops, and Analytics teams to help improve and build out the infrastructure around model deployment and monitoring. This role is essential to helping scale out the amount of time saving's Gridware brings to customers.
Responsibilities
Design, build, and maintain the infrastructure, tooling, and workflows that enable reliable, scalable deployment of ML models to production.
Develop monitoring and observability systems to track model performance, data drift, data quality, and overall system health.
Create and maintain end-to-end testing frameworks and simulation environments to validate models and pipelines prior to deployment.
Work closely with Data Engineering and Platform Engineering teams to ensure ML systems integrate cleanly with broader Gridware infrastructure and operational standards.
Improve CI/CD pipelines for ML workloads, ensuring reproducibility, safe rollout, and automated rollback strategies.
Required Skills
5+ years of experience building production ML infrastructure
Strong software engineering skills and proficiency in Python
Experience with cloud platforms (AWS) and container orchestration (Kubernetes)
Familiarity with feature stores, model registries, or centralized metadata systems (i.e. MLFlow)
$190,000 - $210,000 a year
At this time, Gridware is unable to provide visa sponsorship or immigration support for this role. We're only able to consider candidates who are currently authorized to work in the country of employment without visa sponsorship now or in the future.
This describes the ideal candidate; many of us have picked up this expertise along the way. Even if you meet only part of this list, we encourage you to apply!
Benefits
Health, Dental & Vision (Gold and Platinum with some providers plans fully covered)
Paid parental leave
Alternating day off (every other Monday)
"Off the Grid", a two week per year paid break for all employees.
Commuter allowance
Company-paid training
Showing all 39,101 matching similar jobs
- Software Engineer, AI Developer Tooling
- Elastic AI Engineer
- Senior Back End Engineer - Manta Cares
- Principal Software Engineer - Commerce
- Senior Backend Engineer (Kotlin)
- Sr. Backend Engineer - Kotlin
- ML Operations Engineer
- Associate AI Engineer
- Staff Software Engineer
- Machine Learning Engineer
- Staff Software Engineer
- Staff Software Engineer, MLOps - Hybrid
- AI Systems Analyst III
- AI Automation Analyst
- Gen AI Architect
- Entry level Java Spring Microservices Developer/Data analyst/AI engineer
- Sr DevOps Cloud Engineer
- Executive Role - AI Strategy & Innovation Lead / Senior AI/ML Engineer & Lead Data Scientists
- junior fullstack software programmer/AI engineer/Data Scientist
- Senior Software Engineer - AI Fraud Detection
- Manager, Software Engineering, Full Stack (Global Payment Network)
- Senior Backend Software Engineer (New York City, Los Angeles, or San Francisco)
- Senior Back End Engineer AI Engineering San Francisco / Hybrid
- Senior DevOps Engineer for AI-Powered Infra (Remote)RemoteMarch 28th, 2026
- Remote Senior Software Engineer - Cloud Distributed Systems
- Staff Software Engineer, Analytics (Remote US)
- Azure VM DevOps Lead — Windows & Linux Automation (Remote)
- Platform Engineer – AI/ML CI/CD Infra (Remote)RemoteMarch 31st, 2026
- SRE
- AI Fleet Management Solutions
- Machine Learning Engineer - Hospital
- Senior Engineer - Payments Modernization
- Senior AI/ML Engineer - MLOps & Production AI Systems - Remote or Hybrid in MN/DC
- Sr. DevSecOps Engineer
- Lead / Principal Software Engineer - (Onsite) Washington DC, Philadelphia, or Wilmington DE
- Senior Backend Software Engineer
- Senior Forward Deployed AI Engineer, Enterprise
- Principal Software Engineer, Machine Learning
- Data & Automation Lead: Proofpoint & ML ModelsTempe, AZMarch 27th, 2026
- Principal AI Application Engineer