Senior Software Engineer, Platform
Gridmatic is a high-growth startup and a new kind of energy company, delivering affordable, clean power by optimizing renewable energy and grid-scale batteries. With offices in the Bay Area and Houston, we bring together Silicon Valley–style innovation with deep, hands-on expertise in real-world power markets and energy retail.As solar and wind become the fastest-growing sources of electricity, variability from weather and grid conditions makes energy prices more volatile. Gridmatic tackles this challenge with industry-leading forecasting and optimization—and gives our team the opportunity to work on problems that truly matter. Forecasting and trading energy are the foundation of what we do. We ingest large-scale data—weather, prices, load, and grid conditions—to build probabilistic machine learning forecasts that drive real operational decisions. Our work directly determines when power is bought, stored, or deployed, turning uncertainty into value for customers and the grid.Our impact is measurable. Gridmatic is the most profitable participant in ERCOT’s wholesale market and operates the top-performing battery asset in CAISO. Profitable without venture capital, we offer a collaborative, low-ego environment where rigorous thinking, autonomy, and continuous learning are core to how we work.The RoleWe're looking for a Senior Software Engineer to join our Platform team and build the foundational infrastructure that powers Gridmatic. Our platform challenges are shaped by the nature of energy markets: forecasts and trading decisions run on tight schedules, battery dispatch commands must execute reliably in real time, and ML models need to train and deploy continuously as new data arrives.What You'll DoDesign and build our compute platform, creating infrastructure that supports production services, batch jobs, and ML training workloadsWork on real-time systems for operating battery assets, where reliability directly impacts both revenue and grid stabilityEstablish patterns, tooling, and best practices that help teams across Gridmatic run services reliablyMake architectural decisions that shape how we build software as we growWhat We're Looking ForSignificant experience building and operating production infrastructure on a public cloud platform (we run on GCP, but AWS or Azure experience translates well)Hands-on experience with Kubernetes (we run on GKE and it's foundational to our platform)Proficiency in PythonExperience with infrastructure-as-code tools like TerraformEither already know Go or have experience with a similar systems language (C++, Java, Rust) and are excited to work in Python and Go day-to-daySystems thinking—understanding how components interact, where failures can cascade, and how to build for efficiency, scalability, and resilienceClear communication, whether writing a design doc, reviewing code, or explaining a complex system to someone new to itOpinions about how to build reliable infrastructure, informed by experience with what works and what doesn't at scaleNice to HaveBackground in ML infrastructure or training pipelinesExperience with workflow orchestration tools (Flyte, Temporal, Airflow, or similar)Familiarity with observability tooling (we use Grafana + Google Cloud Monitoring)Experience with real-time systemsStrong Linux fundamentalsPrior work in domains where latency and reliability have direct business consequences$209,000 - $235,000 a yearJoin our team and make a difference! Click below or email us atWe may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.