MLOps Engineer (Python) — AI Platform
If you've ever said "the model is fine — the infrastructure is broken"... this one's for you.AI Infrastructure Engineer (Python)$150,000 – $180,000 + Performance BonusNew York City | Hybrid (4 Days Onsite)$3 Billion Boutique Hedge Fund─────────────────────────────THE OPPORTUNITY─────────────────────────────A $3B+ boutique hedge fund has AI agents running in production. They work. Now they need someone to make them scale, stay up, and run fast under real-world load.This is not a research role. No Jupyter notebooks. No "we're exploring AI." It's already built — now they need the engineer who owns it.You'll be the person who keeps production humming, scales what's working, and builds the tooling that makes every data scientist on the team faster.─────────────────────────────WHAT YOU'LL ACTUALLY DO─────────────────────────────Write production-grade Python — services, pipelines, shared libraries, internal toolingOwn containerized deployments end to end — Docker, CI/CD, versioning, runtime managementKeep AI agents running at scale — high availability, performance, resilience under loadBuild observability into everything — logging, tracing, monitoring, alertingManage cloud infrastructure with Terraform across Azure and GCPDeploy and maintain workflow orchestration (Prefect)Troubleshoot production issues across the full stack — app to infra─────────────────────────────YOU'RE THE RIGHT FIT IF...─────────────────────────────3–5 years in Software Engineering, DevOps, or MLOpsPython is your primary language — and you've shipped it in productionDocker and containerized cloud deployments are second natureYou've used Terraform in a real environment, not just tutorialsYou think in systems — you debug end to end, not just your layerCI/CD and Git-based release workflows are part of your daily lifeYou're genuinely curious about AI infrastructure — not just checking a boxBonus points for:Azure (Container Apps, Key Vault, ACR, VNets) | GCP (Cloud Run, GKE, Vertex AI) | Prefect | LangChain | MCP | Langfuse | MLflow | dbt | Snowflake | Multi-cloud─────────────────────────────WHY THIS ROLE IS DIFFERENT─────────────────────────────AI agents are already in production — you're scaling something real, not building a proof of conceptGreenfield tooling mandate — you'll build the shared infrastructure the whole team runs onDirect impact — the platform you maintain shapes investment decisions at an institutional levelSmall, sharp team — real ownership, not a ticket queueStable, long-term seat at a fund that has been around and isn't going anywhereRoom to grow deeper into AI/ML concepts alongside the platform─────────────────────────────PLEASE READ BEFORE APPLYING─────────────────────────────Full-time, direct hire / W2 onlyUS Citizens and Green Card holders only — no visa sponsorshipNo C2C, no consulting, no third-party agencies─────────────────────────────This role won't be open long. If this sounds like you — or someone you know — DM me directly.#Python #DevOps #MLOps #AIInfrastructure #CloudEngineering #Docker #Terraform #Azure #GCP #LangChain #AgentAI #NowHiring #TechJobs #NewYorkJobs #SoftwareEngineer #PythonDeveloper #HedgeFund #AIEngineer #MachineLearning #Hiring