Machine Learning Engineer

ScalejobsNew York, NYJune 5th, 2026

Computer Systems Engineers/ArchitectsComputer Systems Design and Related Services

About The RoleThe role involves architecting and scaling machine learning systems that power core product features, focusing on the bridge between experimental research and high-availability production environments. The engineer will collaborate with cross-functional teams to integrate predictive models into real-time application workflows, ensuring high throughput and low latency for global users.The team focuses on solving complex problems in natural language processing and predictive analytics, utilizing modern LLM orchestration and MLOps practices. This position is critical for driving the technical roadmap of model deployment, monitoring infrastructure, and continuous integration of new machine learning capabilities.Key ResponsibilitiesBuild and maintain end-to-end ML pipelines using Python, PyTorch, and orchestration tools like Airflow or Prefect to automate training and deployment workflowsDevelop and optimize Retrieval-Augmented Generation (RAG) systems and agentic workflows using frameworks like LangChain or LlamaIndex for enterprise-grade applicationsImplement robust monitoring and observability for models in production, tracking metrics for data drift, concept drift, and inference latency using tools like Prometheus or Weights & BiasesOptimize model inference performance through quantization, pruning, or the use of specialized engines like TensorRT and ONNX RuntimeContainerize machine learning services using Docker and Kubernetes to ensure scalable and reproducible deployments across cloud environments (AWS or GCP)Collaborate on data strategy and feature engineering, building scalable data transformation modules with PySpark or SQL to support high-dimensional model inputsWhat We Are Looking For3–6 years of professional experience in machine learning engineering or software engineering with a focus on data-intensive applicationsProven track record of deploying and scaling at least two machine learning models in a production environment serving significant trafficAdvanced proficiency in Python and deep familiarity with deep learning frameworks such as PyTorch or JAXHands-on experience with vector databases like Pinecone, Weaviate, or Milvus and their integration into semantic search systemsBS, MS, or PhD in Computer Science, Data Science, Mathematics, or a related quantitative technical fieldBonus: Experience with parameter-efficient fine-tuning (PEFT), distributed training techniques (DeepSpeed/FSDP), or contributing to open-source ML libraries

Machine Learning Engineer

matching similar jobs near New York, NY