Azure Capacity Manager
Principal Azure Capacity Manager - Contract-to-Hire
New York, NY

Our leading financial services client is seeking a highly experienced Principal Azure Capacity Manager to lead enterprise cloud capacity planning and optimization initiatives within a highly regulated Azure environment. This role will focus on ensuring scalable, resilient, and compliant infrastructure capacity across compute, storage, networking, databases, and platform services while partnering closely with Site Reliability Engineering (SRE), Security, and Infrastructure teams.

Key Responsibilities
- Lead end-to-end Azure capacity management including forecasting, monitoring, optimization, and governance.
- Develop capacity models and establish buffer standards to support performance, resiliency, and disaster recovery objectives.
- Partner with SRE teams to implement autoscaling, performance tuning, observability, and Infrastructure as Code (IaC) best practices.
- Analyze utilization and performance trends to drive optimization, cost efficiency, and architecture improvements.
- Support disaster recovery readiness and failover capacity planning.
- Participate in change management and governance processes, ensuring proper documentation, compliance, and operational controls.
- Deliver executive-level reporting on capacity health, forecasting, utilization, and cost-performance metrics.
- Collaborate cross-functionally across engineering, operations, security, and finance teams.

Required Qualifications
- Bachelor’s degree in Computer Science, Engineering, or related field.
- 10–12+ years of experience in infrastructure capacity planning, performance engineering, or cloud operations.
- Strong experience working in regulated enterprise environments; financial services experience preferred.
- Deep understanding of Azure infrastructure services, scalability, and governance.
- Experience translating operational telemetry and analytics into strategic infrastructure decisions.
- Excellent communication, stakeholder management, and executive presentation skills.

Technical Skills
- Azure Monitor, Log Analytics, Cost Management, Reservations/Savings Plans
- AKS, VMSS, App Services, autoscaling strategies
- Terraform, Bicep, ARM templates, CI/CD integration
- Prometheus, Grafana, k6, JMeter
- Azure networking, security, Key Vault, and Managed HSM
- Azure Policy, governance frameworks, and configuration management

Preferred Experience
- FedRAMP High or highly regulated cloud environments
- Multi-region Azure architecture and disaster recovery planning
- Collaboration with SRE teams on SLIs/SLOs and reliability engineering practices
- Continuous monitoring, audit support, and operational governance initiatives