Senior IT VMware Systems Engineer
The VMware Virtualization SME provides expert level strategy, design, and 24 × 7 × 365 support for the Americas region virtualization platform. The role owns performance monitoring, automation, and continuous improvement of the VMware ecosystem—including VMware Cloud Foundation 9 (VCF 9), Aria Operations, vRealize Log Insight (VRLI), vRealize Network Insight (VRNI), NSX T, and vSAN—while pioneering AI driven operational models and cost optimization initiatives.Core Responsibilities• Platform Architecture & Migrationo Lead the migration to VMware Cloud Foundation 9 (VCF 9), ensuring zero downtime, compliance, and alignment with security baselines.o Design and evolve the integrated VMware stack (vSphere, NSX T, vSAN, Aria Operations, VRLI, VRNI) to meet ultra low latency and high availability requirements.• Automation & AI Enabled Operationso Implement Infrastructure as Code (Ansible, Terraform, PowerCLI) to automate provisioning, patching, and lifecycle management of the virtualization layer.o Deploy agentic AI assistants (LLM powered chat ops) for ticket triage, predictive alerting, and automated root cause analysis within Aria Operations.o Create self healing playbooks that remediate common performance or capacity events without human intervention.• Performance Monitoring & Capacity Managemento Configure, fine tune, and maintain monitoring thresholds, alarms, and dashboards in Aria Operations, VRLI, and VRNI.o Use AI driven anomaly detection to anticipate capacity bottlenecks and latency spikes before they affect production.• Process Improvement & Standardisationo Facilitate environment wide process improvement initiatives (change, release, and incident management) to increase efficiency and consistency.o Ensure all deployments adhere to group standards, best practices, and security hardening guides (CIS, VMware Hardened Base Image).• Vendor & Global Team Collaborationo Interface with VMware, storage, networking, and hyper converged hardware vendors; coordinate with global IT teams to keep the platform aligned with enterprise standards.• Disaster Recovery & Business Continuityo Participate in DR planning, testing, and execution for the virtualization environment; maintain RPO ≤ 5 seconds and RTO ≤ 15 minutes for critical workloads.• Operational Supporto Provide Tier 2/3 support for production workloads, including 24 × 7 on call rotation.o Conduct thorough morning and end of day health checks using scripted tools and AI generated health scores.o Perform OS and firmware upgrades, mandatory security patches, and storage system updates through automated pipelines.• Stakeholder & Service Managemento Liaise with application owners to gather requirements, design standards, and deploy consistent virtual infrastructure services.o Maintain the service catalogue for internal business lines, regularly reviewing consumption, pricing, and performance metrics.• Reporting & Governance