Infrastructure Engineer: Server Performance (Remote)
Job Summary
As the largest online distributor of restaurant supplies and equipment, WebstaurantStore offers a catalog of more than 430,000 products supported by fast, reliable shipping. Nearly all our technological design, development, and system management are handled in-house, allowing us to build custom, innovative solutions in a rapidly evolving e-commerce landscape.
Due to this growth, we are seeking an Infrastructure Engineer with a focus on Server Performance to support our on-premises infrastructure, centered on designing and creating high-performance computing solutions that power AI workloads and other resource-intensive business operations.
This role is primarily focused on on-premises infrastructure and does not focus on cloud administration.
Responsibilities
Administer and optimize physical servers (Dell), SQL Server environments, GPU-accelerated AI and 3D render servers, and VMware vSphere/ESXi virtual infrastructure (vCenter HA).
Own infrastructure performance tuning across compute, virtualization, and operating system layers to improve reliability, efficiency, and scalability.
Proactively identify bottlenecks, capacity constraints, and performance risks using monitoring and observability tools such as Prometheus and Grafana.
Investigate physical server issues end to end, perform root cause analysis, and determine the most appropriate corrective solution.
Manage physical and virtual infrastructure and support partner teams including DBRE, Security, SRE, Media, and Automated Warehouses.
Support server lifecycle management activities including provisioning, configuration, patching, upgrades, hardware refreshes, and decommissioning.
Perform hardware diagnostics, firmware and BIOS updates, RAID configuration, driver management, and out-of-band administration using tools such as iDRAC or equivalent technologies.
Participate in disaster recovery and business continuity efforts, including failover testing, backup and recovery validation, and restoration readiness.
Contribute to infrastructure projects such as migrations, platform rollouts, performance improvement initiatives, and hardware modernization efforts.
Support Windows Server and Linux-based environments, including RHEL, Ubuntu, and Rocky Linux.
Collaborate with internal stakeholders to solve complex infrastructure problems, recommend improvements, and strengthen long-term system health.