Senior Data Center Operations Engineer
Company Overview

Milestone Technologies is a global IT managed services firm that partners with organizations to scale their technology, infrastructure and services to drive specific business outcomes such as digital transformation, innovation, and operational agility. Milestone is focused on building an employee-first, performance-based culture and for over 25 years, we have a demonstrated history of supporting category-defining enterprise clients that are growing ahead of the market. The company specializes in providing solutions across Application Services and Consulting, Digital Product Engineering, Digital Workplace Services, Private Cloud Services, AI/Automation, and ServiceNow. Milestone culture is built to provide a collaborative, inclusive environment that supports employees and empowers them to reach their full potential.

Our seasoned professionals deliver services based on Milestone's best practices and service delivery framework. By leveraging our vast knowledge base to execute initiatives, we deliver both short-term and long-term value to our clients and apply continuous service improvement to deliver transformational benefits to IT. With Intelligent Automation, Milestone helps businesses further accelerate their IT transformation. The result is a sharper focus on business objectives and a dramatic improvement in employee productivity. Through our key technology partnerships and our people-first approach, Milestone continues to deliver industry-leading innovation to our clients.
With more than 3,000 employees serving over 200 companies worldwide, we are following our mission of revolutionizing the way IT is deployed.

Job Overview

The Senior Data Center Operations Engineer plays a critical, hands-on role in supporting the build-out and long-term operation of a high-performance, enterprise-scale data center environment supporting advanced compute and large-scale infrastructure deployments.

This position is designed for an experienced engineer with deep expertise in server hardware, Linux systems, and data center operations, operating within environments that demand high availability, precision, and performance. You will contribute during the initial deployment phase, supporting infrastructure bring-up, validation, and hardware readiness. As the environment transitions into steady-state operations, you will take ownership of ongoing reliability, advanced troubleshooting, and continuous improvement initiatives.

This role requires a strong operator mindset: someone who thrives in complex, production-critical environments and takes pride in resolving issues at their root.
You will serve as a primary technical escalation point, working closely with engineering and infrastructure teams to maintain system stability and performance.

You will collaborate with cross-functional teams, making clear and professional communication in English (written and verbal) essential for success in this role.

This role offers continuity across both deployment and operational phases and provides exposure to large-scale, modern infrastructure environments, with a clear path for progression into advanced technical or engineering roles.

Key Responsibilities

Advanced Hardware Troubleshooting & Repair
- Diagnose and resolve complex hardware failures across server platforms (motherboards, CPUs, memory, storage)
- Perform component-level repairs and replacements on servers and data center hardware
- Execute break/fix processes with a focus on minimizing downtime and meeting SLAs
- Conduct root cause analysis (RCA) of hardware failures and implement preventative improvements
- Identify recurring failure trends and contribute to tooling, automation, and process enhancements

Linux Systems & Platform Support
- Utilize Linux command-line tools for system monitoring, diagnostics, and troubleshooting
- Support provisioning and deployment of servers across Linux distributions (RHEL, Ubuntu, etc.)
- Troubleshoot boot-level and OS-level issues in production environments
- Collaborate with engineering teams to resolve complex hardware/software interaction issues

Data Center Operations
- Support hardware installation, structured cabling, and infrastructure validation
- Maintain accurate inventory of spare parts, assets, and retired equipment
- Document repairs, changes, and configurations in ITSM/DCIM systems
- Ensure adherence to safety, security, and operational protocols
- Serve as a primary escalation point for complex infrastructure issues
- Participate in on-call rotation supporting 24x7 operations

Collaboration & Mentorship
- Provide guidance and mentorship to technicians on hardware troubleshooting and best practices
- Collaborate with network, storage, and infrastructure teams to resolve cross-functional issues
- Contribute to knowledge sharing, documentation, and operational excellence initiatives
- Support continuous improvement efforts across processes, tooling, and operational workflows

Required Skills
- Strong English communication skills (written and verbal) are required for coordination with cross-functional teams
- Expert-level knowledge of server hardware architecture and component-level troubleshooting
- Strong proficiency with Linux systems and command-line diagnostics
- Solid understanding of networking fundamentals and infrastructure components
- Experience working within structured operational environments (SOPs, SLAs, ticketing systems)
- Familiarity with ITSM/DCIM tools (ServiceNow, Jira, or similar)
- Experience with structured cabling and fiber optic connectivity
- Strong analytical and problem-solving skills with attention to detail
- Ability to operate effectively in high-pressure, high-availability environments
- Strong organizational and documentation skills

Required Experience
- 5+ years of experience in data center operations or similar infrastructure environments
- Significant hands-on experience with server hardware troubleshooting and repair
- Minimum of 2 years of experience working with Linux operating systems in production environments
- Experience supporting enterprise server platforms and infrastructure environments
- Demonstrated experience performing root cause analysis and resolving complex hardware issues
- Experience working within ticketing systems and operational workflows
- Exposure to data center build-outs, deployments, or infrastructure upgrades (preferred)

Preferred Certifications
- CompTIA A+, Server+, or Linux+
- LPI certification or equivalent
- Vendor-specific hardware certifications

Physical Requirements
- Ability to lift and move equipment up to 50 lbs
- Ability to work in a temperature-controlled environment with moderate noise levels
- Ability to perform physical tasks such as standing, walking, bending, and kneeling for extended periods

Compensation

Estimated Pay Range: 47.00 - 68.00 /hr

Exact compensation and offers of employment are dependent on circumstances of each case and will be determined based on job-related knowledge, skills, experience, licenses or certifications, and location.

Our Commitment to Diversity & Inclusion

At Milestone we strive to create a workplace that reflects the communities we serve and work with, where we all feel empowered to bring our full, authentic selves to work. We know creating a diverse and inclusive culture that champions equity and belonging is not only the right thing to do for our employees but is also critical to our continued success.

Milestone Technologies provides equal employment opportunity for all applicants and employees. All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of race, color, religion, gender, gender identity, marital status, age, disability, veteran status, sexual orientation, national origin, or any other category protected by applicable federal and state law, or local ordinance. Milestone also makes reasonable accommodations for disabled applicants and employees.

We welcome the unique background, culture, experiences, knowledge, innovation, self-expression and perspectives you can bring to our global community. Our recruitment team is looking forward to meeting you.