Nuvepro - Task Intelligence for the Enterprise
xAI · Data Center · Memphis, TN

Manager, Operations

Classified Tasks (17)

Automate 0% · Augment 65% · Human-Only 35%

Augment (11)

AI assists, human decides

Own day-to-day and long-term performance of mission-critical data center operations including power generation, power distribution, cooling, mechanical, electrical, and environmental systems.

operational

Drive reliability and efficiency initiatives to achieve continuous 24/7 infrastructure availability.

operational

Ensure seamless 24/7 uptime for the infrastructure powering AI training.

operational

Manage operation, maintenance, monitoring, and optimization of on-site power generation assets, electrical systems, mechanical/HVAC, liquid cooling, power distribution, UPS, generators, and building management systems.

technical

Oversee design, deployment, maintenance, and expansion of high-speed fiber optic networks, dark fiber, and connectivity infrastructure supporting AI compute clusters and data center interconnects.

technical

Own, track, and report key performance metrics including uptime (targeting 99.999%+), MTTD/MTTR, PUE, WUE, power generation efficiency, and overall infrastructure availability.

analytical
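
For readers unfamiliar with the metrics named above, the arithmetic behind them can be sketched as follows. This is an illustrative sketch only: the function names and sample figures are assumptions, not taken from the posting, and the formulas are the commonly used definitions (PUE as total facility energy over IT equipment energy, WUE as site water use per kWh of IT energy).

```python
# Illustrative sketch; names and sample numbers are hypothetical, not xAI's.

MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes in a non-leap year


def allowed_downtime_minutes(availability_pct: float) -> float:
    """Yearly downtime budget implied by an availability target."""
    return MINUTES_PER_YEAR * (1 - availability_pct / 100)


def pue(total_facility_kwh: float, it_equipment_kwh: float) -> float:
    """Power usage effectiveness: total facility energy / IT equipment energy."""
    return total_facility_kwh / it_equipment_kwh


def wue(site_water_liters: float, it_equipment_kwh: float) -> float:
    """Water usage effectiveness: liters of water per kWh of IT energy."""
    return site_water_liters / it_equipment_kwh


def mean_minutes(durations_min: list[float]) -> float:
    """Mean of per-incident durations; usable for MTTD or MTTR."""
    return sum(durations_min) / len(durations_min)


if __name__ == "__main__":
    # A 99.999% target leaves roughly 5.26 minutes of downtime per year.
    print(round(allowed_downtime_minutes(99.999), 2))
    # Hypothetical month: 1,200 MWh facility total, 1,000 MWh to IT load.
    print(pue(1_200_000, 1_000_000))
    # Hypothetical repair times (minutes) for three incidents.
    print(mean_minutes([12.0, 30.0, 18.0]))
```

The downtime figure shows why a "five nines" target is demanding: the whole yearly budget is a little over five minutes.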

Develop and enforce standard operating procedures (SOPs) for facilities and power generation operations.

operational

Implement and maintain preventive maintenance programs for critical infrastructure and power generation assets.

operational

Develop and enforce incident response protocols and continuous improvement processes to minimize downtime and maximize efficiency.

operational

Manage operational budgets for facilities, power generation, and fiber operations.

administrative

Manage spare parts inventory for mission-critical infrastructure.

administrative

Human-Only (6)

Requires human judgment

Lead and scale the facilities operations and power generation teams responsible for reliable operation of hyperscale AI compute facilities.

leadership

Direct fiber teams responsible for high-capacity networking and connectivity that support supercomputing clusters.

leadership

Build and lead high-performing operations, power generation, and fiber teams.

leadership

Build, mentor, and grow multidisciplinary teams of operations technicians, power generation engineers, and controls specialists.

leadership

Partner with engineering, construction, procurement, and AI hardware teams to support new facility builds, expansions, commissioning, power integration, and handovers to operations.

communication

Manage vendor relationships with maintenance contractors, fiber providers, power generation OEMs, and fuel suppliers.

communication

Job description

ABOUT xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

ABOUT THE ROLE:

We are seeking an exceptional Manager, Operations to lead facilities operations and power generation for xAI’s hyperscale AI compute facilities. This role will own the day-to-day and long-term performance of mission-critical data center operations, including power generation, power distribution, cooling, mechanical, electrical, and environmental systems, while also directing the fiber teams responsible for high-capacity networking and connectivity that support our supercomputing clusters. You will build and lead high-performing operations, power generation, and fiber teams, drive relentless reliability and efficiency, and ensure seamless 24/7 uptime for the infrastructure powering xAI’s AI training at unprecedented scale. This high-impact position requires deep expertise in data center or hyperscale operations (including power generation), strong leadership in fast-paced environments, and the ability to deliver world-class performance under aggressive growth timelines. This is a full-time, primarily onsite role with significant travel to sites and vendor locations.

RESPONSIBILITIES:

Lead and scale the facilities operations and power generation teams responsible for the reliable operation, maintenance, monitoring, and optimization of critical infrastructure including on-site power generation assets, electrical systems, mechanical/HVAC, liquid cooling, power distribution, UPS, generators, and building management systems.

Direct the fiber teams overseeing the design, deployment, maintenance, and expansion of high-speed fiber optic networks, dark fiber, and connectivity infrastructure supporting AI compute clusters and data center interconnects.

Own key performance metrics such as uptime (targeting 99.999%+), mean time to detect/repair (MTTD/MTTR), power usage effectiveness (PUE), water usage effectiveness (WUE), power generation efficiency, and overall infrastructure availability.

Develop and enforce standard operating procedures (SOPs), preventive maintenance programs, incident response protocols, and continuous improvement processes for both facilities and power generation assets to minimize downtime and maximize efficiency.

Build, mentor, and grow multidisciplinary teams of operations technicians, power generation engineers and controls specialists while fostering a culture of ownership, safety, and excellence.

Partner closely with engineering, construction, procurement, and AI hardware teams to support new facility builds, expansions, commissioning, power integration, and smooth handovers from project to operations.

Manage operational budgets, vendor relationships (maintenance contractors, fiber providers, power generation OEMs, fuel suppliers), spare parts inven

Source: xAI careers · scraped 2026-05-22