xAI · Data Center · Memphis, TN
Sr. Manager, Engineering
Classified Tasks (20)
Automate 0% · Augment 70% · Human-Only 30%
Augment (14)
AI assists, human decides
Oversee design, development, and optimization of critical systems including foundations, seismic design, site development, underground utilities, facility layouts, high-voltage systems, switchgear, UPS, generators, redundancy topologies, chillers, CRAC/CRAH, BMS/SCADA/PLCs, and high-capacity fiber backbone/interconnects
technical
Develop and own technical standards, design guidelines, and best practices for power density, thermal management, structural integrity, seismic resilience, fiber performance and redundancy, uptime targets, efficiency (PUE/WUE), and scalability
technical
Drive innovation and implement advanced liquid cooling solutions, high-efficiency power delivery architectures, intelligent control systems, sustainable material and design choices, optimized facility layouts, and high-bandwidth/low-latency fiber infrastructure
technical
Manage engineering budgets, project schedules, risk assessments, and vendor relationships with OEMs, EPC firms, and other suppliers
operational
Oversee commissioning, startup, testing, and performance verification of facilities and systems prior to handover
operational
Design systems to achieve high availability targets (e.g., 99.999%+ uptime) and to support massive GPU/accelerator clusters
technical
Optimize facility efficiency metrics such as PUE and WUE through design and operational improvements
analytical
Specify and validate mechanical and HVAC systems, including liquid cooling, chillers, CRAC/CRAH units, for high-density computing environments
technical
Specify and validate electrical systems including high-voltage distribution, switchgear, UPS, generators, and redundancy topologies to support high-density loads
technical
Specify and validate controls and automation systems (BMS, SCADA, PLCs) for monitoring and automation of facility operations
technical
Specify, design, and validate high-capacity fiber optic networks for backbone, interconnects, and data hall connectivity
technical
Conduct risk assessments and implement mitigation plans for project and operational risks
analytical
Coordinate with vendors and contractors through procurement, installation, commissioning, and warranty phases
operational
Scale infrastructure designs and standards to support rapid expansion and unprecedented compute density
leadership
Human-Only (6)
Requires human judgment
Lead end-to-end engineering of physical infrastructure for xAI data centers and compute facilities, covering structural, civil, architectural, controls, electrical, mechanical, HVAC, liquid cooling, power distribution, and fiber optic networks
leadership
Build, mentor, and grow a multidisciplinary engineering team spanning structural, civil, architectural, controls, electrical, mechanical, and fiber infrastructure disciplines
leadership
Drive technical strategy for new builds, facility expansions, and continuous improvement of mission-critical compute facilities
leadership
Partner with construction, operations, procurement, and AI hardware teams to ensure integration from design through commissioning, startup, and handover
communication
Ensure structural and seismic design compliance and buildability for compute facility foundations and site development
technical
Travel to project sites and vendor locations to oversee design implementation, construction progress, and commissioning activities
operational
Job description
ABOUT xAI
xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.
ABOUT THE ROLE:
We are seeking an exceptional Sr. Manager, Engineering to lead physical infrastructure engineering for xAI’s hyperscale AI compute facilities. This role will oversee the design, development, and optimization of critical systems including structural, civil, architectural, controls, electrical, mechanical, HVAC, liquid cooling, power distribution, fiber optic networks, and related infrastructure that power our rapidly expanding supercomputing clusters. You will build and lead a world-class multidisciplinary engineering team while driving technical strategy for new builds, expansions, and continuous improvement of mission-critical facilities. This high-impact position requires deep expertise in data center or industrial-scale infrastructure, a bias for rapid execution, and the ability to deliver reliable, high-density, energy-efficient systems at unprecedented scale. This is a full-time, primarily onsite role with significant travel to project sites and vendor locations.
RESPONSIBILITIES:
Lead the end-to-end engineering of physical infrastructure for xAI data centers and compute facilities, including structural and civil engineering (foundations, seismic design, site development, underground utilities), architectural engineering (facility layout, aesthetics, code compliance, and buildability), electrical systems (high-voltage, switchgear, UPS, generators, redundancy topologies), mechanical systems (HVAC, liquid cooling, chillers, CRAC/CRAH), controls & automation (BMS, SCADA, PLCs, monitoring), and high-capacity fiber optic networks (backbone, interconnects, and data hall connectivity).
Build, mentor, and grow a high-performing multidisciplinary engineering team covering structural, civil, architectural, controls, electrical, mechanical, fiber infrastructure, and related disciplines.
Develop and own technical standards, design guidelines, and best practices for power density, thermal management, structural integrity, seismic resilience, fiber performance and redundancy, uptime (targeting 99.999%+), efficiency (PUE/WUE optimization), and scalability to support massive GPU/accelerator clusters.
Partner closely with construction, operations, procurement, and AI hardware teams to ensure seamless integration from design through commissioning, startup, and handover.
Drive innovation in areas such as advanced liquid cooling, high-efficiency power delivery, intelligent controls, sustainable materials and design, optimized facility layouts, and high-bandwidth, low-latency fiber infrastructure to meet the extreme demands of next-generation AI training.
Manage engineering budgets, schedules, risk assessments, and vendor relationships (OEMs, EPC firms, co