Nuvepro - Task Intelligence for the Enterprise
OpenAI

Cpustoragepop Wan Program Manager San Francisco

Comp$342K – $555K

Classified Tasks (15)

Automate 0%Augment 40%Human-Only 60%

Augment (6)

AI assists, human decides

Drive readiness to convert contracted compute capacity into schedulable production clusters

operational

Build integrated schedules spanning procurement, logistics, installation, storage readiness, network turn-up, testing, and production handoff

operational

Coordinate BOM readiness, server delivery, racks, optics, cabling, storage hardware, and vendor milestones

operational

Manage deployment of storage systems supporting training and inference workloads, including readiness, validation, performance checks, and scaling plans

technical

Build repeatable deployment playbooks, dashboards, governance cadences, and operating mechanisms for scale

operational

Identify risks early across supply chain, site readiness, technical constraints, and vendor execution, then drive mitigation plans

analytical

Human-Only (9)

Requires human judgment

Lead end-to-end execution of CPU/GPU cluster activation programs across OpenAI’s global infrastructure footprint

leadership

Own deployment programs for new Points of Presence (PoPs), backbone nodes, WAN expansion, and interconnection initiatives

leadership

Partner with engineering teams to align compute, storage, and networking dependencies before cluster activation

technical

Coordinate backbone capacity expansion, cross-connects, inter-region pathing, and cloud interconnect readiness with Azure and third-party providers

technical

Lead physical deployment execution including rack-and-stack, hardware bring-up, L1 validation, and site acceptance criteria

operational

Communicate milestones, escalations, and capacity forecasts to senior leadership

communication

Coordinate hardware readiness, site readiness, network pathing, storage availability, vendor execution, and engineering dependencies required to turn contracted infrastructure into live training and inference capacity

operational

Own complex cross-functional programs spanning compute cluster activation, storage deployment, PoP bring-up, and backbone expansion

leadership

Lead execution across CPU, Storage, PoP, and WAN infrastructure programs that unlock next-generation compute capacity

leadership

Job description

--- BEGIN UNTRUSTED EXTERNAL CONTENT (source: https://openai.com/careers/cpustoragepop-wan-program-manager-san-francisco/) --- Skip to main contentResearchProductsBusinessDevelopersCompanyFoundation(opens in a new window)Log inTry ChatGPT(opens in a new window)ResearchProductsBusinessDevelopersCompanyFoundation(opens in a new window)CPU/Storage/PoP-WAN Program Manager | OpenAICareersCPU/Storage/PoP-WAN Program ManagerHardware - San Francisco and SeattleApply now(opens in a new window)About the TeamOpenAI’s Infrastructure organization builds the systems that power frontier AI workloads at global scale. As compute demand accelerates, our ability to rapidly convert infrastructure investments into usable production capacity has become mission critical.The CPU / Storage / PoP / WAN team is responsible for the end-to-end infrastructure layers required to bring compute online: server and cluster activation, storage platforms, Points of Presence (PoPs), backbone connectivity, and global network expansion. We operate across first-party facilities, colocation environments, and strategic cloud partners to ensure OpenAI can scale reliably and quickly.About the RoleWe are seeking a highly technical Program Manager to lead execution across CPU, Storage, PoP, and WAN infrastructure programs that directly unlock OpenAI’s next generation compute capacity.In this role, you will own complex cross-functional programs spanning compute cluster activation, storage deployment, PoP bring-up, and backbone expansion. You will coordinate hardware readiness, site readiness, network pathing, storage availability, vendor execution, and engineering dependencies required to turn contracted infrastructure into live training and inference capacity.This role requires strong technical fluency across hardware systems, network infrastructure, storage architecture, and deployment execution. You should be comfortable operating from rack-level implementation details through executive-level capacity planning discussions.This role is based in San Francisco, CA, with travel as needed.Key ResponsibilitiesLead end-to-end execution of CPU / GPU cluster activation programs across OpenAI’s global infrastructure footprintDrive readiness to convert contracted compute capacity into schedulable production clustersOwn deployment programs for new PoPs, backbone nodes, WAN expansion, and interconnection initiativesBuild integrated schedules spanning procurement, logistics, installation, storage readiness, network turn-up, testing, and production handoffCoordinate BOM readiness, server delivery, racks, optics, cabling, storage hardware, and vendor milestonesPartner with engineering teams to align compute, storage, and networking dependencies before cluster activationManage deployment of storage systems supporting training and inference workloads, including readiness, validation, performance checks, and scaling plansCoordinate backbone capacity expansion, cross-connects, inter-region pathing, and cloud interconnect readiness with Azure and third-party providersLead physical deployment execution including rack-and-stack, hardware bring-up, L1 validation, and site acceptance criteriaBuild repeatable deployment playbooks, dashboards, governance cadences, and operating mechanisms for scaleIdentify risks early across supply chain, site readiness, technical constraints, and vendor execution, then drive mitigation plansCommunicate milestones, escalations, and capacity forecasts to senior leadershipQualifications8+ years of experience in technical program management, infrastructure deployment, network deployment, or data center operationsStrong experience delivering programs involving compute, storage, networking, or large-scale infrastructure systemsWorking knowledge of servers, clusters, storage arrays, routers, switches, optics, and structured cablingExperience owning cross-functional programs across engineering, operations, supply chain, and external vendorsStrong understanding of deploym
Source: OpenAI careers · scraped 2026-05-22
Apply at OpenAI