Nuvepro - Task Intelligence for the Enterprise
OpenAI· Hardware· San Francisco

Networking Operating System Firmware Engineer

Comp$266K – $445K

Classified Tasks (17)

Automate 0%Augment 65%Human-Only 35%

Augment (11)

AI assists, human decides

Bootstrap and scale the switching layer of AI supercomputers

technical

Build and maintain custom NOS images from scratch using open source components (SONiC, SAI, FRR, and related networking stacks)

technical

Work across the Linux kernel, switch ASIC SAI/SDKs, platform drivers, control-plane services, and orchestration layers

technical

Design, implement, test, and debug production NOS software across platform drivers, routing and control-plane state, ASIC programming, observability, and fleet integration

technical

Integrate, build, and configure Linux kernel components, device drivers, switch ASIC SDKs, and SAI layers

technical

Extend and customize NOS services for routing, telemetry, control-plane state, and distributed automation

technical

Implement and debug route, neighbor, next-hop, and ECMP programming flows from control-plane intent through ASIC hardware state

technical

Build software mechanisms to distinguish control-plane acceptance, SAI/SDK acceptance, and explicit hardware programming acknowledgement

technical

Evaluate switch silicon SDK releases, track vendor deliverables, and validate platform requirements with vendors and ASIC partners

communication

Integrate switches into fleet-wide monitoring, remote diagnostics, telemetry pipelines, and automated lifecycle workflows

operational

Develop robust CI/build pipelines for reproducible NOS builds and controlled rollout across the fleet

operational

Human-Only (6)

Requires human judgment

Resolve ambiguous, open-ended technical problems to drive feature development across software, hardware, and vendor boundaries

analytical

Bring up new switch platforms including thermal and fan control, power monitoring, transceiver management, watchdogs, OSFP CMIS, LEDs, CPLDs, and board-specific platform logic

operational

Validate ASIC configurations, link bring-up, SerDes tuning, buffer profiles, and performance baselines with hardware teams

operational

Debug complex issues spanning kernel drivers, platform monitoring, NOS services, routing agents, orchestration services, hardware signals, ASIC state, and network topology

technical

Support factory bring-up and qualification through mass deployment

operational

Collaborate on networking protocols and technologies to improve performance and reliability at AI factory scale

leadership

Job description

Networking Operating System Firmware Engineer | OpenAI Careers ## Networking Operating System Firmware Engineer Hardware - San Francisco Apply now(opens in a new window) ### **About the Team** OpenAI’s Hardware organization develops silicon and system-level solutions designed for the unique demands of advanced AI workloads. The team is responsible for building the next generation of AI-native silicon while working closely with software and research partners to co-design hardware tightly integrated with AI models. In addition to delivering production-grade silicon for OpenAI’s supercomputing infrastructure, the team also creates custom design tools and methodologies that accelerate innovation and enable hardware optimized specifically for AI. ### Role summary We are seeking a Networking Operating System Firmware Engineer to help bootstrap and scale the switching layer of our AI supercomputers. In this role, you will build and maintain custom NOS images from scratch, using open source components from SONiC, SAI, FRR, and related networking stacks while working across the Linux kernel, switch ASIC SAI/SDKs, platform drivers, control-plane services, and orchestration layers. This is a software engineering role that requires a deep understanding of networking, NOS internals, switch hardware, and production systems. You will design, implement, test, and debug production NOS software across platform drivers, routing and control-plane state, ASIC programming, observability, and fleet integration. The engineer in this role should be able to work through ambiguous, open-ended technical problems and drive feature development across software, hardware, and vendor boundaries. This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees. ### In this role, you will * Design, develop, and maintain custom NOS images for large-scale AI fabrics, using open source components from SONiC, FRR, and related networking stacks. * Integrate, build and configure Linux kernel components, device drivers, switch ASIC SDKs, and SAI layers. * Bring up new switch platforms, including thermal and fan control, power monitoring, transceiver management, watchdogs, OSFP CMIS, LEDs, CPLDs, and board-specific platform logic. * Extend and customize NOS services for routing, telemetry, control-plane state, and distributed automation. * Implement and debug route, neighbor, next-hop, and ECMP programming flows from control-plane intent through ASIC hardware state. * Build software mechanisms that distinguish control-plane acceptance, SAI/SDK acceptance, and explicit hardware programming acknowledgement. * Work with hardware teams to validate ASIC configurations, link bring-up, SerDes tuning, buffer profiles, and performance baselines. * Evaluate switch silicon SDK releases, track vendor deliverables, and validate platform requirements with vendors and ASIC partners. * Debug complex issues spanning kernel drivers, platform monitoring, NOS services, routing agents, orchestration services, hardware signals, ASIC state, and network topology. * Integrate switches into fleet-wide monitoring, remote diagnostics, telemetry pipelines, and automated lifecycle workflows. * Develop robust CI/build pipelines for reproducible NOS builds and controlled rollout across the fleet. * Support factory bring-up and qualification all the way through mass deployment. * Collaborate on networking protocols and technologies that improve performance and reliability at AI factory scale. ### You might thrive in this role if you have * Proven experience working with SONiC or comparable NOS stacks such as FBOSS, Cumulus Linux, Arista EOS, Junos PFE-level integration, or equivalent platform software. * Strong software engineering fundamentals: clear interfaces, data models, state-machine design, error handling, testing, observability, performance debugging, and
Source: OpenAI careers · scraped 2026-05-22
Apply at OpenAI