NVIDIA is a leading technology company known for its groundbreaking developments in Artificial Intelligence and High-Performance Computing. They are seeking a Senior Engineering Manager to lead a firmware engineering organization focused on delivering manageability firmware for Data Center Compute Systems, overseeing architecture, execution, and quality across multiple programs.
Responsibilities:
- Lead a large firmware engineering organization delivering OpenBMC based firmware and MCU firmware for next-generation Data Center Compute Systems
- Own HGX platform as a lead for Firmware and System software readiness working across the organization
- Define and drive the long-term firmware roadmap, balancing architectural innovation with product execution and delivery milestones
- Drive architecture strategy across BMC, MCU, platform software, manageability, health management, and data center firmware interfaces
- Lead execution across multiple programs, coordinating priorities, hiring, managing cross component dependencies, and delivery commitments across a large engineering team
- Collaborate with data center architects, cloud customers, senior stakeholders, and cross-functional teams to define requirements, scope implementation, and deliver at Speed of Light
- Partner with hardware, systems, security, validation, manufacturing, field, and customer engineering teams to ensure scalable manageability architecture across data center products
- Manage customer and executive escalations for complex firmware, platform, and deployment issues
- Build, mentor, and grow a high-performing engineering organization with strong technical leadership, execution discipline, and quality culture
Requirements:
- BS, MS, or PhD in EE/CS or related field of education or equivalent experience
- 12+ overall years of proven experience in server firmware, BMC/OpenBMC, MCU firmware, platform software, or data center systems
- 6+ years of experience managing software/firmware engineering teams
- Strong technical leadership in data center system architecture, server manageability, telemetry, health management, and reliability at scale
- Proven record delivering production firmware for large data centers with strong quality, debug, and operational discipline
- Experience leading architecture and execution across multiple programs, cross-functional teams, and customer-facing deliverables
- Strong understanding of firmware development lifecycle, validation, release management, issue triage, and production support
- Excellent communication skills, strong work ethic, sound judgment, and the ability to align teams through complex technical and business tradeoffs
- Experience leading large distributed engineering organizations, including multi-team execution and senior technical leaders
- High level of ownership to deliver products working across matric organization, having done that for a couple of products. Having mindset bucks stop at me
- Hands-on experience with BMC firmware/software stack, MCU firmware, C/C++, Python, and debugging server platform. Expertise with OOB management DMTF protocols and standards such as MCTP, PLDM, SPDM, and Redfish
- Experience with Embedded Linux, FreeRTOS, Yocto/BitBake, Git, Perforce, Jira, and modern firmware CI/CD practices
- Proven ability to drive complex architecture, quality, reliability, and customer escalation work across 25+ engineers or similarly large engineering teams