Provide architectural direction across strategic engagements where standard capabilities are not enough and advanced implementation, optimization, or integration customization is needed
Help customers integrate the right components to deliver on their outcomes
Where DSX software fits, advise on adopting it the right way. Where it doesn't, help them succeed with the right alternative and bring the gap back to product and engineering
Dive into complex technical challenges hands-on when needed to solve critical problems, validate architectures, or prove out solutions
Lead technically demanding programs end to end, including third-party performance benchmarking across hardware and workloads
Identify common challenges and solution patterns across engagements. Share findings with internal teams and the broader AI community
Develop standardized approaches, reference architectures, and structured guidance rooted in patterns from successful engagements
Partner with product, engineering, and other customer-facing NVIDIA teams so what we learn in the field informs internal strategy and capabilities
Design technical strategies for advanced AI workloads (distributed training, large-scale inference, model and pipeline optimization, MLOps) that apply across multiple customers and partners
Help develop new infrastructure patterns and playbooks for the latest NVIDIA hardware as it lands with customers and partners
Requirements
Bachelors degree or equivalent experience
12+ years in technical roles such as solutions architecture, ML engineering, technical product management, or technical consulting across multiple customers or projects
Alternatively, 5+ years of specialist-level experience working at the frontier of AI infrastructure
Strong technical leadership with the ability to guide teams and influence technical decisions without direct authority
Systems thinking with the ability to understand customer outcomes and translate them into clear technical requirements and architectures
Willingness to prototype, implement, validate, and troubleshoot hands-on when needed to solve critical problems or prove out approaches
A solid technical foundation in the technologies AI infrastructure is built on, especially Linux systems administration
A self-directed learner who can ramp on brand new technologies and unfamiliar technical domains independently
Strong communication skills with the ability to engage technical teams, executives, and multi-functional collaborators.