NVIDIA is looking for a Deep Learning Architect to join our team working at the cutting edge of AI infrastructure. In this role, you will build and run simulations that capture the traffic dynamics of agentic AI workloads, mine the results for actionable insights, and help guide architectural decisions for next-generation datacenter and GPU systems.
Responsibilities:
- Develop and extend C++ and Python simulators that model system-level network and compute traffic for agentic LLM workloads in datacenter environments
- Characterize real-world LLM serving workloads and distill them into representative simulator inputs
- Run simulations at scale and apply statistical techniques to post-process and interpret results
- Identify performance bottlenecks and translate findings into concrete architectural recommendations
- Collaborate with hardware, software, and research teams to influence the design of future AI systems