NVIDIA is a leading technology company known for its innovative work in AI. They are seeking a Senior Software Engineer to build the NeMo Platform, focusing on developing infrastructure for evaluating and improving AI agents.
Responsibilities:
- Design and implement Python-first APIs, SDK workflows, and plugin interfaces for building, measuring, and improving agents across multiple runtimes and product surfaces
- Build reusable systems for observing behavior, measuring progress, detecting regressions, and turning runtime evidence into product decisions
- Build systems for ingesting, normalizing, validating, and analyzing agent execution data and evaluation datasets
- Partner with research, product, platform, and infrastructure teams to integrate agentic capabilities broadly across NVIDIA agent runtimes and developer workflows
- Help turn emerging agent development and improvement techniques into reliable, reusable product capabilities
- Improve reliability, observability, debuggability, and performance across NeMoStack services, SDKs, plugins, jobs, and developer workflows
- Build strong test coverage across unit, integration, E2E, Docker, and Kubernetes workflows
- Drive “speed of light” engineering: fast iteration, high ownership, pragmatic decisions, and performance-minded implementation under production constraints
- Provide senior technical leadership through design reviews, code reviews, mentoring, and ownership of ambiguous cross-component problems