[Remote] Distinguished Engineer – High Performance AI
Note: The job is a remote job and is open to candidates in USA. NVIDIA is a leader in computer graphics and AI technology, seeking a Distinguished Engineer – High Performance AI to build groundbreaking agentic AI systems for the CUDA ecosystem. The role involves defining technical direction, driving execution across the stack, and collaborating with internal teams to translate advances into production capabilities.
Responsibilities
- Set strategy and lead execution for agentic AI systems for the CUDA ecosystem, defining roadmaps and measurable success metrics (performance, quality, reliability, developer productivity)
- Co-design agentic system solutions with software, hardware and algorithm teams; influence and adopt new capabilities as they become available
- Develop reproducible, high-fidelity evaluation frameworks covering performance, quality and developer productivity
- Collaborate across the AI stack and help drive architecture and key technical decisions —from hardware through compilers/toolchains, kernels/libraries, frameworks, distributed training, and inference/serving—and with model and research/engineering teams
- Scale impact through leadership: mentor and grow senior technical talent
Skills
- Bachelor's degree in Computer Science, Electrical Engineering, or related field (or equivalent experience)
- 17+ years industry and/or academia experience with AI systems development; strong exposure to building foundational models, agents or orchestration frameworks; hands-on experience with deep learning frameworks and modern inference stacks
- Strong C/C++ and Python programming skills; solid software engineering fundamentals; ability to set engineering standards and review architecture at scale
- Experience with GPU programming and performance optimization (CUDA or equivalent)
- Proven track record leading large, cross-team efforts from concept through production, including navigating ambiguity, aligning stakeholders, and delivering measurable outcomes
- MS or PhD preferred
- Track record building/evaluating deep learning models, coding agents and developer tooling, and driving broad adoption across teams or customers
- Demonstrated ability to optimize and deploy high-performance models, including on resource-constrained platforms. Deep expertise in GPU performance optimizations, evidenced by benchmark wins or published results
- Publications or open-source leadership in deep learning, multi-agent systems, reinforcement learning, or AI systems; contributions to widely used repos or standards
- Experience leading projects end-to-end, mentoring small teams; ability to drive concepts to production
- Recognized technical leadership (e.g., setting platform direction, creating widely used architectures/APIs, or establishing evaluation/benchmarking standards)
Benefits
- Equity
- Benefits
Company Overview
Company H1B Sponsorship
Apply To This Job