Distinguished Engineer, Utility Computing
NVIDIA
Job Summary
NVIDIA is seeking a Distinguished Engineer for Utility Computing to lead the development of DGX Cloud strategy, focusing on Infrastructure as a Service (IaaS) systems for bare metal and virtualized accelerated computing. This role involves defining technical strategy across various datacenter environments, developing control and data plane systems for utility computing, and collaborating with leadership and partners to deliver scalable accelerated computing solutions.
Must Have
- Define and drive the technical implementation for DGX Cloud utility computing systems to deliver bare metal and virtualized accelerated computing.
- Drive the technical strategy and integrations for shared memory, storage, and networking systems.
- Guide the technical delivery across varied environments: enterprise, government/sovereign, high security/air-gapped, and private datacenters.
- Collaborate with customers, infrastructure providers, and partners to ensure NVIDIA’s solutions set the industry standard for performance and availability.
- Lead all technical aspects of planning and continuous evolution of a large technical scope.
- 18+ overall years in technical roles with a focus on operating systems, virtualization, networking, storage, and cloud infrastructure.
- 7-10+ years of lead experience.
- BS/MS or higher or equivalent experience in systems / software engineering, or related engineering fields.
- Technical proficiency in multi-tenant datacenter and cloud-native architectures for bare metal and virtualization across compute, storage, and networking.
- Proven success delivering high-impact technically complex solutions that achieve high levels of transparency into resource utilization, performance, and operational insights.
- Ability to synthesize cross-functional needs into architecture and design while guiding internal execution across diverse teams.
- Strong collaboration and influence skills, capable of leading engineering engagement, communicating with peers, partners, and working with high performance and accelerated computing customers.
Good to Have
- Real world experience building the systems to support AI/ML workloads.
- Direct experience in designing, developing, delivering and operating secure, highly available, scaled out systems in enterprise and cloud environments.
- Demonstrated history of creating scalable processes and extensible systems that facilitate cross-functional collaboration and operations at scale.
- Familiarity with open source ecosystems and projects in the infrastructure space (e.g. Linux, KVM).
- Ability to collaborate and influence in open source project governance to represent NVIDIA, customers, and partners interests in technical alignment and direction.
Perks & Benefits
- Competitive salaries
- Generous benefits package
- Two free days each quarter to disconnect from work to recharge
Job Description
NVIDIA is leading the industry in delivering accelerated computing in cloud and enterprise environments. We’re a team of innovative engineers dedicated to solving some of the world’s biggest challenges, constantly driving advancements, and impacting millions of lives worldwide!
As a technology leader at NVIDIA, you will lead the development of DGX Cloud strategy for utility computing. Specifically, the Infrastructure as a Service (IaaS) systems to deliver bare metal and virtualized accelerated computing hardware to users. You will define and drive the technical strategy across multiple datacenter environments (enterprise, sovereign, neocloud). Including defining and developing the control and data plane systems that enable utility computing (on-demand, scalable, metered) for accelerated computing hardware. You will work with NVIDIA leadership, cross-organizationally and cross-functionally, to establish the product definition, roadmap, and technical strategy to deliver utility accelerated computing at scale.
What You’ll Be Doing:
- Various Architectural Work: define and drive the technical implementation for DGX Cloud utility computing systems to deliver bare metal and virtualized accelerated computing.
- Collaborate on Cross Domain Disciplines: drive the technical strategy and integrations for shared memory, storage, and networking systems.
- Accelerate Integration: Guide the technical delivery across varied environments: enterprise, government/sovereign, high security/air-gapped, and private datacenters.
- Engage Stakeholders: Collaborate with customers, infrastructure providers, and partners to ensure NVIDIA’s solutions set the industry standard for performance and availability.
- Full Software and System Lifecycle: From ideation to architecture, design, development, deployment, operations, and full lifecycle management, lead all technical aspects of planning and continuous evolution of a large technical scope.
What We Need to See:
- 18+ overall years in technical roles with a focus on operating systems, virtualization, networking, storage, and cloud infrastructure. Defining the abstractions and building the systems to deliver secure, highly available, durable systems for IaaS.
- 7-10+ years of lead experience
- BS/MS or higher or equivalent experience in systems / software engineering, or related engineering fields
- Technical proficiency in multi-tenant datacenter and cloud-native architectures for bare metal and virtualization across compute, storage, and networking.
- Proven success delivering high-impact technically complex solutions that achieve high levels of transparency into resource utilization, performance, and operational insights.
- Technical Leadership: Ability to synthesize cross-functional needs into architecture and design while guiding internal execution across diverse teams.
- Communication and Teamwork: Strong collaboration and influence skills, capable of leading engineering engagement, communicating with peers, partners, and working with high performance and accelerated computing customers.
Ways to Stand Out from the Crowd:
- Application of Artificial Intelligence: Real world experience building the systems to support AI/ML workloads.
- Industry Expertise: Direct experience in designing, developing, delivering and operating secure, highly available, scaled out systems in enterprise and cloud environments.
- Engineering Enablement: Demonstrated history of creating scalable processes and extensible systems that facilitate cross-functional collaboration and operations at scale.
- Open Source Collaboration: Familiarity with open source ecosystems and projects in the infrastructure space (e.g. Linux, KVM). Ability to collaborate and influence in open source project governance to represent NVIDIA, customers, and partners interests in technical alignment and direction.
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate and self-motivated, we want to hear from you! NVIDIA’s invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and establish teams with the most thoughtful people in the world. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.
With competitive salaries and a generous benefits package (www.nvidiabenefits.com), we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 308,000 USD - 471,500 USD.
You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/).
Applications for this job will be accepted at least until December 20, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.