Software Engineering Manager, Distributed Task-based Runtimes

1 Month ago • 8 Years + • Research & Development • $224,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA seeks an experienced Software Engineering Manager to lead the development of its distributed runtime stack for large-scale computing. This role involves managing a team designing, developing, and optimizing software (Legate, Legion, Realm) that simplifies development of AI, scientific computing, and data analytics applications. Responsibilities include team leadership, project planning, collaboration with internal and external teams (research, engineering, product management, partners), and ensuring high-quality, high-performance software. The ideal candidate has strong leadership experience, HPC expertise, and experience with GPU-accelerated software development (C, C++, Python).
Must have:
  • Lead and mentor engineering teams
  • 8+ years of experience in distributed runtimes
  • 3+ years of experience leading software teams
  • HPC and performance-critical applications
  • GPU-accelerated software development (C/C++/Python)
  • Agile development and project management
Good to have:
  • Experience with Legion, Ray, or Dask
  • CUDA, MPI, or OpenMP experience
  • CPU/GPU architecture knowledge
  • Development of domain-specific libraries/languages
  • Machine Learning/Deep Learning understanding
Perks:
  • Equity
  • Benefits

Job Details

We are looking for an experienced software engineering manager to lead the development of NVIDIA’s distributed runtime stack for large-scale computing, which aims to democratize scalable accelerated computing. Around the world, leading commercial and academic organizations are revolutionizing AI, scientific computing, and data analytics using data centers powered by GPUs. Applications of these technologies include LLMs, computer vision, autonomous vehicles, and countless others. Our team develops foundational distributed computing software that greatly simplifies the development of such applications!

In this role, you will lead an engineering team designing, developing, and optimizing the distributed task-based runtime software stack that includes Legate, Legion and Realm. Ideal candidates should have experience leading software product engineering teams, and be motivated to advance the state-of-the-art in a variety of accelerated computing domains. If this sounds exciting, we would love to meet you!

What you'll be doing:

  • Lead, mentor, and grow your distributed runtime engineering team, and be responsible for the planning and execution of projects as well as the quality and performance of the runtime stack.

  • Work closely with NVIDIA Research, Engineering, Developer Technology, and Product Management teams in the areas of scientific computing, data analytics, programming systems, and AI to help collect requirements for your products as well as contribute to the development of technology roadmaps.

  • Interact with external partners and researchers to understand their use cases and requirements.

What we need to see:

  • BS, MS, or PhD degree in Computer Science, Electrical Engineering, or a related field (or equivalent experience).

  • 8+ years of overall experience in developing distributed runtimes or at-scale high-performance software.

  • 3+ years of experience recruiting, training and leading software engineering teams.

  • Background in high-performance computing and performance-critical applications.

  • Experience implementing, tuning, and debugging runtimes and/or distributed systems for supercomputers or the cloud

  • Hands-on experience with design, development, testing, maintenance, and performance optimization of GPU-accelerated software using C, C++ or Python.

  • Strong collaboration, communication, and documentation habits.

  • Experience with agile software development practices using project management tools such as JIRA.

Ways to stand out from the crowd:

  • Experience with development of distributed runtimes such as Legion, Ray or Dask

  • Experience with parallel programming, ideally using CUDA, MPI or OpenMP

  • Good knowledge of CPU and/or GPU hardware architecture.

  • Development of domain-specific libraries/languages for high-performance computing.

  • Good understanding of Machine Learning and Deep Learning technologies

The base salary range is 224,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
