Senior Software Engineer, DGX Cloud Orchestration

4 Days ago • 5-9 Years • DevOps • $136,000 PA - $264,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior Software Engineer to join its DGX Cloud team. This role involves designing and developing scalable automation solutions for high-performance GPU infrastructure, integrating diverse systems, and creating seamless workflows for global cloud operations. Responsibilities include designing APIs (GraphQL/REST), building state management systems, collaborating across teams to codify business processes, developing extensible platforms, integrating with Kubernetes and observability systems, optimizing cloud operations, and leading impactful technical projects. The ideal candidate possesses expertise in building APIs, proficiency in Go, Java, or Python, familiarity with cloud infrastructure (AWS, GCP, Azure), and experience with high-scale distributed systems.
Must have:
  • GraphQL/REST API design & development
  • Go/Java/Python proficiency
  • Cloud infrastructure & Kubernetes expertise
  • High-scale distributed systems experience
  • Workflow orchestration system design
Good to have:
  • Experience reducing operational inefficiencies
  • Strong debugging and problem-solving skills
Perks:
  • Equity
  • Benefits

Job Details

We are looking for a Senior Software Engineer to join our DGX Cloud team and build the foundational systems that drive NVIDIA’s high-performance GPU infrastructure. You will play a critical role in designing scalable automation solutions, integrating diverse systems, and enabling seamless workflows across global cloud operations. NVIDIA is widely recognized as one of the most desirable employers, with some of the most talented people in the world working for us. If you're passionate about building scalable, efficient systems to power cloud operations, we invite you to join our team.

What You'll Be Doing

  • Design and develop APIs (GraphQL/REST) to orchestrate and integrate operational workflows.

  • Build state management and workflow automation systems that streamline infrastructure lifecycle processes.

  • Collaborate across teams to codify business processes into scalable, self-measuring systems.

  • Develop extensible, schema-driven platforms for reducing manual toil and ensuring operational consistency.

  • Drive integrations with container orchestration tools like Kubernetes and observability systems such as Prometheus, OpenTelemetry, Grafana.

  • Optimize the reliability and efficiency of cloud operations through automated workflows and telemetry systems.

  • Lead and ship impactful technical projects, ensuring quality and scalability at every stage

What we need to see:

  • 5-9+ years of industry experience with a Bachelor’s or Master’s degree (or equivalent experience), or 2+ years with a PhD.

  • Expertise in building GraphQL and REST APIs.

  • Proficiency in programming languages such as Go, Java, or Python.

  • Familiarity with modern JavaScript frameworks (e.g., React, Angular, Next.js).

  • Strong understanding of cloud infrastructure (AWS, GCP, Azure) and container technologies like Docker and Kubernetes.

  • Experience with high-scale distributed systems, including architectural patterns for APIs and data pipelines.

  • Outstanding communication and collaboration skills, with a focus on solving complex operational challenges.

  • A passion for automating manual processes and driving system efficiency.

Ways to Stand Out from the Crowd

  • A track record of designing workflow orchestration systems for large-scale infrastructure.

  • Proven experience in reducing operational inefficiencies through automation and integration.

  • Strong debugging and problem-solving skills in distributed environments.

NVIDIA is committed to creating an environment where diverse perspectives drive innovation. As part of the DGX Cloud team, you’ll work on ground breaking technology that powers the future of AI and cloud computing. NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. Our invention serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

The base salary range is 136,000 USD - 264,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Gaming Innovation Group  - DevOps Data Engineer

Gaming Innovation Group

St. Julian's, Malta (Hybrid)
2 Weeks ago
ION - Cloud Engineer Kubernetes

ION

Milan, Lombardy, Italy (Hybrid)
6 Months ago
Luxoft - Senior DevOps Engineer (Azure)

Luxoft

New Delhi, Delhi, India (Remote)
4 Months ago
Homa games - Senior Full-Stack Engineer: Unity C#

Homa games

Île-de-France, France (Remote)
2 Weeks ago
Meta - Production Engineering

Meta

Seattle, Washington, United States (Hybrid)
4 Months ago
Ubisoft - Monitoring Specialist - Golang Developer

Ubisoft

Saint-Mandé, Île-de-France, France (Hybrid)
4 Days ago
Dream Sports - Director System IT

Dream Sports

Mumbai, Maharashtra, India (On-Site)
4 Months ago
Razer - Software Engineer (DevOps)

Razer

Shah Alam, Selangor, Malaysia (On-Site)
6 Months ago
Probably Monsters - Build Engineer, Ecosystems (Core Technology)

Probably Monsters

Texas, United States (On-Site)
2 Months ago
Zazz - Cloud Engineer (AWS)

Zazz

(Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

PwC - ETIC, Cloud Infrastructure - Manager

PwC

Cairo, Cairo Governorate, Egypt (On-Site)
5 Months ago
Wargaming - Game Developer

Wargaming

Warsaw, Masovian Voivodeship, Poland (Hybrid)
2 Weeks ago
Warner Bros Games - Staff Software Engineer

Warner Bros Games

(Hybrid)
3 Weeks ago
Nielsen Holdings - Software Engineer - Platform

Nielsen Holdings

Mumbai, Maharashtra, India (Hybrid)
5 Months ago
Inworld AI - Staff Cloud DevOps/Site Reliability Engineer (SRE) - USA

Inworld AI

Mountain View, California, United States (On-Site)
8 Months ago
Warner Bros Games - Senior Software Engineer - Fullstack (AdTech Team)

Warner Bros Games

Pune, Maharashtra, India (Hybrid)
1 Month ago
Sony Interactive Entertainment - Database Reliability Engineer (DBRE)  - 世界最大級のゲームプラットフォーム

Sony Interactive Entertainment

Tokyo, Japan (On-Site)
5 Months ago
Trend Micro - (Sr.) Cloud Backend Engineer

Trend Micro

Taipei City, Taiwan (On-Site)
6 Months ago
Evolution - Scala Engineer

Evolution

Warsaw, Masovian Voivodeship, Poland (On-Site)
10 Months ago
ByteDance - Site Reliability Engineer, Edge Services

ByteDance

Seattle, Washington, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

ByteDance - Machine Learning Engineer Intern (Product RD and Infrastructure - LLM Unit Tests)

ByteDance

San Jose, California, United States (On-Site)
2 Weeks ago
Eleven Labs - Website Engineer

Eleven Labs

United States (Remote)
2 Weeks ago
Games For Love - Esports Game Player

Games For Love

Lynnwood, Washington, United States (Remote)
8 Months ago
Rockstar Games - Marketing Manager, Live Services

Rockstar Games

New York, New York, United States (On-Site)
4 Months ago
VX Media - Showroom Coordinator Intern *UNPAID*

VX Media

New York, New York, United States (On-Site)
5 Months ago
Netflix - Senior Manager, Product Management (Demand Connectivity) - Ads

Netflix

Los Gatos, California, United States (On-Site)
3 Months ago
Crunchyroll - Principal Software Engineer, Video Players

Crunchyroll

San Francisco, California, United States (Remote)
2 Months ago
Bad Robot Games - Online Engineer

Bad Robot Games

California, United States (Remote)
7 Hours ago
On Location - Junior Product & Pricing Analyst - Olympic & Paralympic Games

On Location

Raleigh, North Carolina, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Wargaming - DevOps Engineer

Wargaming

Nicosia, Nicosia, Cyprus (On-Site)
3 Months ago
Sonar Source - Support Engineer

Sonar Source

Geneva, Geneva, Switzerland (On-Site)
5 Months ago
Zazz - Cloud Engineer (Azure)

Zazz

(Remote)
2 Months ago
Truecaller - Senior MLOps Engineer

Truecaller

Stockholm, Stockholm County, Sweden (On-Site)
5 Months ago
Egnyte - Database Administrator

Egnyte

India (Remote)
1 Month ago
The Walt Disney Company - Sr. FinOps Tech Data Analyst

The Walt Disney Company

Washington, United States (On-Site)
3 Weeks ago
Luxoft - Senior Java engineer (with oncall support)

Luxoft

Ukrainka, Kyiv Oblast, Ukraine (Remote)
4 Months ago
Scanline VFX - Senior DevOps Engineer

Scanline VFX

Vancouver, British Columbia, Canada (Hybrid)
2 Months ago
ByteDance - Software Engineer, Cloud Infrastructure

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Visa - Chief Systems Architect

Visa

Auckland, Auckland, New Zealand (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Hanoi, Hanoi, Vietnam (On-Site)

Shenzhen, Guangdong Province, China (On-Site)

Bengaluru, Karnataka, India (On-Site)

Shanghai, Shanghai, China (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Shanghai, Shanghai, China (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug