Senior Software Engineer, DGX Cloud Orchestration

2 Weeks ago • 5-9 Years • DevOps • $136,000 PA - $264,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior Software Engineer to join its DGX Cloud team. This role involves designing and developing scalable automation solutions for high-performance GPU infrastructure, integrating diverse systems, and creating seamless workflows for global cloud operations. Responsibilities include designing APIs (GraphQL/REST), building state management systems, collaborating across teams to codify business processes, developing extensible platforms, integrating with Kubernetes and observability systems, optimizing cloud operations, and leading impactful technical projects. The ideal candidate possesses expertise in building APIs, proficiency in Go, Java, or Python, familiarity with cloud infrastructure (AWS, GCP, Azure), and experience with high-scale distributed systems.
Must have:
  • GraphQL/REST API design & development
  • Go/Java/Python proficiency
  • Cloud infrastructure & Kubernetes expertise
  • High-scale distributed systems experience
  • Workflow orchestration system design
Good to have:
  • Experience reducing operational inefficiencies
  • Strong debugging and problem-solving skills
Perks:
  • Equity
  • Benefits

Job Details

We are looking for a Senior Software Engineer to join our DGX Cloud team and build the foundational systems that drive NVIDIA’s high-performance GPU infrastructure. You will play a critical role in designing scalable automation solutions, integrating diverse systems, and enabling seamless workflows across global cloud operations. NVIDIA is widely recognized as one of the most desirable employers, with some of the most talented people in the world working for us. If you're passionate about building scalable, efficient systems to power cloud operations, we invite you to join our team.

What You'll Be Doing

  • Design and develop APIs (GraphQL/REST) to orchestrate and integrate operational workflows.

  • Build state management and workflow automation systems that streamline infrastructure lifecycle processes.

  • Collaborate across teams to codify business processes into scalable, self-measuring systems.

  • Develop extensible, schema-driven platforms for reducing manual toil and ensuring operational consistency.

  • Drive integrations with container orchestration tools like Kubernetes and observability systems such as Prometheus, OpenTelemetry, and Grafana.

  • Optimize the reliability and efficiency of cloud operations through automated workflows and telemetry systems.

  • Lead and ship impactful technical projects, ensuring quality and scalability at every stage.

What We Need to See

  • 5-9+ years of industry experience with a Bachelor’s or Master’s degree (or equivalent experience), or 2+ years with a PhD.

  • Expertise in building GraphQL and REST APIs.

  • Proficiency in programming languages such as Go, Java, or Python.

  • Familiarity with modern JavaScript frameworks (e.g., React, Angular, Next.js).

  • Strong understanding of cloud infrastructure (AWS, GCP, Azure) and container technologies like Docker and Kubernetes.

  • Experience with high-scale distributed systems, including architectural patterns for APIs and data pipelines.

  • Outstanding communication and collaboration skills, with a focus on solving complex operational challenges.

  • A passion for automating manual processes and driving system efficiency.

Ways to Stand Out from the Crowd

  • A track record of designing workflow orchestration systems for large-scale infrastructure.

  • Proven experience in reducing operational inefficiencies through automation and integration.

  • Strong debugging and problem-solving skills in distributed environments.

NVIDIA is committed to creating an environment where diverse perspectives drive innovation. As part of the DGX Cloud team, you’ll work on groundbreaking technology that powers the future of AI and cloud computing. NVIDIA is leading the way in Artificial Intelligence, High-Performance Computing, and Visualization. Our invention, the GPU, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

The base salary range is 136,000 USD - 264,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
