Senior Cloud Test Developer Architect

3 Months ago • 8 Years + • DevOps • $200,000 PA - $391,000 PA

Job Summary

Job Description

NVIDIA seeks a Senior Cloud Test Developer Architect to design, optimize, and test large-scale cloud infrastructure for its Unified Cloud Services and Data Center offerings. Responsibilities include leveraging AI-powered testing tools, collaborating with engineering teams, crafting end-to-end test strategies, leading cloud bring-up activities, architecting cloud-native test automation frameworks, developing scalable infrastructure automation, improving observability and monitoring, ensuring resilience and failover testing, and collaborating with internal and external teams. The ideal candidate possesses extensive experience with cloud platforms (AWS, Azure, GCP), Kubernetes, IaC, CI/CD, and cloud security, along with strong programming skills (Python, Go, or Java).
Must have:
  • 8+ years cloud infrastructure experience
  • Kubernetes expertise
  • IaC & Configuration Management
  • Cloud Networking & Storage knowledge
  • CI/CD pipeline experience
  • Python/Go/Java proficiency
  • Cloud security knowledge
Good to have:
  • AI-powered testing experience
  • Kubernetes Operators
  • Confidential Computing knowledge
  • Zero Trust Security models
  • Edge computing familiarity
Perks:
  • Competitive salary and benefits
  • Flexible work environment
  • Opportunity to work with industry experts
  • Equity

Job Details

We are in search of a highly skilled Senior Test Developer Architect to join our dynamic Enterprise Software QA team. This role presents an outstanding opportunity to shape the design, optimization, and testing of large-scale cloud infrastructure for foundational NVIDIA Unified Cloud Services and Data Center offerings. We are seeking a cloud infrastructure expert with expertise in distributed systems, test automation, and cloud architectures.

What You’ll Be Doing:

  • Leverage AI-powered testing tools to improve test automation, increase coverage, and accelerate testing cycles for cloud-based infrastructure.

  • Collaborate with product engineering teams to deeply understand cloud service architectures and provide mentorship to SWQA teams on testing cloud-native applications at scale.

  • Craft and develop end-to-end test strategies for validating cloud infrastructure, including compute, storage, networking, security, and orchestration layers.

  • Lead NVIDIA Cloud bring-up activities from a software quality assurance perspective, ensuring scalability, reliability, and performance.

  • Architect and implement cloud-native test automation frameworks to validate multi-cloud (AWS, Azure, Google Cloud) and hybrid-cloud environments.

  • Develop scalable and resilient infrastructure automation by using Infrastructure as Code (IaC), Configuration Management, and optimization techniques.

  • Improve observability and monitoring through AI-powered anomaly detection, predictive analytics, and intelligent alerting.

  • Ensure resilience and failover testing of cloud-based microservices and distributed architectures.

  • Collaborate with internal teams and cloud service partners to ensure alignment with industry-standard methodologies and real-world use cases.

What We Need to See:

  • Master’s or Ph.D. in Computer Science, Cloud Computing, or a related field, or equivalent experience.

  • 4+ years of hands-on experience in cloud-native cluster management, including Docker, Slurm, Kubernetes, OpenShift, and Ansible.

  • 8+ years of experience working with cloud infrastructure platforms like AWS, Azure, and Google Cloud, with deep expertise in multi-cloud and hybrid-cloud architectures.

  • Strong hands-on experience with Cloud Networking (VPCs, Load Balancers, Service Mesh, API Gateways) and Storage Technologies (EBS, S3, Azure Blob, GFS).

  • Advanced proficiency in Infrastructure as Code (IaC) and Configuration Management tools (e.g., Terraform, CloudFormation, Pulumi, Ansible).

  • Deep expertise in Kubernetes administration, service mesh technologies (Istio, Linkerd), and container security.

  • Proficiency in Python, Go, or Java for cloud automation, testing frameworks, and infrastructure scripting.

  • Expertise in CI/CD pipelines using GitOps models, GitLab, Jenkins, ArgoCD, and Spinnaker for automated cloud deployments.

  • Hands-on experience with cloud observability and monitoring tools (Prometheus, Grafana, CloudWatch, Thanos, Datadog, New Relic).

  • Strong cloud security knowledge, including Kubernetes security, IAM policies, encryption, and vulnerability management.

  • Proven track record of debugging complex cloud infrastructure issues involving DNS, HTTP, Linux, cloud networking, and containers.

Ways to Stand Out from the Crowd:

  • A true innovator who isn't afraid to challenge the status quo and bring fresh ideas to the table, always looking for ways to improve existing systems and processes. Passionate and curious about the latest technologies and trends in cloud infrastructure and distributed systems. You're not just familiar with the tools; you understand the underlying principles and can apply that knowledge to make strategic decisions. Committed to personal and professional growth, you seek out opportunities to learn new skills and deepen your expertise.

  • Deep expertise in AI-powered cloud testing, applying machine learning to predictive failure analysis, anomaly detection, and self-healing infrastructure.

  • Strong knowledge of Kubernetes Operators, Helm charts, and custom controllers for automating cloud operations.

  • Familiarity with Confidential Computing, Zero Trust Security models, and cloud-native security frameworks.

  • Excitement for the latest cloud architectures, such as edge computing, AI-driven infrastructure, and serverless computing.

By joining our team, you will be part of a forward-thinking company that values innovation and creativity. We offer a competitive salary and benefits package, a flexible work environment, and the opportunity to work with some of the industry's leading experts. If you're ready to take your career to the next level, we'd love to hear from you.

The base salary range is 200,000 USD - 391,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
