Senior Cloud Test Developer Architect

3 Weeks ago • 8 Years + • DevOps • $200,000 PA - $391,000 PA

Job Summary

Job Description

NVIDIA seeks a Senior Cloud Test Developer Architect to design, optimize, and test large-scale cloud infrastructure for its Unified Cloud Services and Data Center offerings. Responsibilities include leveraging AI-powered testing tools, collaborating with engineering teams, crafting end-to-end test strategies, leading cloud bring-up activities, architecting cloud-native test automation frameworks, developing scalable infrastructure automation, improving observability and monitoring, ensuring resilience and failover testing, and collaborating with internal and external teams. The ideal candidate possesses extensive experience with cloud platforms (AWS, Azure, GCP), Kubernetes, IaC, CI/CD, and cloud security, along with strong programming skills (Python, Go, or Java).
Must have:
  • 8+ years cloud infrastructure experience
  • Kubernetes expertise
  • IaC & Configuration Management
  • Cloud Networking & Storage knowledge
  • CI/CD pipeline experience
  • Python/Go/Java proficiency
  • Cloud security knowledge
Good to have:
  • AI-powered testing experience
  • Kubernetes Operators
  • Confidential Computing knowledge
  • Zero Trust Security models
  • Edge computing familiarity
Perks:
  • Competitive salary and benefits
  • Flexible work environment
  • Opportunity to work with industry experts
  • Equity

Job Details

We are in search of a highly skilled Senior Test Developer Architect to join our dynamic Enterprise Software QA team. This role presents an outstanding opportunity to shape the design, optimization, and testing of large-scale cloud infrastructure for foundational NVIDIA Unified Cloud Services and Data Center offerings. We are seeking a cloud infrastructure expert with expertise in distributed systems, test automation, and cloud architectures.

What You’ll Be Doing:

  • Leverage AI-powered testing tools to improve test automation, increase coverage, and accelerate testing cycles for cloud-based infrastructure.

  • Collaborate with product engineering teams to deeply understand cloud service architectures and provide mentorship to SWQA teams on testing cloud-native applications at scale.

  • Craft and develop end-to-end test strategies for validating cloud infrastructure, including compute, storage, networking, security, and orchestration layers.

  • Lead NVIDIA Cloud bring-up activities from a software quality assurance perspective, ensuring scalability, reliability, and performance.

  • Architect and implement cloud-native test automation frameworks to validate multi-cloud (AWS, Azure, Google Cloud) and hybrid-cloud environments.

  • Develop scalable and resilient infrastructure automation by using Infrastructure as Code (IaC), Configuration Management, and optimization techniques.

  • Improve observability and monitoring through AI-powered anomaly detection, predictive analytics, and intelligent alerting.

  • Ensure resilience and failover testing of cloud-based microservices and distributed architectures.

  • Collaborate with internal teams and cloud service partners to ensure alignment with industry-standard methodologies and real-world use cases.

What We Need to See:

  • Master’s or Ph.D. in Computer Science, Cloud Computing, or a related field, or equivalent experience.

  • 4+ years of hands-on experience in cloud-native cluster management, including Docker, Slurm, Kubernetes, OpenShift, and Ansible.

  • 8+ years of experience working with cloud infrastructure platforms like AWS, Azure, and Google Cloud, with deep expertise in multi-cloud and hybrid-cloud architectures.

  • Strong hands-on experience with Cloud Networking (VPCs, Load Balancers, Service Mesh, API Gateways) and Storage Technologies (EBS, S3, Azure Blob, GFS).

  • Advanced proficiency in Infrastructure as Code (IaC) and Configuration Management tools (e.g., Terraform, CloudFormation, Pulumi, Ansible).

  • Deep expertise in Kubernetes administration, service mesh technologies (Istio, Linkerd), and container security.

  • Proficiency in Python, Go, or Java for cloud automation, testing frameworks, and infrastructure scripting.

  • Expertise in CI/CD pipelines using GitOps models, GitLab, Jenkins, ArgoCD, and Spinnaker for automated cloud deployments.

  • Hands-on experience with cloud observability and monitoring tools (Prometheus, Grafana, CloudWatch, Thanos, Datadog, New Relic).

  • Strong cloud security knowledge, including Kubernetes security, IAM policies, encryption, and vulnerability management.

  • Proven track record of debugging complex cloud infrastructure issues involving DNS, HTTP, Linux, cloud networking, and containers.

Ways to Stand Out from the Crowd:

  • A true innovator who isn't afraid to challenge the status quo and bring fresh ideas to the table. You're always looking for ways to improve existing systems and processes. You bring passion and curiosity about the latest technologies and trends in cloud infrastructure and distributed systems. You're not just familiar with the tools; you understand the underlying principles and can apply that knowledge to make strategic decisions. You're committed to personal and professional growth and actively seek opportunities to learn new skills and deepen your expertise.

  • Deep expertise in AI-powered cloud testing, applying machine learning to predictive failure analysis, anomaly detection, and self-healing infrastructure.

  • Strong knowledge of Kubernetes Operators, Helm charts, and custom controllers for automating cloud operations.

  • Familiarity with Confidential Computing, Zero Trust Security models, and cloud-native security frameworks.

  • Excitement for the latest cloud architectures, such as edge computing, AI-driven infrastructure, and serverless computing.

By joining our team, you will be part of a forward-thinking company that values innovation and creativity. We offer a competitive salary and benefits package, a flexible work environment, and the opportunity to work with some of the industry's leading experts. If you're ready to take your career to the next level, we'd love to hear from you.

The base salary range is 200,000 USD - 391,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

