Solutions Architect, Data Center Infrastructure

1 Week ago • 3 Years + • DevOps • $120,000 PA - $235,750 PA

Job Summary

Job Description

NVIDIA seeks a Solutions Architect for Data Center Infrastructure to lead planning and deployments of AI data centers. Responsibilities include data center audits, planning, and deployment, ensuring infrastructure integrity aligns with NVIDIA reference architectures. This involves power distribution, cooling systems, networking, server hardware, storage, and telemetry. The role requires pre-deployment planning, risk identification, vendor training, and infrastructure design evaluation for consistency with industry standards. Testing, troubleshooting, and validation of compute systems are key, along with mentorship and continuous improvement initiatives. Collaboration with internal teams, vendors, and customers is crucial for seamless integration of data center infrastructure solutions.
Must have:
  • 3+ years data center experience
  • Data center operations knowledge
  • Power distribution & cooling expertise
  • Networking & server hardware knowledge
  • Pre-deployment planning skills
  • Strong problem-solving skills
  • Excellent communication skills
  • Willingness to travel (40%)
Good to have:
  • Linux system administration
  • Relevant certifications
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking a Solutions Architect in Data Center Infrastructure to join our Infrastructure Specialists team. Academic and commercial groups worldwide are using NVIDIA products to redefine deep learning, data analytics, and power data centers. Join the team building many of the world's largest and fastest data centers and supercomputers! NVIDIA is looking for someone who can lead planning and deployments of AI data centers including power/cooling systems, cabling and network provisioning and bring-up/validation.

As the NVIS Solutions Architect for Datacenter Infrastructure, you will focus on data center audit, planning and deployment ensuring the integrity of NVIDIA platform infrastructure. Your primary goal will be to guarantee that all aspects of the data center's physical infrastructure are meticulously planned, implemented, and validated to meet NVIDIA reference architectures, operational requirements, and industry standards. This infrastructure includes architectural systems, power distribution, liquid/air cooling systems, compute, network and cabling (fiber and copper), and telemetry systems.

What you will be doing:

  • NVIS Datacenter Engineering and planning: Collaborate with other teams to plan and implement data center infrastructure solutions based on NVIDIA Datacenter reference architecture, including power distribution, cooling systems, network architecture, server hardware, and storage systems.

  • Plan and manage deployment of NVIDIA's pioneering AI infrastructure solutions including highly complex rack-scale, liquid cooled compute and networking hardware systems, in a fluid and fast paced environment.

  • Conduct pre-deployment planning including reviewing cluster and data center architecture, plan network port mapping and fiber optic cabling BOM, identify potential risks, train vendors and find areas for improvement.

  • Evaluate customers' and partners' infrastructure design proposals for consistency with industry standards and regulatory requirements. Provide feedback and recommendations to improve performance, scalability, and cost-effectiveness.

  • Perform testing, troubleshooting and validation of compute systems based on collaboration with product and engineering teams.

  • Act as the NVIS mentor providing guidance, mentorship, and support to ensure the NVIS team's success in their respective roles.

  • Quality Assurance: Establish and enforce quality assurance processes to verify that deployments meet established specifications and performance benchmarks. Conduct thorough bring-up, testing, and validation to validate the functionality and reliability of infrastructure components.

  • Continuous Improvement: Drive continuous improvement initiatives to enhance data center infrastructure efficiency for NVIDIA data center reference architecture and deployment blueprint, resilience, and sustainability. Find opportunities to streamline processes, automate repetitive tasks, and leverage emerging technologies to optimize infrastructure operations.

  • Collaboration and Communication: Collaborate and communicate across internal teams, external vendors, and customers to facilitate the seamless integration of data center infrastructure solutions. Serve as a domain expert and point of contact for infrastructure-related inquiries and blocking issues.

What we need to see:

  • Bachelor's degree (or equivalent experience) in Engineering, Computer Science, Information Technology, or a related field.

  • 3+ years of overall experience in enterprise and/or hyperscale data centers with continual infrastructure deployment experience, preferably for high density AI/HPC data centers.

  • Working experience in data center operations, or infrastructure management roles, focusing on large-scale data center deployments.

  • Strong technical knowledge and experience in the data center stack - power distribution, liquid cooling, servers, networking, storage and pre-deployment planning

  • Relevant certification – preferred

  • Demonstrated technical and project leadership under fluid situations, ability to adapt to unknowns and change.

  • Excellent analytical, problem-solving, and decision-making skills, keen attention to detail, and a commitment to quality.

  • Excellent communication and interpersonal abilities, capable of engaging with various collaborators like customers to enable productive discussions.

  • Organization & Time Management – able to plan, schedule, and organize tasks related to the job to achieve goals within or ahead of established time frames.

  • Willingness to travel (40%).

Way to stand out from the crowd:

  • Linux system administration skills

  • Strong knowledge of whole data center Infrastructure stack

  • Flexible/agile and enjoys solving challenging problems

NVIDIA is widely considered one of the world's most desirable employers in technology. We have some of the world's most forward-thinking and passionate people working for us. If you're creative and autonomous, we want to hear from you!

The base salary range is 120,000 USD - 235,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Tencent - Research Intern (NLP)

Tencent

Palo Alto, California, United States (On-Site)
2 Months ago
NVIDIA - Senior Synthesis Flow CAD Engineer

NVIDIA

Canada (On-Site)
1 Month ago
NVIDIA - Customer Technical Program Manager

NVIDIA

Shenzhen, Guangdong Province, China (On-Site)
3 Months ago
NVIDIA - Machine Learning Software Platform Architect

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
ByteDance - Research Scientist (Machine Learning for Science (AI-for-Science))

ByteDance

Seattle, Washington, United States (On-Site)
1 Week ago
Google - Senior Software Engineer, Site Reliability Engineering

Google

Dublin, County Dublin, Ireland (On-Site)
1 Week ago
NVIDIA - Senior Storage and Data Production Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
3 Weeks ago
Tencent - Senior Technical Account Manager

Tencent

Frankfurt, Hessen, Germany (On-Site)
1 Week ago
Info Stretch - Lead Data Engineer

Info Stretch

Chennai, Tamil Nadu, India (On-Site)
5 Months ago
Scopely - Lead DevOps/SRE - Unannounced Project

Scopely

Dublin, County Dublin, Ireland (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Senior Machine Learning Ops Engineer, ML System - Foundation Model

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
NVIDIA - Senior Site Reliability Engineer - AI Research Clusters

NVIDIA

Westford, Massachusetts, United States (Hybrid)
1 Month ago
Trend Micro - Sr. Data Scientist (AI Lab)

Trend Micro

Taipei City, Taiwan (On-Site)
6 Months ago
Talentica Software - Data Scientist

Talentica Software

India (Remote)
6 Months ago
NVIDIA - System Design Validation Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
2 Weeks ago
ByteDance - Research Scientist, Multimodal Foundation Model

ByteDance

Singapore (On-Site)
6 Months ago
Google - Software Engineer III, Machine Learning, Google Ads

Google

Kirkland, Washington, United States (On-Site)
1 Week ago
NVIDIA - Senior Costing Analyst

NVIDIA

Santa Clara, California, United States (On-Site)
1 Week ago
ByteDance - Software Engineer Intern (AI Platform)

ByteDance

San Jose, California, United States (On-Site)
1 Week ago
Meta - Research Scientist Intern, Language and Multimodal Research for MetaAI (PhD)

Meta

Bellevue, Washington, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Texas, United States

Epic Games - Release Manager

Epic Games

Cary, North Carolina, United States (On-Site)
7 Months ago
GoMotive - Staff Software Engineer

GoMotive

United States (Remote)
1 Month ago
Google - Technical Program Manager II, Data Center Networking, Technical Infrastructure

Google

Atlanta, Georgia, United States (On-Site)
1 Week ago
Netflix - Engineering Manager, Ads Supply

Netflix

United States (Remote)
4 Days ago
The Walt Disney Company - Sr Data Analyst

The Walt Disney Company

New York, New York, United States (On-Site)
1 Week ago
ION - Senior Business Consultant - Allegro​

ION

Houston, Texas, United States (On-Site)
6 Months ago
NVIDIA - Senior Signal and Power Integrity Engineer - Hardware

NVIDIA

Austin, Texas, United States (On-Site)
1 Month ago
NVIDIA - DevOps Engineering Intern, DGXC Console - Fall 2025

NVIDIA

Washington, United States (On-Site)
1 Week ago
Google - Senior Software Engineer, Infrastructure, Google Cloud Storage

Google

New York, New York, United States (On-Site)
1 Week ago
Trackman - Trackman Baseball System Operator

Trackman

Arizona, United States (On-Site)
4 Weeks ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Crunchyroll - Staff Site Reliability Engineer - Data Engineering, Platform

Crunchyroll

San Francisco, California, United States (Remote)
5 Months ago
Google - Delivery Executive, Google Cloud Consulting

Google

Munich, Bavaria, Germany (On-Site)
6 Days ago
Mattel  Inc  - Live Games Infrastructure Manager - Digital Gaming

Mattel Inc

El Segundo, California, United States (On-Site)
6 Months ago
CD PROJEKT RED - Senior DevOps Engineer

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (On-Site)
5 Days ago
Admin Looks - Release Manager

Admin Looks

Hyderabad, Telangana, India (Remote)
5 Months ago
Google - Technical Solutions Engineer, Apigee

Google

Maharashtra, India (On-Site)
1 Week ago
Warner Bros Games - Senior Software Developer

Warner Bros Games

Ottawa, Ontario, Canada (Hybrid)
4 Months ago
Google - Software Engineer II, BIOS, Google Cloud Platform

Google

Taipei City, Taiwan (On-Site)
1 Week ago
ByteDance - Backend Software Engineer - Foundational Technology

ByteDance

Singapore (On-Site)
1 Month ago
Google - Senior Software Engineer, Infrastructure Storage, Google Cloud

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug