Data Center Cluster Architect

1 Month ago • All levels • $207,800 PA - $378,700 PA

Job Summary

Job Description

The Data Center Systems Architecture team is seeking a Cluster Architect to design and optimize computer architectures for high-performance computing (HPC) clusters. This role involves creating complex system architectures and meeting product goals related to performance, size, power, thermal, and cost. The architect will define infrastructure details, collaborate with engineering teams on cluster network integration, and ensure efficient data flow. Responsibilities include defining rack and cluster configurations, designing optimized networks for AI/ML clusters, influencing hardware and software selection, analyzing network traffic, and collaborating with various stakeholders. Innovation, championing new features, and mentoring junior engineers are also part of the role. This position may require occasional travel.

Job Details

The Datacenter Systems Architecture team seeks an outstanding Cluster Architect to design and optimize computer architectures specifically for high-performance computing (HPC) clusters. This position is a multi-disciplinary, cross-functional lead engineering role encompassing all aspects of computer system design. The candidate will have the skills and experience to create complex system architectures, surprise and delight our customers, and advance our products’ performance, size, power, thermal, and cost goals.

As a technical specialist, negotiate and document the solution details of the infrastructure from the physical, electrical, and logical perspectives of compute clusters within the datacenter. Collaborate and apply domain expertise to provide guidance and leadership to cross-functional engineering teams, integrating cluster network architectures into the overall system architecture to ensure efficient data flow, influence product definitions, and meet scalability requirements.

Define the rack and cluster capabilities, configurations, and scale-out requirements to support the deployment of dense compute and specialty compute workloads and applications, including but not limited to the following:
Pathfinding on novel cluster architecture choices with a broad group of architects and system engineers, networking, technical leads, and HW/SW stakeholders.
Creating optimized network designs for large-scale AI/ML clusters, considering factors like bandwidth, latency, and scalability.
Influencing the selection of networking hardware and software components for the cluster, including switches, adapters, and protocols.
Analyzing network traffic patterns and implementing strategies to improve data transfer speeds within the cluster for target topologies and choice configurations.

Collaborate with mechanical, physical, electrical, thermal, power, networking, OS, SW, and datacenter infrastructure stakeholders on performant, scalable deployments.

Be innovative and curious. Explore and champion new product-level features and workflows.

Define, develop, and utilize tools, scripts, automation, and methods of system analysis for the performance of compute clusters within a DC environment.

Mentor junior engineers in best practices and data-driven processes.

The role may require occasional domestic and international travel.

About The Company

Get notified when new jobs are added by Apple
