Data Center Cluster Architect

2 Months ago • All levels • Data Analysis • $207,800 PA - $378,700 PA

Job Summary

Job Description

The Data Center Systems Architecture team is seeking a Cluster Architect to design and optimize computer architectures for high-performance computing (HPC) clusters. This role involves creating complex system architectures and meeting product goals related to performance, size, power, thermal, and cost. The architect will define infrastructure details, collaborate with engineering teams on cluster network integration, and ensure efficient data flow. Responsibilities include defining rack and cluster configurations, designing optimized networks for AI/ML clusters, influencing hardware and software selection, analyzing network traffic, and collaborating with various stakeholders. Innovation, championing new features, and mentoring junior engineers are also part of the role. This position may require occasional travel.

Job Details

The Datacenter Systems Architecture team seeks an outstanding Cluster architect to design and optimize computer architectures specifically for high-performance computing (HPC) clusters. This position is a multi-disciplinary and cross-functional lead engineering role encompassing all aspects of computer system design. The candidate will have the skills and experience to create complex system architectures, surprise and delight our customers, and advance our products’ performance, size, power, thermal and cost goals. As a technical specialist, negotiate and document the solution details of the infrastructure from a physical, electrical, and logical perceptive of compute clusters within the datacenter. Collaborate and leverage domain expertise knowledge to provide guidance, and leadership to cross-functional engineering teams to integrate cluster network architectures into overall system architecture to ensure efficient data flow, impact product definitions, and meet scalability requirements. Define the rack and cluster capabilities, configurations, and scale out requirements to support the deployment of dense compute and specialty compute workloads and applications, including but not limited to the following: Pathfinding on novel cluster architecture choices with a broad group of architects and system engineers, networking, technical leads, and HW/SW stakeholders. Creating optimized network designs for large-scale AI/ML clusters considering factors like bandwidth, latency, and scalability. Influencing networking hardware and software components selection for the cluster, including switches, adapters, and protocols. Analyzing network traffic patterns and implementing strategies to improve data transfer speeds within the cluster for target topologies and choice configurations. Collaborate with mechanical, physical, electrical, thermal, power, networking, OS, SW, datacenter infrastructure stakeholders for performant scalable deployments. Be innovative and curious. Explore and champion new product-level features and workflows. Define, develop and utilize tools, scripts, automation and methods of system analysis for performance of compute clusters within a DC environment. Mentor junior engineers to best practices and data-driven processes The role may require occasional domestic and international travel.

Similar Jobs

Notion - Software Engineer, Android

Notion

San Francisco, California, United States (On-Site)
9 Months ago
Ion - Data Center Architect, Italy

Ion

Italy (Hybrid)
9 Months ago
Apple - System Hardware & Software Quality Engineer

Apple

Cupertino, California, United States (On-Site)
2 Months ago
Ziff Davis - Planner, Sales Planning

Ziff Davis

United States (Remote)
2 Weeks ago
Aristocrat - Mobile Applications Product Manager

Aristocrat

Warsaw, Masovian Voivodeship, Poland (Hybrid)
1 Month ago
Amber - Senior Data Analyst

Amber

Montreal, Quebec, Canada (Remote)
1 Month ago
beghou consulting - Data Engineer

beghou consulting

Hyderabad, Telangana, India (Hybrid)
2 Months ago
Roblox - Data Scientist / Senior Data Scientist - Social Communities

Roblox

San Mateo, California, United States (On-Site)
4 Weeks ago
Grab - Senior Data Scientist

Grab

Beijing, China (On-Site)
3 Weeks ago
binance - Binance Accelerator Program - Data Analyst

binance

Dubai, Dubai, United Arab Emirates (Remote)
3 Years ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Adyen - Strategy & Program Manager - Regulatory

Adyen

Amsterdam, North Holland, Netherlands (On-Site)
1 Month ago
Moloco - Senior Staff Software Engineer (TLM)

Moloco

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Gameloft - Backend Services Developer - Python

Gameloft

Montreal, Quebec, Canada (Hybrid)
1 Week ago
Aristocrat - .Net Developer

Aristocrat

Sofia, Sofia City Province, Bulgaria (Hybrid)
1 Month ago
Dynamis Inc - Senior Scientific Advisor

Dynamis Inc

Huntsville, Alabama, United States (On-Site)
1 Week ago
Milestone - Technical Product Manager

Milestone

United States (Remote)
2 Months ago
Capgemini - Network Voice

Capgemini

Mumbai, Maharashtra, India (On-Site)
1 Month ago
Moloco - Growth Manager

Moloco

Beijing, China (On-Site)
1 Month ago
Shipt - Network Planning Manager, Launches

Shipt

Birmingham, Alabama, United States (Hybrid)
1 Month ago
playrix  - Golang Tech Lead (GameOps)

playrix

Ireland (Remote)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Cupertino, California, United States

Electronic Arts - Senior Manager Event Security Operations

Electronic Arts

Redwood City, California, United States (Hybrid)
1 Month ago
Scout - Senior Specialist, Product Management Energy Systems

Scout

Novi, Michigan, United States (On-Site)
6 Days ago
Toast - Finance Manager, R&D

Toast

San Francisco, California, United States (Hybrid)
4 Weeks ago
Visa - Systems Engineer - Sr. Consultant, IaC

Visa

Ashburn, Virginia, United States (Hybrid)
1 Month ago
Next Level Business Services - SQL Developer

Next Level Business Services

Bellevue, Washington, United States (On-Site)
8 Months ago
cirrus logic - Business Development Manager, PC and Ecosystem Partners

cirrus logic

Austin, Texas, United States (Hybrid)
1 Week ago
Qualcomm - Principal Software Engineering - WindowsOS Platform

Qualcomm

San Diego, California, United States (On-Site)
2 Months ago
Apple - Site Services Supervisor, Data Center

Apple

Mesa, Arizona, United States (On-Site)
1 Month ago
PPfa - Staff Attorney/Senior Staff Attorney

PPfa

New York, United States (Hybrid)
2 Months ago
WebTech Corporation - Vice President, Commercial Global Product Management

WebTech Corporation

Pittsburgh, Pennsylvania, United States (On-Site)
4 Days ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Rackner - Data Engineer

Rackner

Falls Church, Virginia, United States (Hybrid)
6 Days ago
Calix - Distinguished Data Architect

Calix

(Remote)
2 Months ago
Haleon - Sales Operations Executive - Data Insight & Analytics

Haleon

Jakarta, Indonesia (On-Site)
2 Weeks ago
binance - Data Scientist (TechOps)

binance

Taipei City, Taiwan (Remote)
1 Year ago
cyara - Data Analyst

cyara

Hyderabad, Telangana, India (Hybrid)
3 Months ago
Winzo - Data Analyst

Winzo

New Delhi, Delhi, India (On-Site)
2 Months ago
Apple - Business Analyst - Sports & TV+

Apple

Culver City, California, United States (On-Site)
1 Month ago
playrix  - Senior Data Analyst (Attribution)

playrix

Almaty, Almaty Region, Kazakhstan (Remote)
8 Months ago
Anavation - Modeling Data Engineer - Ontologist

Anavation

Lorton, Virginia, United States (Hybrid)
1 Month ago
Extreme Inc. - Data Analysis Engineer

Extreme Inc.

Tokyo, Japan (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Cupertino, California, United States (On-Site)

Cupertino, California, United States (On-Site)

Cupertino, California, United States (On-Site)

Sunnyvale, California, United States (On-Site)

Austin, Texas, United States (On-Site)

San Diego, California, United States (On-Site)

Cupertino, California, United States (On-Site)

Culver City, California, United States (On-Site)

Cupertino, California, United States (On-Site)

Cupertino, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Apple

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug