Principal AI Network Architect

1 Hour ago • 6-11 Years • DevOps

About the job

Job Description

Microsoft's Azure Hardware Systems & Infrastructure (AHSI) seeks a Principal AI Network Architect to innovate hardware designs driving cloud growth. The role involves system and network architecture, workload modeling, and co-design for high-performance distributed systems, particularly focusing on AI/neural networks. Responsibilities include proposing and evaluating system architectures, utilizing performance modeling tools, identifying performance bottlenecks, and optimizing resource utilization to enhance performance of breakthrough AI workloads. The ideal candidate possesses strong AI/neural network expertise, experience with network simulation environments (HTSim, NS3), and proficiency in languages like C, C++, C#, Java, JavaScript, or Python. The position contributes to shaping Azure's AI infrastructure roadmap and driving neural network model/hardware co-design.
Must have:
  • Bachelor's Degree in CS or related field
  • 6+ years technical engineering experience
  • 5+ years experience developing/architecting hardware/networks
  • Experience with network simulation environments
  • Proficiency in C, C++, C#, Java, JavaScript, or Python
Good to have:
  • MS/PhD in related field
  • Understanding of LLMs, training, and inference
  • Experience in Python programming and software engineering
  • Working knowledge of LLMs and frameworks like TensorFlow, PyTorch
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Overview

Do you want to be at the forefront of innovating the latest hardware designs to propel Microsoft’s cloud growth? Are you seeking a unique career opportunity that combines both technical capabilities, cross team collaboration, with business insight and strategy?

 

Join our Strategic Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems & Infrastructure (AHSI) organization and be a part of the organization behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission.  

Microsoft delivers more than 200 online services to more than one billion individuals worldwide and AHSI is the team behind our expanding cloud infrastructure. We deliver the core infrastructure and foundational technologies for Microsoft's cloud businesses including Microsoft Azure, Bing, MSN, Office 365, OneDrive, Skype, Teams and Xbox Live. 

 

The SPARC organization manages Azure’s hardware roadmap from architecture concept through production for all of Microsoft’s current and future on-line services.  We are looking for a Principal AI Network Architect with a good background in Artificial Intelligence(AI)/neural networks and experience designing and evaluating networks and technologies for high performance distributed systems. You will be involved with system architecture, network architecture, as well as workload understanding, modelling and co-design.

 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

 

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

Qualifications

Required Qualifications:

  • Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • 5+ years of experience developing/architecting hardware and/or networks.
  • Experience with network simulation environments like HTSim and NS3.

 

Other requirements:

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • MS/PhD in Machine learning, Computer Architecture/Systems, Electrical Engineering, High-Performance Computing or related areas.  
  • Understanding of large language models, training and inference. 
  • Experience in python programming and Software engineering. 
  • Effective communication and collaborative mindset.
  • Good problem-solving skills and attention to detail.
  • Working knowledge of prevailing Large Language Models (LLM) and frameworks like Tensorflow, Pytorch is a plus.

Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $137,600 - $267,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $180,400 - $294,000 per year.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:    

 

Microsoft will accept applications for the role until January 3, 2025.

 

Responsibilities

  • Propose and evaluate system architectures to meet stringent workload and datacenter requirements.
  • Maintain and leverage in-house performance modeling tools for AI workload and Networking evaluation.
  • Identify performance bottlenecks, optimize resource utilization, and implement improvements to enhance performance.
  • Help architect large scale systems which support breakthrough performance AI workloads to shape Azure’s AI infrastructure roadmap.  
  • Drive Neural Networks(NN) model/Hardware(HW) codesign.  
  • Understand business critical AI workloads/applications.
  • Other
    • Embody our and  
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description
$137.6K - $294.0K/yr (Outscal est.)
$215.8K/yr avg.
Redmond, Washington, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

London, England, United Kingdom (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-Site)

San José, San José Province, Costa Rica (On-Site)

Prague, Prague, Czechia (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

Nasdaq - Java Application Support

Nasdaq, Philippines (Hybrid)

SSC Technologies - Senior Business Systems Analyst (Riyadh, KSA)

SSC Technologies, Saudi Arabia (On-Site)

Paypal - Staff Software Backend Engineer (GenAI)

Paypal, United States (Hybrid)

Keywords Studios (Player Support) - Architecte de solutions

Keywords Studios (Player Support), Canada (Remote)

Lulalend - Senior Azure Infrastructure Engineer

Lulalend, South Africa (On-Site)

PwC - Senior Associate | Devops SRE

PwC, India (On-Site)

LeoVegas - Site Reliability Engineer

LeoVegas, Sweden (Hybrid)

Luxoft - KDB Developer

Luxoft, India (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Acceldata - Senior SDET - ADOC

Acceldata, India (On-Site)

Evernorth Health Services - Software Engineering Advisor [T500-12394]

Evernorth Health Services, India (On-Site)

Maersk Careers - Senior Software Engineer

Maersk Careers, China (On-Site)

Progress - Senior Full Stack Engineer

Progress, Bulgaria (Hybrid)

Meta - Software Engineer, Infrastructure

Meta, United States (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Onward Search - Business Development Specialist (Real Estate)

Onward Search, United States (On-Site)

2K - Feature Lead

2K, United States (On-Site)

ByteDance - Compliance Internal Auditor Lead - Payments

ByteDance, United States (Hybrid)

Next Level Business Services - Performance Test Manager

Next Level Business Services, United States (On-Site)

Snail Games - Video Editor

Snail Games, United States (On-Site)

Varonis  - Federal Alliance Manager

Varonis , United States (On-Site)

ByteDance - Software Engineer - Applied Machine Learning

ByteDance, United States (On-Site)

Google - Senior Delivery Executive

Google, United States (On-Site)

Regent Craft - Manufacturing Supervisor

Regent Craft, United States (On-Site)

Paypal - Legal Counsel, Arbitration

Paypal, United States (Hybrid)

Get notifed when new similar jobs are uploaded

DevOps Jobs

Avalara - Senior Site Reliability Engineer

Avalara, India (Remote)

Unisys - AVD Support Senior Engineer

Unisys, India (On-Site)

Luxoft - Tech Lead (Python+Azure)

Luxoft, (Remote)

KBG Blockchain Game Studios - Back-End Developer (NodeJS)

KBG Blockchain Game Studios, Vietnam (On-Site)

Avathon - Senior DevOps Engineer

Avathon, India (On-Site)

Cadence - Senior Cloud Platform Architect

Cadence, United States (On-Site)

Toast - Staff Software Engineer

Toast, India (On-Site)

Get notifed when new similar jobs are uploaded