Sr. HW Quality Engineer

3 Months ago • 12+ Years • Manufacturing

Job Summary

Job Description

The Senior HW Quality Engineer will develop and implement a robust supplier quality management strategy for data center hardware. This role involves leading quality issue resolution task forces, conducting debug and failure analysis for GPU subsystems in the Azure fleet, driving continuous improvement through RCA, and establishing critical-to-quality metrics. Responsibilities include quality readouts based on telemetry data analysis, acting as the voice of quality in hardware change management, and collaborating with diverse teams. The ideal candidate possesses extensive experience in managing manufacturing quality in the electronics industry, particularly with GPU servers, and has a proven track record of root cause analysis and corrective action.
Must have:
  • 12+ years relevant technical engineering experience
  • 8+ years managing manufacturing quality in electronics
  • 5+ years hardware system issue resolution for GPU servers
  • Experience in debugging and failure analysis
  • Root cause analysis and corrective action expertise
Good to have:
  • Patent or track record of engineering excellence
  • Experience with modern server architectures (GPU, CPU)
  • System-level server debugging experience
  • Direct GPU-related engineering experience
  • Leadership and collaboration skills
  • Data analysis and presentation skills
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

Job Details

Overview

Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft’s expanding Cloud Infrastructure and is responsible for powering Microsoft’s “Intelligent Cloud” mission. SCHIE delivers the core infrastructure and foundational technologies for more than 200 Microsoft online businesses, including Bing, MSN, Office 365, Xbox Live, Teams, OneDrive, and the Microsoft Azure platform, globally through our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide, and we are looking for passionate, high-energy engineers to help achieve that mission.

  

As Microsoft's cloud business continues to grow, the ability to deploy new offerings and hardware infrastructure on time, in high volume, with high quality, and at the lowest cost is of paramount importance. To achieve this goal, the Hardware, Infrastructure Management, and Fundamentals Engineering (HIFE) team is instrumental in defining and delivering operational measures of success for hardware manufacturing and in improving the planning process, quality, delivery, scale, and sustainability related to Microsoft cloud hardware. We are looking for seasoned engineers with a dedicated passion for customer-focused solutions, insight, and industry knowledge to envision and implement future technical solutions that will manage and optimize the Cloud infrastructure.

 We are looking for a Senior HW Quality Engineer to join the team. 

 


Qualifications

Required Qualifications: 

  • 12+ years relevant technical engineering experience
  • OR Bachelor's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 5+ years technical engineering experience
  • OR Master's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 4+ years technical engineering experience
  • OR Doctorate Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 2+ years technical engineering experience.
  • 8+ years of work experience in managing manufacturing quality in the electronics industry.
  • 5+ years of direct engineering experience in hardware system issue resolution for GPU servers.
  • Versed in filtering through applicable debug data, such as telemetry and logs, to identify and investigate HW failure signatures.

 

Other Qualifications: 

The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

 

Preferred Qualifications: 

  • Bachelor's Degree in manufacturing, materials, mechanical, electrical, or industrial engineering, or related field AND 7+ years of experience in a manufacturing environment/repair
  • OR Master's Degree in manufacturing, materials, mechanical, electrical, or industrial engineering, or related field AND 6+ years of experience in a high-volume manufacturing environment
  • OR Doctorate in manufacturing, materials, mechanical, electrical, or industrial engineering, or related field AND 3+ years of experience in a manufacturing environment/repair
  • OR 9+ years equivalent experience.
  • Patent or track record of engineering excellence.
  • 12+ years of experience working with modern server architectures, including an understanding of GPU and CPU methods for failure analysis, debugging, or validation.
  • 8+ years of system-level server debugging with an understanding of power, system, and network environments.
  • 3+ years of direct GPU-related engineering experience in issue debug/test log review.
  • Leadership skills and ability to collaborate with diverse teams and drive a call to action.
  • Expertise in root cause analysis and corrective action methods to identify contributing factors of production defects.
  • Ability to analyze large data sets, extract key insights, and effectively present and communicate the results.
  • Proficient communication and project management skills.

 

 

Responsibilities

  • Develop and implement a robust supplier quality management strategy to ensure that data center hardware is manufactured to the highest quality standards.
  • Lead quality issue and improvement task forces to contain, mitigate, and resolve the top quality issues impacting global data centers.
  • Conduct debug and failure analysis for GPU subsystems in the Azure fleet and drive resolution with partners and suppliers.
  • Drive the continuous improvement process based on Root Cause Analysis (RCA) and identified opportunities.
  • Deliver quality readouts based on your telemetry data analysis to bring clarity on status, actions across the organization, and next steps for issue resolution.
  • Establish Critical-to-Quality performance metrics to measure and improve product quality.
  • Act as the voice of quality in the hardware change management process, ensuring quality requirements are considered, met, and improved.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
