Sr. Hardware Engineer - DEBUG

1 Hour ago • 7 Years + • DevOps

About the job

Job Description

This Senior Hardware Engineer role involves leading end-to-end system-level debugging at scale for Microsoft's cloud infrastructure. Responsibilities include collaborating with hardware, firmware, and software teams for root cause analysis, meeting debug SLAs, prioritizing issues, and developing robust debug methodologies and validation plans. The ideal candidate will have 7+ years of experience in platform architecture, validation, or debug engineering, a deep understanding of server architectures, and strong remote/hands-on debugging skills. The role requires effective communication, problem-solving, and collaboration with internal and external partners to ensure high quality, reliability, and service levels in a cloud environment.
Must have:
  • 7+ years of experience in platform architecture/validation/debug engineering
  • Deep understanding of modern server architectures (CPU, Memory, Storage)
  • Strong remote/hands-on debug experience
  • Lead E2E system level debug activities
  • Collaborate with hardware, firmware, and software teams
Good to have:
  • Platform debug and validation experience
  • Data analytical skills
  • Excellent communication skills
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

Overview

Microsoft Silicon, Cloud Hardware Infrastructure Engineering (SCHIE) is the team behind Microsoft’s expanding Cloud Infrastructure and is responsible for powering Microsoft’s “Intelligent Cloud” mission. SCHIE delivers the core infrastructure and foundational technologies for more than 200 of Microsoft's online businesses, including Bing, MSN, Office 365, Xbox Live, Teams, OneDrive and the Microsoft Azure platform, globally, with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering trusted experiences to customers and partners worldwide, and we are looking for passionate, high-energy engineers to help achieve that mission.

 

As Microsoft's cloud business continues to grow, the ability to deploy new offerings and HW infrastructure on time, in high volume, with high quality and at the lowest cost is of paramount importance. To achieve this goal, the Hardware, Infrastructure Management, and Fundamentals Engineering (HIFE) team is instrumental in defining and delivering operational measures of success for hardware manufacturing and in improving the planning process, quality, delivery, scale and sustainability of Microsoft cloud hardware. We are looking for seasoned engineers with a strong passion for customer-focused solutions, insight, and industry knowledge to envision and implement future technical solutions that will manage and optimize the Cloud infrastructure.

 

#SCHIE  #HIFE

Qualifications

Required Qualifications 

  • 7+ years of technical leadership experience as a platform architect, validation architect, or lead debug engineer, or equivalent industry experience.
  • Deep understanding of modern server architectures: memory, CPU, storage, or system-level firmware.
  • Strong remote or hands-on debug experience. 

Other Qualifications


Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check. This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

 

Preferred Qualifications:

  • Platform Debug and validation experience
  • Data analysis skills: knowing how to use data-based and analytical tools.
  • Excellent communication skills using various forms of media.
  • Able to plan work and work to a plan, adapting as necessary in a rapidly evolving environment.
  • Individual effectiveness skills such as discipline, time management, decision making, planning and organizing work, and summarizing results through technical reports.
  • Self-driven, self-motivated individual who is able to work independently as well as collaboratively in a team environment and across teams of engineers.

 

Responsibilities

  • Lead E2E, at-scale, system-level debug activities in the cluster to meet fleet KPIs.
  • Collaborate with hardware, firmware and software teams to enable comprehensive root cause analysis.
  • Be accountable for meeting L1-L3 debug/triage SLAs.
  • Prioritize issues based on a technical and business understanding of both their complexity and impact.
  • Identify and drive execution and E2E verification plans to proactively address open issues.
  • Develop best-in-class debug methodologies, test strategies and platform validation plans for the at-scale clusters.
  • Solve problems relating to mission critical services and build automation to drive debug efficiency.
  • Collaborate with internal and external partners to ensure systems meet significant quality, reliability, and service level requirements for a cloud environment.
  • Effectively communicate with partners and stakeholders for planning and progress on initiatives using data.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect
About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Noida, Uttar Pradesh, India (On-Site)

Paris, Île-de-France, France (On-Site)

Hyderabad, Telangana, India (On-Site)

Hyderabad, Telangana, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Noida, Uttar Pradesh, India (On-Site)
