Site Reliability Engineer

38 Minutes ago • All levels • DevOps • Undisclosed

About the job

Job Description

The Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team at Microsoft seeks a customer-focused Reliability Engineer. This role focuses on improving customer experience on Azure, involving diagnosing and troubleshooting mission-critical applications. Responsibilities include on-call rotation, collaborating with engineering and product teams, driving product improvements, performing root cause analyses, and identifying/implementing customer-centric mitigation strategies. The ideal candidate will have service engineering experience in a 24/7 enterprise environment, expertise in Azure services, fluency in automation languages, strong communication skills, and a deep understanding of high availability and disaster recovery. The position requires driving continuous improvements in the Azure platform based on customer feedback and working with diverse stakeholders.
Must have:
  • Service engineering experience
  • Azure service expertise
  • Automation language fluency
  • Strong communication skills
  • High availability & DR understanding
Good to have:
  • Windows/Linux knowledge
  • Developer tools experience
  • BS/BA in CS or related field
Perks:
  • Industry leading healthcare
  • Educational resources
  • Product & service discounts
  • Savings and investments
  • Maternity/paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Overview

Are you interested in working on one of Microsoft's most exciting products? Are you passionate about exceeding customer expectations and advancing Microsoft's cloud-first strategy? If so, the Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team is the place for you!

Azure CXP CRE is a top-level pillar of Azure Engineering that leads world-class customer reliability initiatives. It provides modern, customer-centric experiences at scale and infuses deep customer insights and empathy throughout the Azure Engineering organization. Our teams continuously listen to customers, driving enhancements and new capabilities across services, support programs, incident response, community engagements, and more. Our "no dead-ends" philosophy ensures that every customer, regardless of size or scale, can realize their full potential through the Microsoft Cloud.

Azure CXP CRE is seeking a customer-focused Reliability Engineer passionate about customer reliability engineering, including availability, reliability, resiliency, and uptime at scale for the Azure platform. This role is accountable for improving customer experience on Azure and involves diagnosing and troubleshooting mission-critical customer applications built on the Microsoft Azure platform. The ideal candidate will demonstrate technical breadth while managing complex, highly available services and have a deep understanding of the underlying components (Azure Platform, Azure SDK, Azure Portal). They will work directly with customers, customer support, live site teams, and engineering.

To be successful in this role, you must have a proven track record of customer empathy, an engineering mindset, an aptitude for agility, and technical excellence in site reliability engineering.

Qualifications

 

• Must have service engineering experience in a 24/7/365 enterprise environment.
• Desired: Technical expertise in Azure services and capabilities or cloud platforms.
• Fluency in one or more automation languages (e.g., PowerShell, CLI).
• Strong communication skills that enable you to lead and manage communication with customers, internal Microsoft stakeholders, and third-party vendors.
• Understanding of high availability, disaster recovery, business continuity, and performance tuning.
• Demonstrates strategic thinking, quantitative and analytical skills, team leadership, and collaboration.
• Excellent problem resolution, judgment, negotiation, and decision-making skills.
• Desired: Strong knowledge of the Windows platform or Linux, developer tools, and the ability to diagnose and debug user code.
• Effectively manage and prioritize multiple tasks according to high-level objectives and projects.
• Excellent written and oral communication skills; ability to communicate with a variety of audiences, including high-profile customers, executive management, and engineering teams.
• Desired: BS/BA in computer science, engineering, mathematics, or equivalent experience.

 

Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
 

 

 

 

#AzCXP

Responsibilities

• Participate in an on-call coverage rotation (approximately 15% of the time) for platform communications and security.
• Collaborate closely with engineering and product management teams to drive product improvements based on customer feedback.
• Improve the customer experience by analyzing signals from various sources and driving root cause analyses (RCAs) and service improvements involving bug fixes.
• Drive continuous improvement in the Azure platform by incorporating feedback from internal and external customers.
• Identify and drive requirements for enhanced customer resiliency and platform reliability.
• Identify and drive the implementation of customer-centric mitigation strategies and playbooks for operations.
• Participate in the design of next-generation architecture for cloud infrastructure services, with a focus on strategic customer scenarios.
• Be enthusiastic, self-motivated, and a great team player.
• Demonstrate excellent collaboration, organizational, and time management skills.
• Be data-driven with a focus on achieving business results in projects.
• Demonstrate the ability to develop key partnerships.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Redmond, Washington, United States (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Bengaluru, Karnataka, India (On-Site)

Hyderabad, Telangana, India (On-Site)

Busan, Busan, South Korea (On-Site)

Paris, Île-de-France, France (On-Site)

North Holland, Netherlands (On-Site)

Reston, Virginia, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

CloudHire - Microsoft /Inquoto Sales Specialist

CloudHire, United States (On-Site)

Inworld AI - Staff Platform Engineer  - Canada

Inworld AI, Canada (On-Site)

Microsoft - Manager, DPU Software

Microsoft, India (On-Site)

Microsoft - Fiber Delivery Engineer

Microsoft, United States (Hybrid)

Palo Alto Networks - Presales, Prisma Cloud Solutions Architect, Majors

Palo Alto Networks, United States (Remote)

Gameye - Senior DevOps Engineer

Gameye, United States (Remote)

Luxoft - Lead Software Solution Architect

Luxoft, United States (Remote)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Microsoft - Technical Program Manager

Microsoft, Czechia (Remote)

HP - Specialist Software Engineer

HP, United States (On-Site)

Rackspace Technology - Lead AppDev Enterprise Architect

Rackspace Technology, United States (Remote)

InMobiInMobi - Data Scientist III

InMobiInMobi, India (On-Site)

Microsoft - Software Engineer II - Frontend

Microsoft, India (On-Site)

Assurant - Lead DevOps Cloud Engineer

Assurant, India (Remote)

Microsoft - Mechanical Engineer - Data Centre

Microsoft, Australia (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in Sydney, New South Wales, Australia

Aristocrat Gaming - Field Service Manager

Aristocrat Gaming, Australia (Hybrid)

Easygo - VIP Operations Manager

Easygo, Australia (On-Site)

Salesforce - Account Executive Marketing Cloud

Salesforce, Australia (On-Site)

Sinch - Chief of Staff, APAC

Sinch, Australia (Hybrid)

IGT - Warehouse Storeperson

IGT, Australia (On-Site)

Tesla - Technical Support Tier 2 Specialist III

Tesla, Australia (On-Site)

Aristocrat Gaming - Strategic Sourcing Manager

Aristocrat Gaming, Australia (Hybrid)

Get notifed when new similar jobs are uploaded

DevOps Jobs

Hitachi Digital Services - Container Security - Expert (Hyderabad, Bangalore, Pune)

Hitachi Digital Services, India (Hybrid)

Rackspace Technology - Senior Site Reliability Engineer

Rackspace Technology, United States (Remote)

Nielsen Holdings - Devops Engineer (026)

Nielsen Holdings, India (Hybrid)

Thrasio - Cloud Engineer II

Thrasio, India (Remote)

PowerSchool - Cloud Operations Engineer 1

PowerSchool, India (On-Site)

Rackspace Technology - Senior Consulting Cloud Architect

Rackspace Technology, Germany (Remote)

Luxoft - Senior Azure DevOps Engineer

Luxoft, Poland (On-Site)

Get notifed when new similar jobs are uploaded