Customer Experience Engineering

1 Month ago • All levels • DevOps

Job Summary

Job Description

The Azure Customer Experience (CXP) team at Microsoft seeks a Principal Site Reliability Engineer to design, implement, and maintain robust SLO monitoring systems for customer applications hosted in Azure. This role is crucial for ensuring the reliability, availability, and performance of these applications. Responsibilities include implementing and managing Service Level Objectives (SLOs) and Service Level Indicators (SLIs), designing and implementing monitoring solutions using tools like OpenTelemetry, Prometheus, Grafana, Dynatrace, Datadog, and Azure Monitor. The ideal candidate will have extensive experience in designing observability and monitoring solutions, strong customer-facing skills, a growth mindset, and excellent communication abilities. They should be passionate about customers and focused on delivering exceptional customer experiences.
Must have:
  • Proven expertise in implementing and managing SLOs/SLIs
  • Experience designing and implementing monitoring solutions for cloud customers
  • Extensive experience with monitoring tools (OpenTelemetry, Prometheus, Grafana, etc.)
  • Strong customer-facing skills and communication abilities
  • Experience with Azure (or AWS/GCP) observability and monitoring solutions
Good to have:
  • Advanced certifications in SRE or related fields
  • Experience with AI/ML for monitoring and observability
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Every minute of every day, customers stake their entire business and reputation on Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers into fans. 

We are customer obsessed problem-solvers. We orchestrate deep engagements in areas like incident management, support and enablement. We analyze and amplify those customer voices, both within our own team, and across the Cloud + AI team, bringing the customer connection to the Quality vision for Azure. We innovate ways to scale what we learn across our customer baseDiversity and inclusion are central to who we are, how we work, and what we enable our customers to achieve. We know that empowering our customers starts with empowering our team to show up authentically, work in ways that are best for them, and achieve their career goals. 

Would you like to join one of the fastest-growing teams within Microsoft Azure Engineering? Are you constantly customer-obsessed, and focused on enhancing customer experience? Are you passionate about cloud computing and love the challenge of solving the most complex technical problems? Are you interested in a start-up like environment, passionate about building automations, observability, proactive & SLO monitoring experiences? 

Our organization is looking for you, a customer obsessed Principal Site Reliability Engineer with extensive experience in implementing Service Level Objectives (SLOs) monitoring solutions to top Azure customers. As a key member of our Observability team, you will play a critical role in ensuring the reliability, availability, and performance of customer applications hosted in Microsoft Azure. You will be responsible for designing, implementing, and maintaining robust SLO monitoring systems to track and meet the service level objectives defined in our offerings, customer engagement agreements. This position is critical to the success of our team's charter and embodies our inclusive culture, growth & learning mindsets, and unwavering dedication to diversity. 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. 

“Customer obsession”, “measure what matters”, “no dead-ends”, “get it done”, “collaboration” “teamwork” , “whatever it takes” are few characteristics we look for in this role. We are growing fast but remain agile.  

Qualifications

 

  • Degree: Bachelor’s or master’s degree in computer engineering (or equivalent) 
    • Technical Skills: 
    • Proven expertise in implementing and managing Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for cloud customers.  
    • Proven experience in designing and implementing monitoring solutions for customers. 
    • Extensive experience with monitoring tools and platforms 
    • Advanced certifications in SRE or related fields. 
    • Experience in observability, SRE OpenTelemetry, Prometheus, Grafana, Dynatrace, Datadog, AzureMonitor, AI, ML 

    #AZCXP #AZCXPACES #ACES500 #AZCXPSUPPORT, #AzureCXP

T        The ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirequirements include, but are not limited to the following specialized security screenings: 

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter. 

 

Responsibilities

  • Experience:
  • At least proven years of experience with designing, implementing, debugging and launching commercial software products or web services.  
    Expertise in designing and implementing monitoring design and Architectures for end customers in Azure (or AWS/GCP) 

Proven years of experience in designing Observability and monitoring solutions in Azure(or AWS/GCP), SLO/SLI Implementation is a plus. 

Proven years of experience in an external client facing role or customer handling. 

 

  •  
  • Customer Obsession: Passion for customers and focus on delivering the right customer experience. 
  • Growth Mindset: Openness and ability to learn new skills and technologies in a fast-paced environment. 
  • Excellent Communication: Must have the ability to empathize with customers and convey confidence. Able to explain highly technical issues to varied audiences. Able to prioritize and advocate customer’s needs to the proper channels. Take ownership and work towards a resolution. 
    •  

     

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Similar Jobs

Gameopedia - Sr Manager IT

Gameopedia

Hyderabad, Telangana, India (On-Site)
3 Months ago
Palo Alto Networks - Presales, Prisma Cloud Solutions Architect, Majors

Palo Alto Networks

Chicago, Illinois, United States (Remote)
3 Months ago
Magna International - Full-Stack Developer

Magna International

Bengaluru, Karnataka, India (On-Site)
4 Months ago
XenServer - Senior Escalation Engineer

XenServer

Bengaluru, Karnataka, India (On-Site)
5 Months ago
PlayStation Global - IT Systems Engineer - Cloud

PlayStation Global

Aliso Viejo, California, United States (On-Site)
3 Months ago
GoTo Group - Site Reliability Engineer - EP (SE4)

GoTo Group

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Luxoft - Salesforce Developer

Luxoft

Bucharest, Bucharest, Romania (On-Site)
3 Months ago
Keywords Studios (Player Support) - Architecte de solutions

Keywords Studios (Player Support)

Montréal, Québec, Canada (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Playtech - Junior Cloud Security Engineer

Playtech

(On-Site)
4 Months ago
PwC - IN-Senior Associate_Azure data Engineer_Data &  Analytics_Advisory_PAN India

PwC

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Visa - Machine Learning Engineer - Sr Consultant - Cloud AI Platform

Visa

Austin, Texas, United States (Hybrid)
2 Months ago
RSM US LLP - IT Cloud Management Analyst 1

RSM US LLP

Hyderabad, Telangana, India (Hybrid)
3 Months ago
Lulalend - Senior Site Reliability Engineer

Lulalend

Cape Town, Western Cape, South Africa (On-Site)
4 Months ago
Mouser Electronics - Cloud Engineer II

Mouser Electronics

Pune, Maharashtra, India (On-Site)
4 Months ago
Ecolab - Senior Software Engineer

Ecolab

Bengaluru, Karnataka, India (On-Site)
3 Months ago
CloudHire - Microsoft /Inquoto Sales Specialist

CloudHire

Charlotte, North Carolina, United States (On-Site)
4 Months ago
Microsoft - Sr. AI HW Quality Engineer

Microsoft

Taipei City, Taiwan (On-Site)
1 Month ago
PwC - ETIC, Cloud DevOps Lead - M

PwC

Cairo, Cairo Governorate, Egypt (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in undefined

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

DevOps Jobs

Virtana Corp - Senior Software Engineer

Virtana Corp

Pune, Maharashtra, India (Remote)
4 Months ago
Glean - Solutions Architect ( EMEA/US East Customer hours )

Glean

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Sony Interactive Entertainment - Senior Cloud Security Engineer

Sony Interactive Entertainment

Tokyo, Japan (On-Site)
2 Months ago
Swiss Re - Senior Cloud Engineer

Swiss Re

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Pentair - DevOps Engineer- IoT

Pentair

Noida, Uttar Pradesh, India (On-Site)
4 Months ago
Luxoft - Senior Cloud Engineer

Luxoft

Gurugram, Haryana, India (On-Site)
2 Months ago
Adtran - Software Engineer (Devops)

Adtran

Hyderabad, Telangana, India (On-Site)
4 Months ago
Imagineio - DevOps Engineer

Imagineio

Delhi, India (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

London, England, United Kingdom (On-Site)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

New York, New York, United States (Hybrid)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

London, England, United Kingdom (On-Site)

Dublin, County Dublin, Ireland (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug