Customer Experience Engineering

8 Months ago • All levels
Devops

Job Description

The Azure Customer Experience (CXP) team at Microsoft seeks a Principal Site Reliability Engineer to design, implement, and maintain robust SLO monitoring systems for customer applications hosted in Azure. This role is crucial for ensuring the reliability, availability, and performance of these applications. Responsibilities include implementing and managing Service Level Objectives (SLOs) and Service Level Indicators (SLIs), designing and implementing monitoring solutions using tools like OpenTelemetry, Prometheus, Grafana, Dynatrace, Datadog, and Azure Monitor. The ideal candidate will have extensive experience in designing observability and monitoring solutions, strong customer-facing skills, a growth mindset, and excellent communication abilities. They should be passionate about customers and focused on delivering exceptional customer experiences.
Good To Have:
  • Advanced certifications in SRE or related fields
  • Experience with AI/ML for monitoring and observability
Must Have:
  • Proven expertise in implementing and managing SLOs/SLIs
  • Experience designing and implementing monitoring solutions for cloud customers
  • Extensive experience with monitoring tools (OpenTelemetry, Prometheus, Grafana, etc.)
  • Strong customer-facing skills and communication abilities
  • Experience with Azure (or AWS/GCP) observability and monitoring solutions
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Add these skills to join the top 1% applicants for this job

microsoft-azure
grafana
azure
aws
prometheus
communication
agile-development
problem-solving
team-management

Overview

Every minute of every day, customers stake their entire business and reputation on Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers into fans. 

We are customer obsessed problem-solvers. We orchestrate deep engagements in areas like incident management, support and enablement. We analyze and amplify those customer voices, both within our own team, and across the Cloud + AI team, bringing the customer connection to the Quality vision for Azure. We innovate ways to scale what we learn across our customer baseDiversity and inclusion are central to who we are, how we work, and what we enable our customers to achieve. We know that empowering our customers starts with empowering our team to show up authentically, work in ways that are best for them, and achieve their career goals. 

Would you like to join one of the fastest-growing teams within Microsoft Azure Engineering? Are you constantly customer-obsessed, and focused on enhancing customer experience? Are you passionate about cloud computing and love the challenge of solving the most complex technical problems? Are you interested in a start-up like environment, passionate about building automations, observability, proactive & SLO monitoring experiences? 

Our organization is looking for you, a customer obsessed Principal Site Reliability Engineer with extensive experience in implementing Service Level Objectives (SLOs) monitoring solutions to top Azure customers. As a key member of our Observability team, you will play a critical role in ensuring the reliability, availability, and performance of customer applications hosted in Microsoft Azure. You will be responsible for designing, implementing, and maintaining robust SLO monitoring systems to track and meet the service level objectives defined in our offerings, customer engagement agreements. This position is critical to the success of our team's charter and embodies our inclusive culture, growth & learning mindsets, and unwavering dedication to diversity. 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. 

“Customer obsession”, “measure what matters”, “no dead-ends”, “get it done”, “collaboration” “teamwork” , “whatever it takes” are few characteristics we look for in this role. We are growing fast but remain agile.  

Qualifications

 

  • Degree: Bachelor’s or master’s degree in computer engineering (or equivalent) 
    • Technical Skills: 
    • Proven expertise in implementing and managing Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for cloud customers.  
    • Proven experience in designing and implementing monitoring solutions for customers. 
    • Extensive experience with monitoring tools and platforms 
    • Advanced certifications in SRE or related fields. 
    • Experience in observability, SRE OpenTelemetry, Prometheus, Grafana, Dynatrace, Datadog, AzureMonitor, AI, ML 

    #AZCXP #AZCXPACES #ACES500 #AZCXPSUPPORT, #AzureCXP

T        The ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirequirements include, but are not limited to the following specialized security screenings: 

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter. 

 

Responsibilities

  • Experience:
  • At least proven years of experience with designing, implementing, debugging and launching commercial software products or web services.  
    Expertise in designing and implementing monitoring design and Architectures for end customers in Azure (or AWS/GCP) 

Proven years of experience in designing Observability and monitoring solutions in Azure(or AWS/GCP), SLO/SLI Implementation is a plus. 

Proven years of experience in an external client facing role or customer handling. 

 

  •  
  • Customer Obsession: Passion for customers and focus on delivering the right customer experience. 
  • Growth Mindset: Openness and ability to learn new skills and technologies in a fast-paced environment. 
  • Excellent Communication: Must have the ability to empathize with customers and convey confidence. Able to explain highly technical issues to varied audiences. Able to prioritize and advocate customer’s needs to the proper channels. Take ownership and work towards a resolution. 
    •  

     

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Set alerts for more jobs like Customer Experience Engineering
Set alerts for new jobs by Microsoft
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙