Senior Site Reliability Engineering Manager

44 Minutes ago • 6-7 Years • Network Engineering • Research & Development • $117,200 PA - $250,200 PA

Job Summary

Job Description

The Senior Site Reliability Engineering Manager at Azure Storage will lead a team in optimizing fleet availability and health for a massive, globally distributed system. Responsibilities include designing, developing, and improving automation and uptime; planning and investigating complex issues at scale; and driving sprint planning, code reviews, and cross-team meetings. The role requires deep technical expertise in server architecture, troubleshooting, and distributed systems. Incident response and post-mortem reporting are also key aspects, along with contributing to cost reduction initiatives. The position offers significant impact and high-level visibility within Microsoft.
Must have:
  • 6+ years experience in relevant field
  • 4+ years Agile/SCRUM experience
  • Lead large cross-team efforts
  • Investigate and solve complex issues
  • Develop and improve automation
  • Incident response and post-mortem analysis
Good to have:
  • Understanding of server architecture
  • Familiarity with distributed systems
  • Experience with management techniques

Job Details

Overview

Are you passionate about hardware and enabling new technology? Do you enjoy complex problem solving and investigation? Azure has one of the largest storage services on the planet, holding Exabytes of data and files not just for our 3rd party customers, but also many of Microsoft’s own services. This role will focus on managing an ever growing and changing fleet at scale to maximize efficiency while providing a stable environment for our customers.  

As a Senior Site Reliability Engineering Manager in Azure Storage team you will be working with a team of engineers focused on optimizing fleet availability and health. Leading a team of engineers to design, develop and improve automation and uptimeYou will take lead of planning, investigating complex issues and designing solutions to solve problems at scale. 

This opportunity will allow you to deepen your knowledge and experience with massive distributed systems. Opportunities to have significant impact on reducing cost to the business. Exposure and visibility at VP and CVP levels.  This position is located in Redmond and has a flexible work environment that supports working from home. 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. 

Qualifications

Required Qualifications:

  • 6+ years technical experience in software engineering, network engineering, or systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
    • OR Master's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration.
  • 4+ years of Agile / SCRUM planning, and leading large cross team efforts.

 

Other Requirements:

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: 
    • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

 

Preferred Qualifications:

  • 7+ years technical experience in software engineering, network engineering, or systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration
    • OR Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering,
  • Understanding of server architecture and the ability to debug and trouble shoot isues impacting the fleet.
  • Understadning of server componants, Firmware, BIOS and how they interact. 
  • Understanding management techinques, and methods for ensuring scope control.
  • Familiarity with distributed systems. 

 

Site Reliability Engineering M4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:


Microsoft will accept applications for the role until September 9, 2024.

 

 

#azurecorejobs

Responsibilities

  • Develop, test, and implement changes to optimize code and improve scalability. You leverage end-to-end technical expertise and telemetry analysis to identify patterns and opportunities to implement configuration and automation improvments. You review the effect of changes to documents and share development insights within your team.  
  • You drive Sprint planning, SCRUM stand ups, code/design reviews, and host regular cross team / org meetings. 
  • Investigate hardware and system issues that are impacting available capacity and impacting customers. 
  • Understand the long term goals of the organization and understand the steps your team will have to take to achieve those. 
  • You respond to incidents during regular on-call rotations and share details related to incidents and their resolution through post-mortem reports and regular review meetings. As a member of the team you willl be expected to help drive bridges for recovery durring major outages. 
  • Embody our  and   

Similar Jobs

Playtika - PHP Developer

Playtika

Netherlands (Hybrid)
3 Weeks ago
Egnyte - Sr DevOps Engineer - Azure

Egnyte

India (Remote)
3 Weeks ago
Ajmera Infotech - React Developer

Ajmera Infotech

Hyderabad, Telangana, India (On-Site)
6 Months ago
Zones - Azure Backend Developer

Zones

Noida, Uttar Pradesh, India (On-Site)
5 Months ago
Microsoft - Director Sourcing/Supply Management

Microsoft

Redmond, Washington, United States (On-Site)
1 Hour ago
ByteDance - Cloud Network Engineer

ByteDance

Seattle, Washington, United States (On-Site)
1 Day ago
ByteDance - Software Engineer Graduate (RDMA Network - High Speed Network)

ByteDance

Seattle, Washington, United States (On-Site)
3 Weeks ago
ByteDance - Site Reliability Engineer, Edge Services

ByteDance

Boston, Massachusetts, United States (On-Site)
1 Day ago
ByteDance - Site Reliability Engineer

ByteDance

San Jose, California, United States (On-Site)
1 Day ago
Tencent - Tencent Cloud - Senior Cloud Network Engineer

Tencent

(On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Trend Micro - Senior Software Development Engineer

Trend Micro

Manila, Metro Manila, Philippines (Hybrid)
6 Months ago
PwC - IN-Associate_Full Stack Developer(Node JS)_MS Engg_Advisory_Kolkata

PwC

Kolkata, West Bengal, India (On-Site)
6 Months ago
Arrise Solutions (India)   - Senior ML Engineer

Arrise Solutions (India)

Hyderabad, Telangana, India (On-Site)
7 Months ago
PwC - IN-Manager – D365 Scm -Ms Dynamics– Advisory  - Mumbai

PwC

Mumbai, Maharashtra, India (On-Site)
6 Months ago
ByteDance - Senior Software Engineer - IaaS AI Infra

ByteDance

San Jose, California, United States (On-Site)
1 Day ago
Toppan Merrill - Systems Engineer

Toppan Merrill

Chennai, Tamil Nadu, India (On-Site)
6 Months ago
N-iX - Senior .NET Full-Stack Engineer

N-iX

Poland (Hybrid)
3 Weeks ago
PwC - Senior Associate_Azure Data Engineer_Data & Analytics_Advisory_PAN  India

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
ION - Cloud Engineer/Architect (DevOps)

ION

Italy (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Ziff Davis - Senior Software Engineer, Backend - Lose It!

Ziff Davis

United States (On-Site)
5 Months ago
Trek - Sales Associate

Trek

Redmond, Washington, United States (On-Site)
2 Months ago
Niantic - Wayfarer Operations Program Lead

Niantic

Sunnyvale, California, United States (Hybrid)
6 Days ago
Scientific Games  - Marketing Manager – CRM, Affiliates, and Promotions

Scientific Games

Pennsylvania, United States (Remote)
1 Day ago
Aristocrat Gaming - Vice President, Finance – Product Development & Technology

Aristocrat Gaming

Las Vegas, Nevada, United States (Hybrid)
4 Days ago
Epic Games - Senior Data Analyst

Epic Games

Cary, North Carolina, United States (On-Site)
2 Months ago
Onward Search - Fullstack Engineer

Onward Search

San Jose, California, United States (On-Site)
3 Weeks ago
Nagarro - Associate Staff Engineer, Database Oracle

Nagarro

New York, New York, United States (On-Site)
5 Months ago
NVIDIA - Senior Backend Software Engineer – GeForce NOW Cloud

NVIDIA

Santa Clara, California, United States (On-Site)
4 Days ago
ByteDance - Senior Machine Learning Ops Engineer, ML System

ByteDance

San Jose, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Network Engineering Jobs

ByteDance - AI/LLM Network Software Engineer (High Speed Network)

ByteDance

Seattle, Washington, United States (On-Site)
3 Weeks ago
Playtika - IT Infrastructure Engineer

Playtika

Poland (Hybrid)
5 Months ago
Meta - Technical Program Manager, Net Infra (Backbone)

Meta

Menlo Park, California, United States (On-Site)
5 Months ago
ByteDance - Research Scientist Intern (Traffic Infrastructure Global Engineering)

ByteDance

Seattle, Washington, United States (On-Site)
1 Day ago
ByteDance - Network Software Development Engineer, High Speed Network

ByteDance

Seattle, Washington, United States (On-Site)
3 Weeks ago
Meta - Network Production Engineer, Network Infrastructure

Meta

New York, New York, United States (On-Site)
4 Months ago
ByteDance - Software Engineer Graduate (Multi Cloud CDN)

ByteDance

San Jose, California, United States (On-Site)
1 Day ago
ION - Network Engineer - 7401

ION

Noida, Uttar Pradesh, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Redmond, Washington, United States (On-Site)

Mountain View, California, United States (On-Site)

Redmond, Washington, United States (On-Site)

Redmond, Washington, United States (Hybrid)

Redmond, Washington, United States (Hybrid)

Redmond, Washington, United States (On-Site)

Vancouver, British Columbia, Canada (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug