Site Reliability Engineer II

34 Minutes ago • 4-5 Years • DevOps • $98,300 PA - $208,800 PA

Job Summary

Job Description

The Site Reliability Engineer II at Microsoft's Azure Data team ensures the reliability, scalability, and performance of Microsoft Fabric and other data services. Responsibilities include working with high-throughput, multi-tenant services; collaborating with internal and partner teams; participating in on-call rotations; designing, implementing, and refining solutions; championing operational excellence; documenting data engineering processes; and ensuring system reliability, uptime, and performance. This role requires strong scripting skills (PowerShell, Python), experience with automation, and a focus on incident management, performance monitoring, and continuous improvement.
Must have:
  • 4+ years experience in relevant field
  • 2+ years scripting experience (PowerShell, Python)
  • Experience with automation
  • System reliability & uptime
  • Incident management
  • Collaboration skills
Good to have:
  • 5+ years experience
  • Experience with Azure services

Job Details

Overview

Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world.

 

Microsoft’s Azure Data engineering team is leading the transformation of analytics in the world of data with products like databases, data integration, big data analytics, messaging & real-time analytics, and business intelligence. The products our portfolio include Microsoft Fabric, Azure SQL DB, Azure Cosmos DB, Azure PostgreSQL, Azure Data Factory, Azure Synapse Analytics, Azure Service Bus, Azure Event Grid, and Power BI. Our mission is to build the data platform for the age of AI, powering a new class of data-first applications and driving a data culture.

 

Within Azure Data, the Microsoft Fabric platform team builds and maintains the operating system and provides customers a unified data stack to run an entire data estate. The platform provides a unified experience, unified governance, enables a unified business model and a unified architecture.

 

This team (SRE) ensures the reliability, scalability, and performance of systems and services. By integrating software engineering with IT operations, the team automates processes, manages incidents, and enhances system resilience. Acting as a bridge between development and operations, SREs help organizations maintain highly reliable and efficient systems while enabling fast and seamless software delivery.

 

We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served.

Qualifications

Required/Minimum Qualifications
• 4+ years technical experience in software engineering, network engineering, or systems administration OR bachelor's degree in computer science, Information Technology, or related field AND 2+ year(s) technical experience in software engineering, network engineering, or systems administration OR Master's Degree in Computer Science, Information Technology, or related field.
• 2+ years’ experience with scripting languages such as PowerShell, Python etc.
• Experience writing code to automate day-to-day tasks.

 

Other Requirements
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check:
• This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

 

Preferred/Additional Qualifications
• 5+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration.

 

Site Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $98,300 - $193,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $127,200 - $208,800 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

 

Microsoft will accept applications and processes offers for these roles on an ongoing basis.

 

 

#azdat
#azuredata
#sre
#fabric
#powerbi

Responsibilities

• Work with all aspects of a high throughput and multi-tenant service
• Collaborate effectively within the team and with partner teams across Microsoft.
• Be part of the on-call rotation for maintaining service health.
• Design, implement, and refine chosen solutions in close partnership with Product Management and partner teams.
• Champion operational excellence via established metrics, process governance, and policy controls for regular assessment and improvement.
• Document and define existing data engineering processes, data and technology, while evaluating them for optimization.


Core responsibilities breakdown includes:
• System Reliability & Uptime – Ensuring high availability of services.
• Incident Management – Detecting, responding to, and mitigating system failures.
• Performance Monitoring – Tracking system health and resolving bottlenecks.
• Automation & Tooling – Reducing manual work through scripts and automation.
• Capacity Planning – Scaling infrastructure efficiently to handle demand.
• Postmortems & Continuous Improvement – Analyzing failures to prevent recurrence.
• Embody our and

Similar Jobs

Microsoft - Environmental, Social and Governance Disclosure Director

Microsoft

Dublin, County Dublin, Ireland (On-Site)
6 Hours ago
Match Group - Machine Learning Engineer

Match Group

New York, New York, United States (Hybrid)
6 Months ago
The Walt Disney Company - Lead Software Engineer - Full-Stack

The Walt Disney Company

Santa Monica, California, United States (On-Site)
3 Weeks ago
Microsoft - Principal Data Science Manager

Microsoft

Redmond, Washington, United States (Hybrid)
6 Hours ago
Werplay - QA Engineer

Werplay

Islamabad, Islamabad Capital Territory, Pakistan (On-Site)
3 Months ago
Intrepid Studios,  Inc  - Associate Software Engineer

Intrepid Studios, Inc

(Remote)
2 Months ago
PlayStation Global - IT Support Engineer II

PlayStation Global

London, England, United Kingdom (On-Site)
1 Month ago
Crunchyroll - DevOps Engineer, Core Infrastructure Engineering

Crunchyroll

San Francisco, California, United States (Hybrid)
1 Month ago
ByteDance - Solutions Architect

ByteDance

(On-Site)
3 Weeks ago
 Vizrt - Director of Platform

Vizrt

Lisbon, Lisbon, Portugal (Remote)
5 Days ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Onward Search - Software Engineer

Onward Search

Rochester, Minnesota, United States (Remote)
2 Days ago
GoMotive - Software Engineer, Backend

GoMotive

India (Remote)
4 Weeks ago
Definitive Healthcare - IT Support Engineer

Definitive Healthcare

Bengaluru, Karnataka, India (On-Site)
4 Months ago
PwC - Azure Cloud Solutions Architect, Senior Manager

PwC

Toronto, Ontario, Canada (On-Site)
4 Months ago
Beghou Consulting - Software Developer – Full Stack

Beghou Consulting

Hyderabad, Telangana, India (Hybrid)
6 Months ago
Ajmera Infotech - Site Reliability Engineer (SRE) - Kubernetes

Ajmera Infotech

Austin, Texas, United States (On-Site)
2 Months ago
Hitachi - CE Developers-Jul-2024

Hitachi

Bengaluru, Karnataka, India (On-Site)
6 Months ago
PwC - Senior Data Engineer

PwC

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
6 Months ago
PwC - IN-Senior Associate – D365- PMO -Ms Dynamics– Advisory  - - Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Luxoft - BI Developer (SSIS and SSAS)

Luxoft

Gurugram, Haryana, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Next Level Business Services - Business Analyst - Mobility

Next Level Business Services

Collegeville, Pennsylvania, United States (On-Site)
6 Months ago
ByteDance - Immersive Video Research Intern (Multimedia Streaming) 2023 Summer/Fall (BS)

ByteDance

San Diego, California, United States (On-Site)
5 Months ago
Next Level Business Services - Adobe CQ5/AEM Developer (Full Time)

Next Level Business Services

Sunnyvale, California, United States (On-Site)
5 Months ago
ByteDance - Software Engineer Graduate (RDMA Network- High Speed Network)

ByteDance

San Jose, California, United States (On-Site)
2 Days ago
Canva - Strategic Partnership Manager

Canva

San Francisco, California, United States (Remote)
1 Month ago
ByteDance - Principal Product Manager - IaaS AI Infra

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
NVIDIA - Director of Product - AI Training Platform Software

NVIDIA

Santa Clara, California, United States (On-Site)
4 Weeks ago
DraftKings - Risk Payment Operations Associate

DraftKings

Las Vegas, Nevada, United States (On-Site)
4 Days ago
ByteDance - Tech Lead Manager, Network Security

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
ByteDance - Senior Software Engineer - IaaS AI Infra

ByteDance

San Jose, California, United States (On-Site)
2 Days ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

CharacterAI - Software Engineer, Machine Learning Infrastructure

CharacterAI

New York, New York, United States (On-Site)
3 Weeks ago
Rackspace Technology - Senior Site Reliability Engineer - GCP Focussed

Rackspace Technology

(Remote)
1 Week ago
Info Stretch - Lead Data Engineer

Info Stretch

Bengaluru, Karnataka, India (On-Site)
5 Months ago
N-iX - Senior DevOps Engineer

N-iX

Argentina (Remote)
3 Weeks ago
Omnissa - Member of technical staff (C++,iOS)

Omnissa

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Razer - Software Engineer (DevOps)

Razer

Shah Alam, Selangor, Malaysia (On-Site)
6 Months ago
ByteDance - Cloud Solution Architect (Automotive Industry)

ByteDance

(On-Site)
3 Weeks ago
N-iX - Middle DevOps Engineer

N-iX

Colombia (Remote)
2 Days ago
Canva - Senior Software Engineer (Cloud FinOps) - remote across ANZ

Canva

Sydney, New South Wales, Australia (Remote)
3 Months ago
Hitachi - Senior Offshore Azure Infrastructure - EST Shift

Hitachi

Pune, Maharashtra, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Redmond, Washington, United States (Hybrid)

New York, New York, United States (On-Site)

Redmond, Washington, United States (On-Site)

Beijing, Beijing, China (On-Site)

Hyderabad, Telangana, India (On-Site)

Barcelona, Catalonia, Spain (On-Site)

Prague, Prague, Czechia (Hybrid)

Prague, Prague, Czechia (Hybrid)

São Paulo, State Of São Paulo, Brazil (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug