Site Reliability Engineer

1 Month ago • All levels • DevOps

Job Summary

Job Description

The Site Reliability Engineer (SRE) at Microsoft's Azure Core team will maintain the world's computer, ensuring new servers come online efficiently at hyperscale. Responsibilities involve collaborating with various teams (developers, hardware engineers, datacenter technicians, etc.) to debug and resolve issues, drive continuous improvements, and prevent future problems. This role requires analyzing data to identify problem areas, automating mitigations, and participating in design reviews and problem management. The ideal candidate will have a foundational understanding of distributed systems and experience with programming languages (C, C++, C#, Java). The role involves working with large-scale server and network device management, investigation, and root cause analysis across multiple systems.
Must have:
  • Technical experience in software engineering, network engineering, or systems administration.
  • Distributed systems experience
  • Programming skills (C, C++, C#, Java)
  • Root cause analysis and problem resolution
  • Collaboration with multiple teams
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Come build and maintain the world’s computer as a member of the Microsoft Capacity Infrastructure Services team in Azure Core. The team ensures new servers are brought online (capacity buildout) to enable Azure customers to leverage the latest offerings, see the illusion of infinite capacity, and grow the Azure business efficiently at hyperscale.

As a Site Reliability Engineer, you’ll work with a breadth of partners across Microsoft including developers in service teams, hardware engineers, datacenter technicians, supply chain managers, and business leaders to rapidly debug and resolve issues delaying this carefully orchestrated buildout sequence. You’ll drive continuous improvements with these teams to prevent repeats and address common classes of issues across the Azure software stack through design reviews and problem management.

This opportunity will enable you to gain unparalleled system-wide knowledge of how the Azure cloud is built and maintained. The contacts you make with experts will enable you to deep dive on services and new technologies and partner for improvements. You’ll be stretched to automate mitigations tactically and to analyze data strategically, identifying problem areas to drive prioritization. This role requires flexibility to hold virtual meetings and collaborate with partners worldwide. It supports working from home up to 100% of the time.

 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Qualifications

Required Qualifications: 

  • Technical experience in software engineering, network engineering, or systems administration.
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field.
  • You must be legally authorised to work in Romania to be eligible for this role (legally authorised = has citizenship or has been granted a valid visa or work permit).

 

***Relocation expenses are not provided as part of this role

 

Other Requirements:

  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Additional / Preferred Qualifications: 

  • Distributed systems - developing, debugging, monitoring, and deploying.
  • Programming - C, C++, C#, Java.
  • Systems - hardware and software interface, host and networking, large scale server and network device management, investigation and root cause analysis across multiple systems/services/teams.

 

#Azurecorejobs

Responsibilities

  • Develops a foundational understanding of distributed systems design, interactions between cloud technology layers and components, basic dependencies at scale, and the code that defines infrastructures. Can contribute to the code base that defines components or features of systems or cloud technologies to improve the reliability and operability of supported products, with direction from other engineers.
  • Supports ongoing engagements with product engineering teams by participating in code/design reviews, regular meetings, on-call rotations, and incident responses throughout product development and operations cycles; draws insights from engagements with product engineering teams and basic analyses of telemetry data to propose potential improvements to code and designs for a defined set of product components or features with guidance from other engineers.
  • Implements simple configuration and data changes across a predefined range of product components or features with guidance from other engineers to develop an understanding of how configurations, binaries, and data can be managed using code, tooling, and automation.
  • Develops an understanding of how to safely and reliably manage changes in production by using existing tools and automation to enable product engineering teams to implement changes across a defined range of components or features, with direction from other engineers.
  • Uses existing tools to troubleshoot problems or flaws affecting the availability, reliability, performance, and/or efficiency of components or features with guidance from other engineers. Suggests potential solutions to resolve and prevent recurring issues and brings them to the attention of other engineers or team leads.
  • Responds to incidents during regular on-call rotations by identifying the level of impact, troubleshooting basic issues, and deploying appropriate fixes to resolve root cause(s); alerts product teams or owners to major customer-impacting issues and escalates the resolution of complex issues and/or those affecting multiple components or features to other engineers as needed. Shares details related to incidents and their resolution through post-mortem reports and during regular review meetings.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect


About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
