Software Engineer III, Site Reliability Engineering, Platforms Infrastructure

3 Months ago • 2 Years + • DevOps

Job Summary

Must have:
  • Bachelor’s degree in Computer Science or related field
  • 2 years of experience with data structures/algorithms
  • Software development in one or more programming languages
  • Write product or system development code
  • Review code developed by other engineers
  • Contribute to existing documentation or educational content
  • Triage product or system issues and debug/track/resolve
  • Participate in or lead design reviews
Good to have:
  • Master's degree in Computer Science or Engineering
  • 2 years of experience designing, analyzing, and troubleshooting large-scale distributed systems

Job Details


Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages.

Preferred qualifications:

  • Master's degree in Computer Science or Engineering.
  • 2 years of experience designing, analyzing, and troubleshooting large-scale distributed systems.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services, both our internally critical and our externally visible systems, have reliability and uptime appropriate to customers' needs and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale that are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis, and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving, and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences, and perspectives. We encourage them to collaborate, think big, and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

You will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

Responsibilities

  • Write product or system development code.
  • Review code developed by other engineers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
  • Triage product or system issues, and debug, track, and resolve them by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
  • Participate in, or lead, design reviews with peers and stakeholders to decide among available technologies.

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.
