Databases Site Reliability Engineer

1 Month ago • 2 Years + • DevOps


Minimum qualifications:

  • Bachelor's degree or equivalent practical experience.
  • 2 years of experience with programming in one or more programming languages.
  • Experience with Unix/Linux operating systems internals and administration or networking.

Preferred qualifications:

  • Experience with Site Reliability Engineering, System Design, and Distributed Computing.
  • Experience delivering projects in systems.
  • Excellent influencing skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services, both our internally critical and our externally-visible systems, have reliability, uptime appropriate to users' needs and a fast rate of improvement. Additionally, SREs will keep an ever-watchful eye on our systems' capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation.

On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google, while using your expertise in coding, algorithms, complexity analysis and large-scale system design.

SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

To learn more, check out our books on Site Reliability Engineering or read about why a Software Engineer chose to join SRE.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities

  • Collaborate with other teams and the Cloud Support organization to ensure Spanner is easy to manage and meets customers' needs with minimal operational load.
  • Plan and execute projects that improve reliability or efficiency.
  • Participate in an on-call rotation as required.
  • Manage the responsibilities of the GCP Spanner allocations.

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

San Francisco, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Warsaw, Masovian Voivodeship, Poland (On-Site)

San Bruno, California, United States (On-Site)

Mexico City, Mexico City, Mexico (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Hyderabad, Telangana, India (On-Site)

Sunnyvale, California, United States (On-Site)
