Senior Site Reliability Engineer

1 Month ago • 5 Years + • Devops

Job Summary

Job Description

GitLab is seeking a Senior Site Reliability Engineer to join their Runway team, which is building a next-generation platform for rapid backend service deployment. The role involves designing, implementing, and maintaining infrastructure on GCP and AWS, creating and maintaining Kubernetes tooling, and improving monitoring, alerting, and logging systems. Responsibilities also include participating in on-call rotations, automating manual processes, leading incident response with postmortem analysis, and contributing to capacity planning and cost optimization. The position requires strong collaboration with cross-functional teams to ensure system reliability, scalability, and performance in a fully remote, asynchronous environment.
Must have:
  • 5+ years of experience in DevOps, SRE, or similar roles
  • Strong experience with GCP and AWS
  • Proficiency with Kubernetes
  • Solid Golang and scripting skills
  • Experience with logging solutions
  • Ability to automate infrastructure
  • Experience with on-call and incident management
  • Strong troubleshooting skills
  • Excellent communication and teamwork
  • Comfortable in a remote, asynchronous environment
Good to have:
  • Experience with Terraform or Pulumi
  • Knowledge of Prometheus, Grafana
  • Secrets management with HashiCorp Vault
  • Experience with Istio or Linkerd
  • Background in distributed systems
  • Cloud-native security best practices
  • Experience with GitLab CI/CD
Perks:
  • Benefits for health, finances, and well-being
  • All remote, asynchronous work environment
  • Flexible Paid Time Off
  • Team Member Resource Groups
  • Equity Compensation & Employee Stock Purchase Plan
  • Growth and development budget
  • Parental leave
  • Home office support

Job Details

GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating human progress. Our platform unites teams and organizations, breaking down barriers and redefining what's possible in software development. Thanks to products like Duo Enterprise and Duo Agent Platform, customers get AI benefits at every stage of the SDLC. 

The same principles built into our products are reflected in how our team works: we embrace AI as a core productivity multiplier, with all team members expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact. GitLab is where careers accelerate, innovation flourishes, and every voice is valued. Our high-performance culture is driven by our values and continuous knowledge exchange, enabling our team members to reach their full potential while collaborating with industry leaders to solve complex problems. Co-create the future with us as we build technology that transforms how the world develops software.

Senior Site Reliability Engineer

About the Role

The GitLab Runway team is working on our next-generation platform to rapidly deploy backend services that automatically take advantage of GitLab infrastructure, security, observability, and data access. It’s a platform engineering project in the truest sense. We’re enabling self-service development across the entire GitLab engineering ecosystem to quickly build and deploy services to complement the GitLab product offerings. 

We're seeking a Senior Site Reliability Engineer to join our team and help us build, maintain, and optimize our infrastructure. In this role, you'll collaborate with cross-functional teams to ensure our systems are reliable, scalable, and performant.

Key Responsibilities

  • Design, implement, and maintain infrastructure on both GCP and AWS
  • Help create and maintain Kubernetes tooling, logging, secrets management and utilities
  • Build and improve monitoring, alerting, and logging systems
  • Participate in on-call rotation to address critical issues
  • Automate manual processes to increase efficiency and reduce errors
  • Lead incident response, including postmortem analysis
  • Contribute to capacity planning and cost optimization

Required Qualifications

  • 5+ years of experience in DevOps, SRE, or similar roles
  • Strong experience with both GCP and AWS cloud platforms
  • Proficiency with Kubernetes and container orchestration
  • Solid programming skills in Golang and scripting languages
  • Experience designing and implementing logging solutions
  • Demonstrated ability to automate infrastructure operations
  • Experience with on-call rotations and incident management
  • Strong troubleshooting and problem-solving skills
  • Excellent communication skills and ability to work in a team
  • Comfortable in a fully remote, heavily asynchronous environment across AMER, EMEA, and APAC regions

Preferred Qualifications

  • Experience with infrastructure as code tools (Terraform, Pulumi)
  • Knowledge of observability tools (Prometheus, Grafana, etc.)
  • Secrets management with HashiCorp Vault or OpenBao
  • Experience with service mesh technologies (Istio, Linkerd)
  • Background in distributed systems and microservice architectures
  • Security best practices in cloud-native environments
  • Experience with GitLab CI/CD pipelines and workflows

 

Mandatory non-technical skills, experience and characteristics

  1. Willingness and ability to live and promote Gitlab's unique CREDIT Values in one's day to day work and interactions with teammates.
  2. Superior verbal and written communication skills
  3. Cool, collected and composed under pressure
  4. Comfortable and productive working asynchronously across timezones and cultures, at the speed and scale of business.
  5. Enable others to excel
  6. Be a Leader of One
  7. Act Like an Owner with Gitlab's resources.

 

How GitLab will support you

 

Please note that we welcome interest from candidates with varying levels of experience; many successful candidates do not meet every single requirement. Additionally, studies have shown that people from underrepresented groups are less likely to apply to a job unless they meet every single qualification. If you're excited about this role, please apply and allow our recruiters to assess your application.

 

 


Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote, however some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location after starting the recruiting process.  

Privacy Policy: Please review our Recruitment Privacy Policy. Your privacy is important to us.

GitLab is proud to be an equal opportunity workplace and is an affirmative action employer. GitLab’s policies and practices relating to recruitment, employment, career development and advancement, promotion, and retirement are based solely on merit, regardless of race, color, religion, ancestry, sex (including pregnancy, lactation, sexual orientation, gender identity, or gender expression), national origin, age, citizenship, marital status, mental or physical disability, genetic information (including family medical history), discharge status from the military, protected veteran status (which includes disabled veterans, recently separated veterans, active duty wartime or campaign badge veterans, and Armed Forces service medal veterans), or any other basis protected by law. GitLab will not tolerate discrimination or harassment based on any of these characteristics. See also GitLab’s EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know during the recruiting process.

Similar Jobs

Luxoft - Android Developer

Luxoft

Gurugram, Haryana, India (On-Site)
8 Months ago
Apple - Sports Viewership & Revenue Growth

Apple

Culver City, California, United States (On-Site)
1 Month ago
Abridge - Partner Success Director, Key Accounts

Abridge

New York, New York, United States (Hybrid)
2 Months ago
Minecast - Senior Tax Analyst

Minecast

London, England, United Kingdom (On-Site)
3 Months ago
Illumina - Staff Data Engineer

Illumina

Bengaluru, Karnataka, India (On-Site)
1 Month ago
deel. - Senior Backend Engineer, Node.js + AWS

deel.

Malta (Remote)
3 Weeks ago
Game Boost - Senior Build Engineer – Bazel Expert

Game Boost

Stockholm, Stockholm County, Sweden (Remote)
3 Months ago
SimpliSafe - CCaaS Solutions Architect

SimpliSafe

Boston, Massachusetts, United States (Hybrid)
2 Months ago
Demandbase - Staff Platform Engineer (DevOps)

Demandbase

Hyderabad, Telangana, India (Remote)
3 Months ago
bytedance - Senior Software Engineer, Multi Cloud CDN - San Jose / Seattle / Boston

bytedance

Seattle, Washington, United States (On-Site)
8 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Yodo1 - UA Manager (Gaming)

Yodo1

India (Remote)
1 Year ago
C3 IoT - AI Engagement Manager / Director - Energy

C3 IoT

Atlanta, Georgia, United States (On-Site)
2 Months ago
Abridge - Senior Operations Manager

Abridge

San Francisco, California, United States (Hybrid)
3 Months ago
Mindtickle - Specialist, GTM Strategy & Operations

Mindtickle

Pune, Maharashtra, India (Hybrid)
3 Weeks ago
Realworld one - Senior Account Manager

Realworld one

Freiburg, Lower Saxony, Germany (On-Site)
9 Months ago
AiDash - Technical Writer

AiDash

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Tekion Corp - Senior Creative Content Writer

Tekion Corp

Bengaluru, Karnataka, India (On-Site)
3 Months ago
bytedance - IT Program Manager

bytedance

San Jose, California, United States (On-Site)
3 Weeks ago
Moloco - Product Design Lead

Moloco

Redwood City, California, United States (On-Site)
2 Months ago
Calix - Senior Sales Engineer – Major Accounts

Calix

United States (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Canada

Thumbtack - Senior Data Engineer

Thumbtack

Ontario, Canada (Remote)
2 Months ago
Canonical - MAAS Systems Engineer - Python

Canonical

Toronto, Ontario, Canada (Hybrid)
3 Months ago
Autodesk - Manager, Software Development - Global Developer Relations

Autodesk

Vancouver, British Columbia, Canada (On-Site)
2 Months ago
PwC - Accounting and Transaction Advisory Senior Manager

PwC

Montreal, Quebec, Canada (On-Site)
10 Months ago
Hyper Hippo - Register Your Interest

Hyper Hippo

Canada (Remote)
2 Months ago
Unity - Senior Developer Support Engineer

Unity

Montreal, Quebec, Canada (On-Site)
3 Months ago
bounteous - IAM Reliability Engineer

bounteous

Montreal, Quebec, Canada (Hybrid)
1 Month ago
Track VFX - Vancouver Technical Artist

Track VFX

Vancouver, British Columbia, Canada (On-Site)
4 Months ago
Highspot - Software Development Engineer

Highspot

Vancouver, British Columbia, Canada (Hybrid)
1 Month ago
Cineplex - Part Time Cast Member

Cineplex

Thunder Bay, Ontario, Canada (On-Site)
1 Year ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Malabar Gold & Diamonds - Executive - Cloud Engineer

Malabar Gold & Diamonds

Sri Vijaya Puram, Andaman And Nicobar Islands, India (On-Site)
1 Year ago
Mindtickle - Solution Architect

Mindtickle

Pune, Maharashtra, India (Hybrid)
6 Months ago
zeta - Site Reliability Engineer I

zeta

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
extreme network - Solutions Architect

extreme network

Texas, United States (Remote)
1 Month ago
deel. - Back-End Engineer - Infrastructure Team

deel.

Brazil (Remote)
3 Weeks ago
zoox - Senior Enterprise Solutions Engineer

zoox

Foster City, California, United States (On-Site)
5 Months ago
datcroft - DEVOPS ENGINEER

datcroft

Voronezh, Voronezh Oblast, Russia (On-Site)
3 Months ago
Nice - Solution Engineer - NG911

Nice

United States (Remote)
2 Months ago
PhonePe - Server Administrator (Devops and Linux)

PhonePe

Bengaluru, Karnataka, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded