Intermediate Site Reliability Engineer, Foundations

4 Months ago • All levels • Devops

Job Summary

Job Description

GitLab is seeking an Intermediate Site Reliability Engineer (SRE) to join their Foundations team. The SRE will be responsible for maintaining the smooth operation of user-facing services and production systems. This role involves applying engineering principles and automation to operating environments and the GitLab codebase, with a focus on systems, reliability, and scalability. Responsibilities include designing and implementing scalable networking infrastructure, collaborating with cross-functional teams, responding to incidents in an on-call rotation, leading initiatives, and acting as a subject matter expert in networking and rate limiting services. A key aspect of the role is automating operational tasks.
Must have:
  • Google Cloud Platform expertise (networking, GKE)
  • Terraform infrastructure as code experience
  • Ansible/Chef configuration management experience
  • Kubernetes ecosystem (Helm) experience
  • Programming in Ruby or Go
  • Understanding of network protocols
  • Scripting languages (Ruby, Go, Bash)
  • GitLab CI experience
  • Problem definition and solution thinking
  • Drive for automation
  • Manager of one mindset
  • Bias for action
  • Independent, proactive, self-organized
  • Strong asynchronous communication
Good to have:
  • Familiarity with network observability tools

Job Details

An overview of this role

GitLab is a complete DevOps platform, delivered as a single application. From project planning and source code management to CI/CD, monitoring, and security, we help teams deliver software faster and more efficiently while strengthening their security and compliance postures.

As an Intermediate Site Reliability Engineer (SRE) at GitLab, you are responsible for keeping all user-facing services and other GitLab production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our operating environments and the GitLab codebase.

GitLab SREs specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.

What you’ll do  

  • Design and implement a highly scalable networking infrastructure to support the needs of current and future GitLab platforms and offerings.
  • Collaborate closely with cross-functional teams and other teams throughout Infrastructure-Platforms on projects to drive GitLab’s future.
  • Respond to incidents on an on-call rotation (our team is distributed globally, so you are only on call during your daytime hours!) and participate in incident review.
  • Lead initiatives through problem definition, scoping, design, and project management.
  • Act as subject matter experts within the GitLab Infrastructure-Platforms department, specializing in knowledge of our networking and rate limiting services.
  • Automate every operational task.

 

What you’ll bring 

  • Google Cloud Platform expertise, specifically around networking (VPCs, subnets, load balancers), GKE configuration, and scaling.
  • Experience with Terraform infrastructure as code.
  • Experience with configuration management tools such as Ansible and Chef.
  • Experience with the Kubernetes ecosystem, including Helm.
  • Programming skills and professional experience in Ruby or Go.
  • Understanding of network protocols (TCP/IP, HTTP/HTTPS, DNS)
  • Familiarity with network observability tools and traffic analysis
  • Comfortable with scripting languages (Ruby, Go, Bash) for automation
  • Experience with GitLab CI or equivalent
  • Ability to clearly define problems and think beyond initial solutions, looking at how to make things better in the future.
  • A drive for automating everything.
  • Ability to be a manager of one and have a strong bias for action.
  • An independent, proactive, and self-organized mindset.
  • Strong ability to clearly communicate asynchronously.
  • Excitement to be doing something different every day from project work to production change requests to emergency response.

About the team

The Production Engineering Foundations team owns the networking infrastructure for GitLab from edge to ingress. Running the largest GitLab instance in existence (and in fact, one of the largest single-tenancy open-source SaaS sites on the Internet) means we are constantly faced with unique and rewarding challenges that directly impact our users every day. Our future is all about increasing automation and enabling other teams by building paved roads for things like rate limiting and edge networks, so we can continue to scale even bigger with enterprise-level expectations around reliability and availability. Thanks to our Transparency value, you can see how we work on our team page. You can even see what we’re working on right now.


Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote, however some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location after starting the recruiting process.  

Privacy Policy: Please review our Recruitment Privacy Policy. Your privacy is important to us.

GitLab is proud to be an equal opportunity workplace and is an affirmative action employer. GitLab’s policies and practices relating to recruitment, employment, career development and advancement, promotion, and retirement are based solely on merit, regardless of race, color, religion, ancestry, sex (including pregnancy, lactation, sexual orientation, gender identity, or gender expression), national origin, age, citizenship, marital status, mental or physical disability, genetic information (including family medical history), discharge status from the military, protected veteran status (which includes disabled veterans, recently separated veterans, active duty wartime or campaign badge veterans, and Armed Forces service medal veterans), or any other basis protected by law. GitLab will not tolerate discrimination or harassment based on any of these characteristics. See also GitLab’s EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know during the recruiting process.

Similar Jobs

Stem,  Inc  - Tier 2 Support Specialist

Stem, Inc

Broomfield, Colorado, United States (On-Site)
1 Month ago
Palo Alto Networks - Senior Site Reliability Engineer (Cortex Cloud Security Posture Management)

Palo Alto Networks

Santa Clara, California, United States (On-Site)
1 Month ago
EvenUp - Account Executive

EvenUp

Atlanta, Georgia, United States (Remote)
4 Months ago
CyberArk - Principal Account Executive - France

CyberArk

France (On-Site)
3 Months ago
Resolver - Vice President Sales

Resolver

Chicago, Illinois, United States (Remote)
1 Month ago
luxsoft - Solution Architect

luxsoft

Ukrainka, Kyiv Oblast, Ukraine (Remote)
1 Month ago
Nagarro - SAP SuccessFactors Solution Architect (m/f/d)

Nagarro

Germany (Remote)
10 Months ago
London stock Exchange - Devops Engineer

London stock Exchange

Hyderabad, Telangana, India (On-Site)
2 Months ago
Epic Games - Senior DevOps Engineer

Epic Games

(On-Site)
4 Months ago
Simcorp - Senior Site Reliability Engineer

Simcorp

Mexico City, Mexico (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

GoMotive - UK SDR Associate Manager

GoMotive

United Kingdom (Hybrid)
3 Months ago
Domo - Apps and Connector Engineer

Domo

Tokyo, Japan (On-Site)
2 Months ago
EveryMatrix - Studio Technician

EveryMatrix

Batumi, Adjara, Georgia (On-Site)
10 Months ago
extreme network - Manager, QA Engineering

extreme network

San Jose, California, United States (On-Site)
5 Months ago
Vimeo - Revenue Planning & Performance Manager

Vimeo

New York, United States (On-Site)
1 Month ago
Miro - Scaled Customer Success Manager, Japan

Miro

Tokyo, Japan (On-Site)
1 Month ago
WebFX - AI Digital Marketing Specialist

WebFX

United States (Remote)
3 Months ago
Britive - Enterprise Sales Development Representative

Britive

Boston, Massachusetts, United States (On-Site)
3 Months ago
ChainGuard - Enterprise Account Executive

ChainGuard

Minnesota, United States (Remote)
1 Month ago
Lurkit - Customer Success Manager (NA time zone) - Remote

Lurkit

Stockholm, Stockholm County, Sweden (Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Canada

Scanline VFX - Senior Compositor

Scanline VFX

Montreal, Quebec, Canada (Hybrid)
4 Months ago
Amber - Localization Quality Assurance with Spanish (LATAM)

Amber

Quebec, Canada (On-Site)
6 Months ago
bounteous - Murex Integration Developer

bounteous

Montreal, Quebec, Canada (On-Site)
1 Month ago
Evolution  - Customer Service - Korean Speaking Online Game Presenter

Evolution

Burnaby, British Columbia, Canada (On-Site)
2 Months ago
CAE - Strategic Sourcing Specialist – Human Resources, Communications & Marketing

CAE

Montreal, Quebec, Canada (On-Site)
2 Months ago
Nagarro - Staff Engineer, Java Fullstack

Nagarro

Canada (Remote)
10 Months ago
yellow brick games - Gameplay Programmer

yellow brick games

Montreal, Quebec, Canada (Remote)
3 Months ago
Alphawave Semi - Senior Manager - Custom Layout Serdes

Alphawave Semi

Vancouver, British Columbia, Canada (On-Site)
2 Months ago
Cineplex - Line Cook

Cineplex

Mississauga, Ontario, Canada (On-Site)
2 Months ago
Ubisoft - Accountant

Ubisoft

Montreal, Quebec, Canada (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Intel  - Sr. Infrastructure Engineer - Windows OS

Intel

Hillsboro, Oregon, United States (On-Site)
3 Months ago
Sonar Source - Sales Solutions Engineer - EMEA

Sonar Source

London, England, United Kingdom (On-Site)
1 Year ago
e2 open - Senior Solution Architect

e2 open

United States (On-Site)
1 Month ago
Sonar Source - Senior Platform Engineer – Developer Experience

Sonar Source

Austin, Texas, United States (Hybrid)
2 Months ago
Veeam Software - Platform Engineer

Veeam Software

California, United States (Remote)
1 Month ago
AccelData - Senior Platform Engineer

AccelData

Bengaluru, Karnataka, India (On-Site)
10 Months ago
Notion - Customer Experience (CX) Automation Engineer

Notion

San Francisco, California, United States (On-Site)
3 Months ago
Glean - Solutions Engineer - East

Glean

United States (Remote)
1 Month ago
Capgemini - Ansible Automation Engineer

Capgemini

Pune, Maharashtra, India (On-Site)
2 Months ago
Tesla - Automation Engineer

Tesla

Brandenburg, Germany (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded