Intermediate Site Reliability Engineer, Foundations

2 Months ago • All levels • Devops

Job Summary

Job Description

As an Intermediate Site Reliability Engineer (SRE) at GitLab, you will ensure the smooth operation of user-facing services and GitLab production systems. Responsibilities include designing and implementing scalable networking infrastructure, collaborating with cross-functional teams, responding to incidents, leading initiatives, acting as a subject matter expert in networking and rate limiting, and automating operational tasks. The role requires expertise in Google Cloud Platform, Terraform, configuration management tools, and the Kubernetes ecosystem, along with programming skills in Ruby or Go.
Must have:
  • Google Cloud Platform expertise, specifically around networking.
  • Experience with Terraform infrastructure as code.
  • Experience with configuration management tools.
  • Experience with the Kubernetes ecosystem, including Helm.
  • Programming skills in Ruby or Go.
  • Understanding of network protocols.

Job Details

GitLab is an open core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating the rate of human progress. This mission is integral to our culture, influencing how we hire, build products, and lead our industry. We make this possible at GitLab by running our operations on our product and staying aligned with our values. Learn more about Life at GitLab.

Thanks to products like Duo Enterprise, and Duo Workflow, customers get the benefit of AI at every stage of the SDLC. The same principles built into our products are reflected in how our team works: we embrace AI as a core productivity multiplier. All team members are encouraged and expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact across our global organisation.

An overview of this role

GitLab is a complete DevOps platform, delivered as a single application. From project planning and source code management to CI/CD, monitoring, and security, we help teams deliver software faster and more efficiently while strengthening their security and compliance postures.

As an Intermediate Site Reliability Engineer (SRE) at GitLab, you are responsible for keeping all user-facing services and other GitLab production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our operating environments and the GitLab codebase.

GitLab SREs specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.

What you’ll do  

  • Design and implement a highly scalable networking infrastructure to support the needs of current and future GitLab platforms and offerings.
  • Collaborate closely with cross-functional teams and other teams throughout Infrastructure-Platforms on projects to drive GitLab’s future.
  • Respond to incidents on an on-call rotation (our team is distributed globally, so you are only on call during your daytime hours!) and participate in incident review.
  • Lead initiatives through problem definition, scoping, design, and project management.
  • Act as subject matter experts within the GitLab Infrastructure-Platforms department, specializing in knowledge of our networking and rate limiting services.
  • Automate every operational task.

 

What you’ll bring 

  • Google Cloud Platform expertise, specifically around networking (VPCs, subnets, load balancers), GKE configuration, and scaling.
  • Experience with Terraform infrastructure as code.
  • Experience with configuration management tools such as Ansible and Chef.
  • Experience with the Kubernetes ecosystem, including Helm.
  • Programming skills and professional experience in Ruby or Go.
  • Understanding of network protocols (TCP/IP, HTTP/HTTPS, DNS)
  • Familiarity with network observability tools and traffic analysis
  • Comfortable with scripting languages (Ruby, Go, Bash) for automation
  • Experience with GitLab CI or equivalent
  • Ability to clearly define problems and think beyond initial solutions, looking at how to make things better in the future.
  • A drive for automating everything.
  • Ability to be a manager of one and have a strong bias for action.
  • An independent, proactive, and self-organized mindset.
  • Strong ability to clearly communicate asynchronously.
  • Excitement to be doing something different every day from project work to production change requests to emergency response.

About the team

The Production Engineering Foundations team owns the networking infrastructure for GitLab from edge to ingress. Running the largest GitLab instance in existence (and in fact, one of the largest single-tenancy open-source SaaS sites on the Internet) means we are constantly faced with unique and rewarding challenges that directly impact our users every day. Our future is all about increasing automation and enabling other teams by building paved roads for things like rate limiting and edge networks, so we can continue to scale even bigger with enterprise-level expectations around reliability and availability. Thanks to our Transparency value, you can see how we work on our team page. You can even see what we’re working on right now.


Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote, however some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location after starting the recruiting process.  

Privacy Policy: Please review our Recruitment Privacy Policy. Your privacy is important to us.

GitLab is proud to be an equal opportunity workplace and is an affirmative action employer. GitLab’s policies and practices relating to recruitment, employment, career development and advancement, promotion, and retirement are based solely on merit, regardless of race, color, religion, ancestry, sex (including pregnancy, lactation, sexual orientation, gender identity, or gender expression), national origin, age, citizenship, marital status, mental or physical disability, genetic information (including family medical history), discharge status from the military, protected veteran status (which includes disabled veterans, recently separated veterans, active duty wartime or campaign badge veterans, and Armed Forces service medal veterans), or any other basis protected by law. GitLab will not tolerate discrimination or harassment based on any of these characteristics. See also GitLab’s EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know during the recruiting process.

Similar Jobs

Bright Edge - Sales Development Representative (Illinois State Students)

Bright Edge

Chicago, Illinois, United States (On-Site)
9 Months ago
Toast - Software Engineering Manager II - Funds Management

Toast

Dublin, County Dublin, Ireland (Hybrid)
2 Weeks ago
Neolytix - Lead Development Representative (Healthcare Services)

Neolytix

Chicago, Illinois, United States (Hybrid)
2 Weeks ago
NCR Voyix - Software Engineer III / Java Full Stack Developer

NCR Voyix

Chennai, Tamil Nadu, India (On-Site)
1 Month ago
Airbyte - Mid Market Account Executive

Airbyte

New York, New York, United States (On-Site)
2 Months ago
Figma - Software Engineer, Mobile Platform

Figma

San Francisco, California, United States (Remote)
1 Month ago
Nagarro - Senior Staff Engineer, DevOps

Nagarro

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)
9 Months ago
luxsoft - Azure/AzureML Engineer

luxsoft

Pune, Maharashtra, India (On-Site)
2 Weeks ago
Apple - Sr. Infrastructure Software Engineer (ASE Infra Hardware)

Apple

Cupertino, California, United States (On-Site)
1 Month ago
Loft Orbital - Space Infrastructure Software Engineer

Loft Orbital

San Francisco, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

reversing labs  - Senior Full Stack Software Engineer

reversing labs

Ireland (Remote)
3 Months ago
Optiv - Principal Analyst

Optiv

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Glean - Technical Support Manager

Glean

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Aristocrat - Solutions Architect

Aristocrat

London, England, United Kingdom (Hybrid)
4 Months ago
Sonar Source - Sales Manager, Enterprise Expansion

Sonar Source

Austin, Texas, United States (On-Site)
9 Months ago
PwC - Senior AI Developer - Roma [DIG]

PwC

Rome, Lazio, Italy (On-Site)
9 Months ago
Tekion Corp - Program Manager II

Tekion Corp

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Sprinkler - Technical Support Engineer - Lead

Sprinkler

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Salesforce - Growth Business Account Executive - Bilingual (English/French)

Salesforce

Montreal, Quebec, Canada (Remote)
6 Months ago
undefined - Customer Success Manager, West

United States (Remote)
9 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Canada

Side - Swedish Localization Video Game QA Tester

Side

Montreal, Quebec, Canada (On-Site)
1 Month ago
Behaviour Interactive - Senior Sound Designer

Behaviour Interactive

Montreal, Quebec, Canada (Hybrid)
2 Weeks ago
AECOM - Senior Civil Engineer

AECOM

Victoria, British Columbia, Canada (On-Site)
1 Month ago
Expedia - Senior Manager, Software Development Engineering, Partner Connectivity

Expedia

Montreal, Quebec, Canada (On-Site)
1 Month ago
Electronic Arts - Sr. Software Engineer - AdTech (SDK / Rendering)

Electronic Arts

Vancouver, British Columbia, Canada (Hybrid)
1 Month ago
Highspot - Principal Software Development Engineer, Engineering Excellence

Highspot

Vancouver, British Columbia, Canada (Hybrid)
2 Weeks ago
Maxis Studios - Full Stack Software Engineer - Web Applications

Maxis Studios

Vancouver, British Columbia, Canada (Hybrid)
2 Weeks ago
Boomi  - Manager, Commercial Sales Engineering

Boomi

Vancouver, British Columbia, Canada (Hybrid)
1 Month ago
Airlab Inc  - Junior Programmer Artificial Intelligence

Airlab Inc

Quebec, Canada (On-Site)
3 Months ago
Unity - Senior Machine Learning/MLOps Developer

Unity

Montreal, Quebec, Canada (On-Site)
9 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

luxsoft - Solution Architect

luxsoft

India (Remote)
3 Weeks ago
PhonePe - Server Administrator (Devops and Linux)

PhonePe

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Axon - Sr. Solutions Architect, Fusus

Axon

Atlanta, Georgia, United States (Hybrid)
1 Month ago
Nice - Senior Software Engineer (.Net, AWS)

Nice

Pune, Maharashtra, India (Hybrid)
2 Weeks ago
Globalization Partners - Principal Solution Architect

Globalization Partners

United States (Remote)
2 Months ago
Nice - Cloud Operations Engineer

Nice

Hoboken, New Jersey, United States (On-Site)
2 Weeks ago
Jane Street - Cross-Platform Software Engineer

Jane Street

New York, United States (Hybrid)
2 Months ago
Zazz - Cloud Engineer (Azure)

Zazz

(Remote)
5 Months ago
Rackspace Technology - Machine Learning Architect (AWS)

Rackspace Technology

San Diego, California, United States (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded