Site Reliability Engineer

1 Month ago • 3 Years + • DevOps

About the job

Job Description

Sonar seeks a Site Reliability Engineer with 3+ years of experience in large-scale distributed systems. You'll be responsible for AWS infrastructure, monitoring, incident response, and enhancing observability tools. Strong Python development skills are essential.
Must have:
  • SRE Principles
  • AWS Services
  • Distributed Systems
  • Python Development
Good to have:
  • CloudFormation
  • CDK Python
  • Error Budgets
  • Fault Analysis
Perks:
  • Dynamic Culture
  • Work-Life Balance
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.
Why should I Apply:

At Sonar, we’re a group of brilliant, motivated, and driven professionals working hard to help organizations build responsible, secure, high-quality code quickly and systematically. We build solutions that don’t just solve symptoms of problems – we fix problems at the source – source code, to be specific.

We have a dynamic culture with employees worldwide and hub offices in the USA, Switzerland, the UK, Singapore, and Germany. We believe team members should have the opportunity to come to work every day, work on a product they are proud of, love what they do, and feel energized by their peers. With our roots deep in the open source community, we’re all about the mission: provide solutions that deliver Clean Code.

The Impact you can have

SonarCloud is an online service that eliminates bugs and vulnerabilities, and champions quality code in the software development process. It is already a standard product that extends GitHub, GitLab, Bitbucket, and Azure DevOps. Our goal is to make it the ultimate online automatic code analysis solution and get it adopted by millions of users for millions of projects and billions of lines of code.

The team behind SonarCloud is composed of passionate developers who are progressively re-architecting SonarCloud to a pure cloud-native application composed of multiple services in order to deliver great new features more fluently. As a Site Reliability Engineer, you will implement massively scalable services, automate all systems operations, and measure and continuously optimize service quality. Joining our team in Bochum, Germany, you'll collaborate with a talented group of engineers on Identity and Access Management, Platform Security, and more. If you thrive in a collaborative environment and believe in building exceptional software, apply today and join us on the journey to revolutionize software quality!

On a daily basis, you will

    • Have significant responsibility for working with the Software Engineers, Cloud Platform Engineers, Release Engineers, Security Officers, and other SREs to ensure that SonarCloud reliability and performance meet our customers' needs, while being efficient and with a manageable operational load.
    • Participate in designing and implementing features and observability best practices by having a practical understanding of SonarCloud functionality and business processes.
    • Be fully responsible for operating the AWS infrastructure owned by your squad.
    • Monitor and proactively maintain SonarCloud's Service Levels (availability, scalability, and performance).
    • Respond to and own alerts, troubleshoot issues, and mitigate incidents.
    • Analyze abnormal trends in SLI, breached SLOs, and trigger actions.
    • Enhance the monitoring and observability tools (CFN and CDK python) and own implementation of four golden signals across the SonarCloud platform (Latency, Traffic, Errors and Saturation).
    • Maintain operational documentation.
    • Support the customer-facing support functions.
    • Support deployments and perform technical scheduled maintenance.
    • As part of the role, you will participate in an on-call rotation to provide timely support and address any critical system issues that may arise outside of regular working hours.

The technical skills you will demonstrate

    • You have excellent engineering skills and good computer science fundamentals.
    • You have spent multiple years in software engineering and have at least 3 years of experience in an SRE role, focusing on large-scale distributed systems. You understand SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts.
    • You have proven experience building and supporting large-scale, highly available, distributed, and fault-tolerant systems.
    • You have solid experience with AWS services, including CloudWatch, CloudFormation, ECS, Lambda, RDS, DynamoDB, and more.
    • With an SRE mindset, you apply software engineering to observing and operating production.
    • You have software development experience with Python.
Why you will love it here:

Our culture and mission set us apart. We have a dynamic work culture that values respect and kindness – and embraces the right to fail (and get right back up again!). We believe that the best idea wins and everyone has a voice.
We believe that great people make a great company. We value people skills as much as technical skills and strive to keep things friendly and laid-back while still being passionate leaders in our domains. Our 550+ SonarSourcers from 33 different nationalities can relate!
We embrace work-life balance. It is important to maintain a healthy work-life balance. This is why we have a flexible work policy that includes remote and in-office hybrid work (minimum three days a week in the office - Monday/Tuesday/Thursday).
We have a growth mindset. We love to learn and believe that continuous education is critical to our success. In an ever-changing industry, new skills are a must, and we're happy to help our team acquire them.


We prioritize Diversity, Equity, and Inclusion:

At Sonar, we are a global workforce and recognize the value of different backgrounds, and global cultures.

We are committed to creating a diverse work environment and are proud to be an equal-opportunity employer. All qualified applicants will be considered for employment without regard to race, colour, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

All offers of employment at Sonar are contingent upon the clear results of a comprehensive background check conducted prior to the start date.
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Austin, Texas, United States (Hybrid)

London, England, United Kingdom (On-Site)

London, England, United Kingdom (On-Site)

Bochum, North Rhine-Westphalia, Germany (On-Site)

Geneva, Geneva, Switzerland (On-Site)

London, England, United Kingdom (On-Site)

Austin, Texas, United States (Hybrid)

Geneva, Geneva, Switzerland (On-Site)

London, England, United Kingdom (On-Site)

View All Jobs

Get notified when new jobs are added by Sonar Source

Similar Jobs

Nisum - Mobile Engineer A5828

Nisum, India (Hybrid)

PwC - Azure Data Engineer|Bangalore

PwC, India (On-Site)

Trimble  Inc  - Site Reliability Engineer

Trimble Inc , India (On-Site)

Codeninja - Senior PHP Engineer / Lead

Codeninja, Pakistan (On-Site)

Nisum - Android Developer - A6643

Nisum, India (Hybrid)

Journee - Senior Cloud Infrastructure Engineer

Journee, Germany (Hybrid)

Netskope - Staff Software Engineer, SSPM

Netskope, India (Remote)

Activision - Expert Platform Engineer

Activision, Canada (On-Site)

Publicis Groupe - Senior Manager Infrastructure - DevOps GCP/Azure

Publicis Groupe, India (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Sonar Source - Support Engineer

Sonar Source, Switzerland (On-Site)

Enphase Energy - Solargraf - Devops  Engineer

Enphase Energy, India (On-Site)

Codeninja - Senior PHP Engineer / Lead

Codeninja, Pakistan (On-Site)

Trimble  Inc  - SDET Engineer

Trimble Inc , India (Hybrid)

Enphase Energy - EVSE - Tech Lead FrontEnd Developer

Enphase Energy, India (On-Site)

RapidBrains - iOS/tvOS Developer

RapidBrains, India (Remote)

Get notifed when new similar jobs are uploaded

Jobs in Bochum, North Rhine-Westphalia, Germany

Get notifed when new similar jobs are uploaded

DevOps Jobs

Varonis  - DevOps Engineer

Varonis , United States (On-Site)

JIFFYai - STAFF ENGINEER SRE

JIFFYai, India (Hybrid)

SSC Technologies - Senior Technical Consultant (Riyadh, KSA)

SSC Technologies, Saudi Arabia (On-Site)

Unity - Site Reliability Engineer

Unity, United States (On-Site)

Onit India - Senior DevOps Engineer

Onit India, India (Hybrid)

Clarivate - Senior Data Engineer

Clarivate, India (On-Site)

Procore Technologies - Staff IT Systems Engineer

Procore Technologies, India (On-Site)

Nagarro - Principal Engineer, QA Automation

Nagarro, India (Remote)

Zones - Azure Backend Developer

Zones, India (On-Site)

Get notifed when new similar jobs are uploaded