Site Reliability Engineer

2 Weeks ago • 3 Years + • DevOps

About the job

Summary By Outscal

Must have:
  • SRE Principles
  • AWS Services
  • Distributed Systems
  • Python Development
Good to have:
  • CloudFormation
  • CDK Python
  • Error Budgets
  • Fault Analysis
Perks:
  • Dynamic Culture
  • Work-Life Balance
Why should I Apply:

At Sonar, we’re a group of brilliant, motivated, and driven professionals working hard to help organizations build responsible, secure, high-quality code quickly and systematically. We build solutions that don’t just solve symptoms of problems – we fix problems at the source – source code, to be specific.

We have a dynamic culture with employees worldwide and hub offices in the USA, Switzerland, the UK, Singapore, and Germany. We believe team members should have the opportunity to come to work every day, work on a product they are proud of, love what they do, and feel energized by their peers. With our roots deep in the open source community, we’re all about the mission: provide solutions that deliver Clean Code.

The Impact you can have

SonarCloud is an online service that eliminates bugs and vulnerabilities, and champions quality code in the software development process. It is already a standard product that extends GitHub, GitLab, Bitbucket, and Azure DevOps. Our goal is to make it the ultimate online automatic code analysis solution and get it adopted by millions of users for millions of projects and billions of lines of code.

The team behind SonarCloud is composed of passionate developers who are progressively re-architecting SonarCloud to a pure cloud-native application composed of multiple services in order to deliver great new features more fluently. As a Site Reliability Engineer, you will implement massively scalable services, automate all systems operations, and measure and continuously optimize service quality. Joining our team in Bochum, Germany, you'll collaborate with a talented group of engineers on Identity and Access Management, Platform Security, and more. If you thrive in a collaborative environment and believe in building exceptional software, apply today and join us on the journey to revolutionize software quality!

On a daily basis, you will

    • Have significant responsibility for working with the Software Engineers, Cloud Platform Engineers, Release Engineers, Security Officers, and other SREs to ensure that SonarCloud reliability and performance meet our customers' needs, while being efficient and with a manageable operational load.
    • Participate in designing and implementing features and observability best practices by having a practical understanding of SonarCloud functionality and business processes.
    • Be fully responsible for operating the AWS infrastructure owned by your squad.
    • Monitor and proactively maintain SonarCloud's Service Levels (availability, scalability, and performance).
    • Respond to and own alerts, troubleshoot issues, and mitigate incidents.
    • Analyze abnormal trends in SLIs and breached SLOs, and trigger follow-up actions.
    • Enhance the monitoring and observability tools (CloudFormation and CDK Python) and own the implementation of the four golden signals (latency, traffic, errors, and saturation) across the SonarCloud platform.
    • Maintain operational documentation.
    • Support the customer-facing support functions.
    • Support deployments and perform technical scheduled maintenance.
    • Participate in an on-call rotation to provide timely support and address critical system issues that arise outside regular working hours.

The technical skills you will demonstrate

    • You have excellent engineering skills and good computer science fundamentals.
    • You have spent multiple years in software engineering and have at least 3 years of experience in an SRE role, focusing on large-scale distributed systems. You understand SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts.
    • You have proven experience building and supporting large-scale, highly available, distributed, and fault-tolerant systems.
    • You have solid experience with AWS services, including CloudWatch, CloudFormation, ECS, Lambda, RDS, DynamoDB, and more.
    • With an SRE mindset, you apply software engineering to observing and operating production.
    • You have software development experience with Python.
Why you will love it here:

Our culture and mission set us apart. We have a dynamic work culture that values respect and kindness – and embraces the right to fail (and get right back up again!). We believe that the best idea wins and everyone has a voice.
We believe that great people make a great company. We value people skills as much as technical skills and strive to keep things friendly and laid-back while still being passionate leaders in our domains. Our 550+ SonarSourcers from 33 different nationalities can relate!
We embrace work-life balance. It is important to maintain a healthy work-life balance. This is why we have a flexible work policy that includes remote and in-office hybrid work (minimum three days a week in the office - Monday/Tuesday/Thursday).
We have a growth mindset. We love to learn and believe that continuous education is critical to our success. In an ever-changing industry, new skills are a must, and we're happy to help our team acquire them.


We prioritize Diversity, Equity, and Inclusion:

At Sonar, we are a global workforce and recognize the value of different backgrounds and global cultures.

We are committed to creating a diverse work environment and are proud to be an equal-opportunity employer. All qualified applicants will be considered for employment without regard to race, colour, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

All offers of employment at Sonar are contingent upon the clear results of a comprehensive background check conducted prior to the start date.
