Site Reliability Engineer

4+ years of experience
DevOps

Job Description

As a Site Reliability Engineer at Thales ODC, you will bridge development and operations, ensuring the reliability, scalability, and performance of cloud-native eSIM applications on Google Cloud Platform. You will develop and maintain Infrastructure-as-Code, automate pipelines, manage monitoring, and support end-to-end deployments, contributing to a secure and seamless digital world.
Good To Have:
  • GCP and/or Kubernetes certifications
  • Experience in the Telecom domain or with eSIM technologies
  • Familiarity with Scrum, incident management, and service delivery frameworks
Must Have:
  • Work in a DevOps environment, supporting ODC products on Google Cloud Platform (GCP).
  • Apply SRE principles for reliability and automation.
  • Take ownership of applications and infrastructure, ensuring availability, performance, and scalability.
  • Develop and maintain Infrastructure-as-Code (IaC) and automation pipelines (Terraform, Helm, GitLab CI/CD).
  • Collaborate with development teams to enhance product reliability.
  • Act as a key technical voice during Change Advisory Board (CAB) reviews.
  • Manage real-time system monitoring and drive improvements in observability and alerting.
  • Support end-to-end deployment of ODC applications in the cloud.
  • Perform operational readiness reviews and shape product roadmaps.
  • Guide technical documentation, incident response processes, and SLA compliance.
  • Participate in on-call support rotations (24/7).
  • Degree in Computer Science or a related technical field.
  • 4+ years of experience with application deployment, system operations, and cloud infrastructure.
  • Proficiency with GCP services and Kubernetes in production environments.
  • Proficiency with automation tools like GitLab, Terraform, Helm.
  • Proficiency with Linux systems, networking, and HTTP(S)/TCP/IP protocols.
  • Proficiency with scripting languages like Shell or Python.
  • Experience with Agile methodologies and a DevOps/SRE mindset.
  • Strong understanding of CI/CD pipelines, system integration, and monitoring tools.
  • Ability to translate business requirements into scalable technical solutions.

Thales people architect identity management and data protection solutions at the heart of digital security. Businesses and governments rely on us to bring trust to the billions of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter, and much more. More than 30,000 organizations already rely on us to verify the identities of people and things, grant access to digital services, analyze vast quantities of information, and encrypt data to make the connected world more secure.

Thales in the Czech Republic employs over 400 people of 45 different nationalities. A total of 15 teams work on projects for government agencies, banking, mobile services, and Internet of Things (IoT) technology. At the core of our business is the development of software that we configure and embed in a multitude of devices and form factors. These include many kinds of payment cards, SIM cards, travel passes, secure eBanking devices, authentication tokens, machine identification modules (MIM), and secure ID documents including ePassports, eID and eHealth cards, as well as eDriving licenses. Given the international environment that surrounds us every day, it comes as no surprise that English is our official corporate language.

This position follows a hybrid working model.

At Thales, trust is the foundation of the digital world. Our Cybersecurity and Digital Identity division helps businesses and governments protect identities and secure digital interactions. Within DIS, the On-Demand Connectivity (ODC) business line plays a pivotal role in powering secure eSIM technology, enabling seamless, remote connectivity for devices around the globe.

With eSIM solutions, you would help companies deploy IoT devices, manage remote workforces, and launch connected products without the delays and logistics of physical SIM cards. Imagine logistics fleets updating routes on the fly, smart factories remotely optimizing production lines, or wearables connecting instantly to local networks during international travel, all with zero physical intervention.

We’re looking for a Site Reliability Engineer (SRE) who thrives in both application-level problem-solving and infrastructure design. This role is about more than just keeping systems running — it’s about applying software engineering practices to make our ODC applications more reliable, scalable, and performant in a cloud-native, production-grade environment.

What You’ll Do:

As part of the SRE team within Thales ODC, you'll be instrumental in bridging the gap between application development and operations. You will:

  • Work in a DevOps environment, supporting ODC products running on Google Cloud Platform (GCP), with a focus on applying SRE principles for reliability and automation.
  • Take ownership of both applications and infrastructure, ensuring the availability, performance, and scalability of our cloud-based services.
  • Develop and maintain Infrastructure-as-Code (IaC) and automation pipelines using tools like Terraform, Helm, and GitLab CI/CD.
  • Collaborate with development teams to enhance product reliability using a “you build it, you run it” mindset.
  • Act as a key technical voice during Change Advisory Board (CAB) reviews for high-impact changes, providing Tier II expertise.
  • Manage real-time system monitoring and drive continuous improvements in observability and alerting.
  • Support the end-to-end deployment of ODC applications in the cloud — from design and configuration to rollout and performance tuning.
  • Perform operational readiness reviews and shape product roadmaps with a focus on stability and scalability.
  • Guide technical documentation, incident response processes, and SLA compliance across tiers.
  • Participate in on-call support rotations (24/7) to ensure production systems remain resilient.

What You Bring:

  • A degree in Computer Science or a related technical field.
  • 4+ years of experience with application deployment, system operations, and cloud infrastructure.
  • Proficiency with:
      • GCP services and Kubernetes in production environments
      • Automation tools like GitLab, Terraform, Helm
      • Linux systems, networking, and HTTP(S)/TCP/IP protocols
      • Scripting languages like Shell or Python
  • Experience with Agile methodologies and a DevOps/SRE mindset.
  • A strong understanding of CI/CD pipelines, system integration, and monitoring tools.
  • Ability to translate business requirements into scalable technical solutions.

Preferred qualifications:

  • GCP and/or Kubernetes certifications
  • Experience in the Telecom domain or with eSIM technologies
  • Familiarity with Scrum, incident management, and service delivery frameworks

At Thales we provide CAREERS, not just jobs. With 80,000 employees in 68 countries, our mobility policy enables thousands of employees each year to develop their careers at home and abroad, in their existing areas of expertise or by branching out into new fields. Together we believe that embracing flexibility is a smarter way of working. Great journeys start here, apply now!
