Principal Site Reliability Engineer

undefined ago • 10 Years + • Devops

Job Summary

Job Description

As a Principal Site Reliability Engineer, you will be a technical authority guiding the Azure-based SaaS platform and Site Reliability strategy. You will lead complex initiatives in platform reliability, observability, onboarding automation, and incident response, while advancing cloud capabilities. This role involves architectural leadership, designing scalable systems, mentoring senior engineers, and shaping Azure governance models. You will enhance platform efficiency and client experience, supporting the company's transformation into a cloud-native SaaS provider with optimized, secure, and automated services.
Must have:
  • Lead SRE initiatives across multiple product areas.
  • Implement strategic use of Microsoft Azure for onboarding and reliability.
  • Architect scalable, secure, and automated solutions for client operations.
  • Lead design of cross-cutting platform capabilities (observability, CI/CD, IaC, DR).
  • Shape and govern Azure implementation patterns for standardization and cost-efficiency.
  • Solve complex, business-critical reliability challenges in distributed cloud systems.
  • Advise engineering leads and product owners on cloud platform decisions.
  • Collaborate with Information Security, Platform Engineering, and Architecture teams.
  • Guide definition of SLOs, SLIs, and other reliability metrics.
  • Lead root cause analysis, major incident postmortems, and reliability retrospectives.
  • Provide thought leadership, mentoring, and coaching to senior engineers.
  • Build communities of practice for SRE principles and knowledge sharing.
  • Represent SRE function in executive-level planning and roadmap definition.
  • Contribute to the company’s transformation into a SaaS-first, cloud-native company.
  • 10+ years experience in SRE, Cloud Infrastructure, or Platform Architecture.
  • Proficient expertise in Microsoft Azure architecture, deployment, automation, and cost optimization.
  • Strong grasp of cloud-native/hybrid architectures, distributed systems, networking, security.
  • Mastery in Infrastructure as Code (IaC) using Terraform, ARM, Bicep.
  • Deep knowledge of observability stacks (Azure Monitor, Log Analytics, Grafana, Application Insights).
  • Experience leading complex incident and problem management at scale.
  • Broad technical skillset including Kubernetes, Docker, CI/CD, SQL, APIs, scripting.
  • Solid foundation in ITIL processes with a strategic mindset.
  • Demonstrated ability to engage senior stakeholders and lead through ambiguity.
  • Experience in regulated, security-conscious environments (e.g., financial services).
  • Dedication to mentorship, knowledge sharing, and building engineering culture.
  • Ability to think strategically while delivering pragmatic, hands-on solutions.
Perks:
  • Attractive salary
  • Executive-level bonus scheme
  • Pension
  • Flexible working hours
  • Hybrid work models
  • Individualized approach to professional growth
  • Access to strategic initiatives and innovation programs

Job Details

WHAT MAKES US, US

Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology. If you are an innovative, curious, collaborative person who embraces challenges and wants to grow, learn, and pursue outcomes with our prestigious financial clients, say Hello!

At its foundation, the company is guided by our values — caring, customer success-driven, collaborative, curious, and courageous. Our people-centered organization focuses on skills development, relationship building, and client success. We take pride in cultivating an environment where all team members can grow, feel heard, valued, and empowered.

If you like what we’re saying, keep reading!

WHY THIS ROLE IS IMPORTANT TO US

As a Principal Site Reliability Engineer (SRE), you will act as a technical authority across one or more Product Areas, guiding the direction of our Azure-based SaaS platform and Site Reliability strategy. You will lead complex initiatives related to platform reliability, observability, onboarding automation, and incident response across product boundaries, while advancing our overall cloud capability.

This role goes beyond hands-on engineering. You will play a key architectural and leadership role in designing scalable systems, mentoring senior engineers, and shaping our Azure governance models. With deep expertise in SRE and ITIL practices, you will elevate platform efficiency and client experience across both onboarding and ongoing operations.

Your efforts will enhance departmental collaboration, supporting the company in becoming a fully cloud-native SaaS provider with optimized, secure, and automated services.

WHAT YOU WILL BE RESPONSIBLE FOR

  • Act as the technical lead on SRE initiatives across multiple Product Areas
  • Implement our strategic use of Microsoft Azure in onboarding and site reliability disciplines
  • Architect scalable, secure, and automated solutions for client onboarding and live operations
  • Lead the design and evolution of cross-cutting platform capabilities (e.g., observability, CI/CD pipelines, IaC standards, DR frameworks)
  • Shape and govern Azure implementation patterns to ensure platform standardization, reliability, and cost-efficiency
  • Solve the most complex and business-critical reliability challenges involving distributed cloud systems
  • Advise engineering leads and product owners on cloud platform decisions, including trade-offs and risk mitigation
  • Collaborate with Information Security, Platform Engineering, and Architecture teams on compliance and cloud controls
  • Guide the definition of SLOs, SLIs, and other reliability metrics across departments
  • Lead root cause analysis, major incident postmortems, and reliability retrospectives across teams
  • Provide thought leadership, mentoring, and coaching to senior and lead engineers
  • Build communities of practice to support SRE principles and knowledge sharing within the organization
  • Represent the SRE function in executive-level planning, roadmap definition, and technical due diligence
  • Contribute to the company’s overall transformation into a SaaS-first, cloud-native company

WHAT WE VALUE

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
  • 10+ years of experience in Site Reliability Engineering, Cloud Infrastructure, or Platform Architecture roles
  • Proficient expertise in Microsoft Azure, including architecture, deployment, automation, and cost optimization
  • Strong grasp of cloud-native and hybrid architectures, distributed systems, networking, and security
  • Mastery in Infrastructure as Code (IaC) using Terraform, ARM, Bicep, and related tooling
  • Deep knowledge of observability stacks (Azure Monitor, Log Analytics, Grafana, Application Insights)
  • Experience leading complex incident and problem management efforts at scale
  • Broad technical skillset including Kubernetes, Docker, CI/CD pipelines, SQL, APIs, and scripting
  • Solid foundation in ITIL processes with a strategic mindset for operational excellence
  • Demonstrated ability to engage senior stakeholders, lead through ambiguity, and align engineering with business needs
  • Experience working in or guiding teams within regulated, security-conscious environments (e.g., financial services)
  • Demonstrated dedication to mentorship, knowledge sharing, and building engineering culture
  • Ability to think strategically while delivering pragmatic, hands-on solutions

BENEFITS

Attractive salary, executive-level bonus scheme, and pension are essential for any work agreement. At the company, we go beyond by offering flexible working hours, hybrid work models, and an individualized approach to professional growth—tailored to your leadership journey.

You will also have access to strategic initiatives and innovation programs across the organization.

NEXT STEP

Please send us your application in English via our career site as soon as possible, we process incoming applications continually. Please note that only applications sent through our system will be processed. At the company, we recognize that bias can unintentionally occur in the recruitment process. To uphold fairness and equal opportunities for all applicants, we kindly ask you to exclude personal data such as photo, age, or any non-professional information from your application. Thank you for aiding us in our endeavor to mitigate biases in our recruitment process.

For any questions you are welcome to contact Paweł Andrzejewski, Talent Acquisition Partner, at pawel.andrzejewski@simcorp.com. If you are interested in being a part of the company but are not sure this role is suitable, submit your CV anyway. The company is on an exciting growth journey, and our Talent Acquisition Team is ready to assist you discover the right role for you. The approximate time to consider your CV is three weeks.

We are eager to continually improve our talent acquisition process and make everyone’s experience positive and valuable. Therefore, during the process we will ask you to provide your feedback, which is highly appreciated.

About Us

The company is a provider of industry-leading integrated investment management solutions for the global buy side. Founded in 1971, with more than 3,000 employees across five continents, the company is a truly global technology leader that empowers more than half of the world’s top 100 financial companies through its integrated platform, services, and partner ecosystem. The company is a subsidiary of Deutsche Börse Group. As of 2024, the company includes Axioma, the leading provider of risk and management and portfolio optimization solutions for the global buy side.

----------------------------------------------------------------

Discover more about our culture, recruitment process, and commitment to promoting meaningful work

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Warsaw, Masovian Voivodeship, Poland

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

SimCorp is a provider of industry-leading integrated investment management solutions for the global buy side. Founded in 1971, with more than 3,000 employees across five continents, SimCorp is a truly global technology leader that empowers more than half of the world’s top 100 financial companies through its integrated platform, services, and partner ecosystem. SimCorp is a subsidiary of Deutsche Börse Group. As of 2024, SimCorp includes Axioma, the leading provider of risk and management and portfolio optimization solutions for the global buy side.

Manila, Metro Manila, Philippines (Hybrid)

Warsaw, Masovian Voivodeship, Poland (On-Site)

Toronto, Ontario, Canada (Hybrid)

Warsaw, Masovian Voivodeship, Poland (Hybrid)

Manila, Metro Manila, Philippines (Hybrid)

Copenhagen, Denmark (Hybrid)

Copenhagen, Denmark (Hybrid)

Copenhagen, Denmark (Hybrid)

View All Jobs

Get notified when new jobs are added by Simcorp

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug