Principal Site Reliability Engineer (Azure)

28 Minutes ago • 10 Years +
Devops

Job Description

As a Principal Site Reliability Engineer (SRE) at SimCorp, you will be a technical authority guiding the Azure-based SaaS platform and Site Reliability strategy. This role involves leading complex initiatives in platform reliability, observability, and incident response, while advancing cloud capabilities. You will architect scalable systems, mentor engineers, and shape Azure governance models, enhancing platform efficiency and client experience. Your efforts will support SimCorp's transformation into a cloud-native SaaS provider with optimized, secure, and automated services.
Must Have:
  • Act as the technical lead on SRE initiatives across multiple Product Areas
  • Implement strategic use of Microsoft Azure in onboarding and site reliability disciplines
  • Architect scalable, secure, and automated solutions for client onboarding and live operations
  • Lead the design and evolution of cross-cutting platform capabilities (e.g., observability, CI/CD pipelines, IaC standards, DR frameworks)
  • Shape and govern Azure implementation patterns to ensure platform standardization, reliability, and cost-efficiency
  • Solve the most complex and business-critical reliability challenges involving distributed cloud systems
  • Advise engineering leads and product owners on cloud platform decisions, including trade-offs and risk mitigation
  • Collaborate with Information Security, Platform Engineering, and Architecture teams on compliance and cloud controls
  • Guide the definition of SLOs, SLIs, and other reliability metrics across departments
  • Lead root cause analysis, major incident postmortems, and reliability retrospectives across teams
  • Provide thought leadership, mentoring, and coaching to senior and lead engineers
  • Build communities of practice to support SRE principles and knowledge sharing within the organization
  • Represent the SRE function in executive-level planning, roadmap definition, and technical due diligence
  • Contribute to SimCorp’s overall transformation into a SaaS-first, cloud-native company
  • 10+ years of experience in Site Reliability Engineering, Cloud Infrastructure, or Platform Architecture roles
  • Proficient expertise in Microsoft Azure, including architecture, deployment, automation, and cost optimization
  • Strong grasp of cloud-native and hybrid architectures, distributed systems, networking, and security
  • Mastery in Infrastructure as Code (IaC) using Terraform, ARM, Bicep, and related tooling
  • Deep knowledge of observability stacks (Azure Monitor, Log Analytics, Grafana, Application Insights)
  • Experience leading complex incident and problem management efforts at scale
  • Broad technical skillset including Kubernetes, Docker, CI/CD pipelines, SQL, APIs, and scripting
  • Solid foundation in ITIL processes with a strategic mindset for operational excellence
Perks:
  • Attractive salary
  • Bonus scheme
  • Pension
  • Good work and life balance
  • Flexible working hours
  • Hybrid workplace model (2x a week in office)
  • IP sprints (3 weeks per quarter for skill development and company development)
  • Personalized approach to professional development

Add these skills to join the top 1% applicants for this job

saas-business-models
risk-management
risk-mitigation
talent-acquisition
game-texts
live-operations
networking
incident-response
azure
grafana
terraform
microsoft-azure
ci-cd
docker
kubernetes
sql

WHAT MAKES US, US

Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology. If you are an innovative, curious, collaborative person who embraces challenges and wants to grow, learn and pursue outcomes with our prestigious financial clients, say Hello to SimCorp!

At its foundation, SimCorp is guided by our values — caring, customer success-driven, collaborative, curious, and courageous. Our people-centered organization focuses on skills development, relationship building, and client success. We take pride in cultivating an environment where all team members can grow, feel heard, valued, and empowered.

If you like what we’re saying, keep reading!

Why is this role important?

As a Principal Site Reliability Engineer (SRE), you will act as a technical authority across one or more Product Areas, guiding the direction of our Azure-based SaaS platform and Site Reliability strategy. You will lead complex initiatives related to platform reliability, observability, onboarding automation, and incident response across product boundaries, while advancing our overall cloud capability.

This role goes beyond hands-on engineering. You will play a key architectural and leadership role in designing scalable systems, mentoring senior engineers, and shaping our Azure governance models. With deep expertise in SRE and ITIL practices, you will elevate platform efficiency and client experience across both onboarding and ongoing operations.

Your efforts will enhance departmental collaboration, supporting SimCorp in becoming a fully cloud-native SaaS provider with optimized, secure, and automated services.

WHAT YOU WILL BE RESPONSIBLE FOR

  • Act as the technical lead on SRE initiatives across multiple Product Areas
  • Implement our strategic use of Microsoft Azure in onboarding and site reliability disciplines
  • Architect scalable, secure, and automated solutions for client onboarding and live operations
  • Lead the design and evolution of cross-cutting platform capabilities (e.g., observability, CI/CD pipelines, IaC standards, DR frameworks)
  • Shape and govern Azure implementation patterns to ensure platform standardization, reliability, and cost-efficiency
  • Solve the most complex and business-critical reliability challenges involving distributed cloud systems
  • Advise engineering leads and product owners on cloud platform decisions, including trade-offs and risk mitigation
  • Collaborate with Information Security, Platform Engineering, and Architecture teams on compliance and cloud controls
  • Guide the definition of SLOs, SLIs, and other reliability metrics across departments
  • Lead root cause analysis, major incident postmortems, and reliability retrospectives across teams
  • Provide thought leadership, mentoring, and coaching to senior and lead engineers
  • Build communities of practice to support SRE principles and knowledge sharing within the organization
  • Represent the SRE function in executive-level planning, roadmap definition, and technical due diligence
  • Contribute to SimCorp’s overall transformation into a SaaS-first, cloud-native company

WHAT WE VALUE

  • Bachelor’s degree or equivalent
  • 10+ years of experience in Site Reliability Engineering, Cloud Infrastructure, or Platform Architecture roles
  • Proficient expertise in Microsoft Azure, including architecture, deployment, automation, and cost optimization
  • Strong grasp of cloud-native and hybrid architectures, distributed systems, networking, and security
  • Mastery in Infrastructure as Code (IaC) using Terraform, ARM, Bicep, and related tooling
  • Deep knowledge of observability stacks (Azure Monitor, Log Analytics, Grafana, Application Insights)
  • Experience leading complex incident and problem management efforts at scale
  • Broad technical skillset including Kubernetes, Docker, CI/CD pipelines, SQL, APIs, and scripting
  • Solid foundation in ITIL processes with a strategic mindset for operational excellence
  • Demonstrated ability to engage senior stakeholders, lead through ambiguity, and align engineering with business needs
  • Experience working in or guiding teams within regulated, security-conscious environments (e.g., financial services)
  • Demonstrated dedication to mentorship, knowledge sharing, and building engineering culture
  • Ability to think strategically while delivering pragmatic, hands-on solutions

Other requirements

  • Flexible between APAC and EMEA shift hours
  • Hybrid working arrangement (2x a week in office)

BENEFITS:

Attractive salary, bonus scheme, and pension are essential for any work agreement. However, in SimCorp, we believe we can offer more. Therefore, in addition to the traditional benefit scheme, we provide a good work and life balance: flexible working hours and a hybrid workplace model. Simcorp follows a global hybrid policy, asking employees to work from the office two days each week while allowing remote work on other days.

On top of that, we have IP sprints where you have 3 weeks per quarter you can spend on developing your skills as well as contributing to the company development. There is never just only one route - we practice a personalized approach to professional development to support the direction you want to take.

NEXT STEP:

Please send us your application in English via our career site as soon as possible, we process incoming applications continually. Please note that only applications sent through our system will be processed. At SimCorp, we recognize that bias can unintentionally occur in the recruitment process. To uphold fairness and equal opportunities for all applicants, we kindly ask you to exclude personal data such as photo, age, or any non-professional information from your application. Thank you for aiding us in our endeavor to mitigate biases in our recruitment process.

For any questions you are welcome to contact xxx, Senior Talent Acquisition Partner, at email address. If you are interested in being a part of SimCorp but are not sure this role is suitable, submit your CV anyway. SimCorp is on an exciting growth journey, and our Talent Acquisition Team is ready to assist you discover the right role for you. The approximate time to consider your CV is three weeks.

We are eager to continually improve our talent acquisition process and make everyone’s experience positive and valuable. Therefore, during the process we will ask you to provide your feedback, which is highly appreciated.

WHO WE ARE:

For over 50 years, we have worked closely with investment and asset managers to become the world’s leading provider of integrated investment management solutions. We are 3,000+ colleagues with a broad range of nationalities, educations, professional experiences, ages, and backgrounds in general.

SimCorp is an independent subsidiary of the Deutsche Börse Group. Following the recent merger with Axioma, we leverage the combined strength of our brands to provide an industry-leading, full, front-to-back offering for our clients.

SimCorp is an equal opportunity employer and welcome applicants from all backgrounds, without regard to race, gender, age, disability, or any other protected status under applicable law. We are committed to building a culture where diverse perspectives and expertise are integrated into our everyday work. We believe in the continual growth and development of our employees, so that we can provide best-in-class solutions to our clients.

SimCorp Manila proudly announces that its Manila Delivery Center has been officially certified as a Great Place To Work for the second consecutive year – Apr25-Apr26 This certification underscores SimCorp's effort to cultivating a workplace that is not only inclusive and collaborative but also committed to the personal and professional growth of its employees

We are also honored to have been voted as a WealthTech100 company for three consecutive years. The new WealthTech100 list aims to highlight tech innovation leaders in the investment management industry.

#Li-Hybrid

Set alerts for more jobs like Principal Site Reliability Engineer (Azure)
Set alerts for new jobs by Simcorp
Set alerts for new Devops jobs in Philippines
Set alerts for new jobs in Philippines
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙