Site Reliability Engineer III (Tue - Sat)

34 Minutes ago • All levels
Devops

Job Description

CME Group is seeking a SRE III (Tuesday - Saturday) to help, build, operate and scale systems in our Markets portfolio. Markets SREs work on products and applications related to CME’s Globex trading platform. Our systems deliver an exceptional combination of low-latency performance and rock-solid reliability to seamlessly handle the world’s busiest trading days. The successful candidate will work alongside senior engineers to learn how we observe, monitor, automate, and improve Production service reliability and act as a mentor to junior colleagues. He/she will have a keen interest in SRE and enjoy the cut-and-thrust of operating Production systems. They will be a strong communicator, and may have previously worked in an SRE role, a software engineering role or a systems engineering role.
Good To Have:
  • Experience with metrics & monitoring, OpenTelemetry, Splunk, Prometheus, Grafana, etc.
  • Experience and knowledge of working with distributed systems
  • Experience with Kubernetes
  • Knowledge of networking (HTTP/TCP/UDP/IP).
  • Experience in Financial markets.
  • Experience working in an agile environment.
Must Have:
  • Participate in building observability, monitoring and alerting for key services
  • Collaborate with senior engineers and product teams to ensure requirements are mutually understood, planned carefully and implemented safely
  • Lead discussions for own work and present solution options and proposals
  • Participate in incident response and management
  • Participate in on-call rotation
  • Identify toil and reduce through automation
  • Contribute to DR and systems resiliency testing & improvements
  • Contribute own ideas and reliability improvement suggestions to the Product backlog
  • Support the migration of markets applications to Google Cloud Platform (GCP)
  • Act as a mentor to L2 and L1 SRE colleagues
  • Experience with Linux-based systems
  • Understanding of application architectures and messaging protocols
  • Competent programming/scripting skills (Python, Bash, etc.)
  • Strong problem-solving and analytical abilities.
  • Excellent communication and teamwork skills.
  • Eagerness to learn and adapt in a fast-paced trading environment.
Perks:
  • Bonus Programme
  • Equity Programme
  • Employee Stock Purchase Plan (ESPP)
  • Private Medical and Dental coverage
  • Mental Health Benefit Programme
  • Group Pension Plan
  • Income Protection
  • Life Assurance
  • Cycle To Work
  • EV Car Benefit Scheme
  • Gym Membership
  • Family Leave
  • Education Assistance – MBA/Advanced Degree/Bachelor Degree
  • Ongoing Employee Development Training/Certification
  • Hybrid Working

Add these skills to join the top 1% applicants for this job

team-management
communication
game-texts
agile-development
networking
linux
incident-response
prometheus
grafana
google-cloud-platform
kubernetes
python
splunk
bash

Markets SREs work on products and applications related to CME’s Globex trading platform. Our systems deliver an exceptional combination of low-latency performance and rock-solid reliability to seamlessly handle the world’s busiest trading days.

The successful candidate will work alongside senior engineers to learn how we observe, monitor, automate, and improve Production service reliability and act as a mentor to junior colleagues. He/she will have a keen interest in SRE and enjoy the cut-and-thrust of operating Production systems. They will be a strong communicator, and may have previously worked in an SRE role, a software engineering role or a systems engineering role.

Key Responsibilities:

  • Participate in building observability, monitoring and alerting for key services - continuously improving our SLI & SLOs and observability data enabling faster issue detection and incident resolution
  • Collaborate with senior engineers and product teams to ensure requirements are mutually understood, planned carefully and implemented safely
  • Lead discussions for own work and present solution options and proposals
  • Participate in incident response and management - engages with urgency in live incidents, takes ownership for minor incidents, ensures system recovery and contributes to post-mortems afterwards
  • Participate in on-call rotation
  • Identify toil and reduce through automation
  • Contribute to DR and systems resiliency testing & improvements
  • Contribute own ideas and reliability improvement suggestions to the Product backlog
  • Support the migration of markets applications to Google Cloud Platform (GCP)
  • Act as a mentor to L2 and L1 SRE colleagues

What We’re Looking for:

  • Experience with Linux-based systems
  • Experience with Cloud-based platform(s) - Google Cloud Platform, GCE, and/or GKE a bonus
  • Understanding of application architectures and messaging protocols
  • Competent programming/scripting skills (Python, Bash, etc.)
  • Strong problem-solving and analytical abilities.
  • Excellent communication and teamwork skills.
  • Eagerness to learn and adapt in a fast-paced trading environment.

Desirable

  • Experience with metrics & monitoring, OpenTelemetry, Splunk, Prometheus, Grafana, etc.
  • Experience and knowledge of working with distributed systems
  • Experience with Kubernetes
  • Knowledge of networking (HTTP/TCP/UDP/IP).
  • Experience in Financial markets.
  • Experience working in an agile environment.

Why Us:

  • Be part of a global leader in financial services technology.
  • Work on cutting-edge technology in a collaborative and innovative culture.
  • Competitive compensation and benefits package.
  • Opportunity to grow and advance your career in SRE with an organisation who is transforming to this approach

Join us and play a crucial role in ensuring the stability and performance of our Markets applications while contributing to the migration to Google Cloud Platform. Apply now to be a part of our dynamic SRE team!

Company Benefits:

  • Bonus Programme
  • Equity Programme
  • Employee Stock Purchase Plan (ESPP)
  • Private Medical and Dental coverage
  • Mental Health Benefit Programme
  • Group Pension Plan
  • Income Protection
  • Life Assurance
  • Cycle To Work
  • EV Car Benefit Scheme
  • Gym Membership
  • Family Leave
  • Education Assistance – MBA/Advanced Degree/Bachelor Degree
  • Ongoing Employee Development Training/Certification
  • Hybrid Working

#LI-RK2

#LI-Hybrid

#nijobs.com

Set alerts for more jobs like Site Reliability Engineer III (Tue - Sat)
Set alerts for new jobs by CME Group
Set alerts for new Devops jobs in United Kingdom
Set alerts for new jobs in United Kingdom
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙