Senior Platform Engineer - Observability

All levels • DevOps

Job Summary

Job Description

We are seeking a highly experienced Platform Engineer specializing in Observability, focusing on the open-source Grafana stack. This role involves managing the full lifecycle of our observability platform, ensuring robust monitoring, logging, tracing, and profiling for Kubernetes applications. Key responsibilities include architecting, implementing, and continuously improving the observability pipeline, collaborating with development teams, and ensuring security and compliance. The ideal candidate will have extensive experience with Kubernetes, the Grafana stack, cloud environments, and automation tools like Terraform and Helm.
Must have:
  • Manage observability platform lifecycle for Kubernetes applications.
  • Implement OpenTelemetry and manage related tooling.
  • Architect and manage Grafana-based observability stack.
  • Ensure comprehensive monitoring, logging, tracing for microservices.
  • Collaborate on dashboards, alerts, and automated incident responses.
  • Continuously improve observability platform scalability and reliability.
  • Support teams in adopting instrumentation and monitoring best practices.
  • Implement automation and IaC for observability infrastructure.
  • Integrate observability tooling with cloud and on-premise services.
  • Ensure security and compliance within the observability stack.
  • Extensive experience with Kubernetes and containerized observability.
  • Deep knowledge of Grafana stack (Mimir, Loki, Tempo, OpenTelemetry).
  • Experience building and managing cloud observability pipelines.
  • Proficiency with SaaS observability platforms like New Relic.
  • Strong automation skills using Terraform and Helm.
  • Proficient in Node, Python, Go, or Shell scripting.
  • Experience supporting production monitoring and root cause analysis.
Good to have:
  • AWS, Azure, or GCP certifications
Perks:
  • 18 weeks paid parental leave (primary/secondary carers)
  • Access to 'Employee Exclusives' program
  • Digital newspaper subscription to mastheads
  • Annual gift voucher for Stan subscription

Job Details

We are seeking a highly experienced Platform Engineer who specialises in Observability, with a primary focus on the open-source Grafana observability stack. In this role, you will be instrumental in managing the lifecycle of our observability platform, ensuring robust monitoring, logging, tracing and profiling for our applications running on Kubernetes. You will contribute to the architecture, implementation, and continuous improvement of our observability pipeline, enabling teams to monitor and optimise system performance efficiently. Key responsibilities include:

  • Implement OpenTelemetry within application codebases and manage OTel tooling and services.
  • Architect, implement, and manage an observability stack based on Grafana, Prometheus, Loki, Mimir, Tempo, and other related technologies within a Kubernetes environment.
  • Ensure comprehensive monitoring, logging, and tracing coverage for microservices and Kubernetes clusters.
  • Collaborate with development and platform teams to create meaningful dashboards, alerts, and automated incident responses.
  • Continuously improve the observability platform for scalability, multi-tenancy, and reliability.
  • Support and mentor teams in adopting best practices for instrumentation and monitoring.
  • Implement automation and infrastructure-as-code practices for managing observability infrastructure using Terraform, Helm, and CI/CD pipelines.
  • Integrate observability tooling with other cloud services and on-premise infrastructure as required.
  • Ensure security and compliance standards are met, focusing on auditability and data integrity within the observability stack.

Qualifications

What you'll bring:

You will have a strong passion for observability and a “customer first” mentality, and you will be comfortable assisting developers of all levels. You will also have excellent problem-solving and troubleshooting skills.

  • Extensive experience working with Kubernetes, particularly in managing observability for containerised applications.
  • Deep knowledge of the open-source Grafana stack, including Mimir, Loki, Tempo, and OpenTelemetry Collectors.
  • Experience building and managing observability pipelines in a cloud environment (AWS, GCP, or Azure).
  • Experience using SaaS-based observability platforms such as New Relic.
  • Strong automation skills and experience with IaC tools such as Terraform and Helm.
  • Proficient in scripting and programming languages such as Node, Python, Go, or Shell.
  • A customer-first mentality, with strong problem-solving and troubleshooting skills.
  • Experience supporting development teams with production monitoring and root cause analysis.
  • AWS, Azure, or GCP certifications are highly regarded.

Additional Information

How we work

At Nine, our flexible work options vary by role and team. Depending on the position, this may include flexible hours, hybrid work, or part-time arrangements. We welcome discussing your flexibility needs during the hiring process - just ask the Talent Acquisition team.
