Sr. Software Engineer, Observability

Dialpad AI

Job Summary

As a Sr. Software Engineer in Observability at Dialpad, you will develop and maintain the metrics and log collection platform that tracks the health and availability of services. The work includes creating instrumentation, tools, libraries, and documentation for other engineers, defining best practices, and staying up to date with industry trends. You will collaborate with various teams and participate in the on-call rotation within the Infrastructure Engineering division.

Must Have

  • Background in Systems and/or Software Engineering
  • Experience in designing, automating, maintaining, and optimizing observability platforms (logging, metrics, and tracing)
  • Experience with configuration management tools such as Ansible and Terraform
  • Experience with Public Cloud environments such as GCP and AWS
  • Familiarity with languages such as Python, Go, and Rust

Good to Have

  • Previous direct experience with Grafana, Loki, Prometheus
  • Experience with Linux
  • Experience with Kubernetes (including GKE/EKS) and building containerized applications
  • Undergraduate degree in Computer Science or Engineering

Perks & Benefits

  • Competitive benefits and perks
  • Robust training program
  • Inclusive office environment
  • Vibrant environment to cultivate collaboration and connection
  • Exceptional culture
  • Certified Great Place to Work

Job Description

About Dialpad

Dialpad is the leading AI-powered customer communications platform, transforming how businesses communicate with their customers. More than 50,000 companies around the globe — including Netflix, RE/MAX, Uber, Randstad, and Tractor Supply — rely on Dialpad to build stronger customer connections using real-time, AI-driven insights. Visit dialpad.com to learn more.

Being a Dialer

At Dialpad, you’ll be part of a collaborative team working toward our shared mission of making our customers and their employees wildly successful. We believe that every conversation matters, and we're elevating each one with a platform that drives real-time insights and automation for our customers. We thrive on continuous evolution, where every employee leverages industry-leading AI to constantly refine our platform and our own skills. We seek individuals who not only meet our high standards but go beyond them. Our ambition is significant, and achieving it requires a team that operates at the highest level. We look for individuals who are not just ambitious but who also possess the traits that are fundamental to our success: Scrappy, Curious, Optimistic, Persistent, and Empathetic.

Your role

As a Sr. Software Engineer in Observability, you’ll be responsible for our metrics and log collection platform. You’ll work closely with other Infrastructure engineers to determine resource usage and requirements. You’ll also help create tooling, libraries, and documentation that enable other engineers to instrument their own projects. In addition, you’ll keep our team aware of trends in the larger observability/monitoring industry.

This position reports to our Engineering Manager, Observability Platform, and can be based in our Bangalore, India office.

What you’ll do

  • Develop and improve instrumentation for monitoring and logging the health and availability of services.
  • Develop and maintain the observability stack within Dialpad engineering.
  • Define best practices and standards around making systems and services measurable and work with various teams to get those best practices applied.
  • Create tools and libraries for other engineering teams to enable them to build self-monitoring capabilities.
  • Create and own internal documentation used by the other engineering teams.
  • Stay up-to-date with the latest trends in observability, logging, monitoring, and cloud technologies. Introduce innovative solutions and best practices to improve system observability and reliability. Experiment with new tools and practices to enhance the observability landscape.
  • Collaborate with different engineering teams to integrate observability practices into their workflows.
  • Participate in an on-call rotation within the larger Infrastructure Engineering division.

Skills you’ll bring

  • Background in Systems and/or Software Engineering.
  • Experience in designing, automating, maintaining, and optimizing observability platforms (logging, metrics, and tracing).
  • Experience with configuration management tools such as Ansible and Terraform.
  • Experience with Public Cloud environments such as GCP and AWS.
  • Familiarity with languages such as Python, Go, and Rust.

Bonus skills you may have

  • Previous direct experience with Grafana, Loki, Prometheus.
  • Experience with Linux.
  • Experience with Kubernetes (including GKE/EKS) and building containerized applications.
  • Undergraduate degree in Computer Science or Engineering.

We believe in investing in our people. Dialpad offers competitive benefits and perks, alongside a robust training program that helps you reach your full potential. We have designed our offices to be inclusive, offering a vibrant environment to cultivate collaboration and connection. Our exceptional culture, recognized repeatedly as a certified Great Place to Work, ensures every employee feels valued and empowered to contribute to our collective success.

Don’t meet every single requirement? If you’re excited about this role and you possess the fundamental traits, the drive, and strong ambition we seek, but your experience doesn’t satisfy every qualification, we encourage you to apply.

Dialpad is an equal-opportunity employer. We are dedicated to creating a community of inclusion and an environment free from discrimination or harassment.
