Senior Monitoring Engineer (Zabbix/Prometheus)

7 Minutes ago • 3 Years + • Devops

Job Summary

Job Description

Social Discovery Group (SDG) is seeking a Senior Monitoring Engineer (Zabbix/Prometheus) to develop and optimize Zabbix monitoring instances, ensuring high availability and SLA compliance. The role involves configuring end-to-end monitoring, creating templates and Grafana dashboards, and ensuring PostgreSQL backups. The engineer will improve monitoring quality through retrospective analysis and generate reports from monitoring systems. This is a full-time remote opportunity.
Must have:
  • Develop and optimize Zabbix monitoring instance components, including High Availability.
  • Ensure SLA compliance through effective monitoring and timely incident response.
  • Configure monitoring end-to-end (metrics, triggers, alerts, escalations).
  • Create and modify monitoring templates and Grafana dashboards.
  • Ensure PostgreSQL backups and high availability for monitoring data.
  • Improve monitoring quality via retrospective analysis of trigger precision/recall and false-positive reduction.
  • Generate periodic reports based on data from monitoring systems.
  • 3+ years as a Senior Zabbix Administrator and Prometheus Administrator.
  • 1+ year working with PostgreSQL.
  • Strong hands-on experience with Zabbix, Prometheus, Grafana.
  • Practical experience with Ansible, Git/GitLab/CI/CD, RPM-based Linux (CentOS/AlmaLinux/RHEL).
  • Scripting skills in bash and/or Python.
Good to have:
  • PowerShell scripting skills.
Perks:
  • REMOTE OPPORTUNITY to work full time
  • Vacation 28 calendar days per year
  • 7 wellness days per year (time off)
  • Bonuses up to $5000 for recommending successful applicants
  • Full payment for professional training, international conferences and meetings
  • Corporate discount for English lessons
  • Health benefits (up to $1,000 gross per year for medical insurance or doctor’s fees)
  • Workplace organization (equipped workplace in offices/co-working or reimbursement up to $1000 gross every 3 years)
  • Internal gamified gratitude system

Job Details

Social Discovery Group (SDG) is the 3rd largest social discovery company in the world, uniting 60+ brands with 500 million users. We solve the problems of loneliness, isolation, and disconnection by transforming virtual intimacy into the new normal. Our portfolio includes online communication platforms focusing on AI, game mechanics, and video streaming - Dating.com, DateMyAge, Cupid Media, Dil Mil, Kiseki, and others.

SDG invests in IT startups around the world. Our investments include Open AI, Patreon, Flo, Clubhouse, Woebot, Flure, Astry, Coursera, Academia.edu, and many others.

We bring together a team of like-minded people and IT professionals specializing in the creation and development of globally impactful social discovery products. Our international team of 1200 professionals and digital nomads works all over the world.

Our teams of digital nomads work remotely from Cyprus, Malta, the USA, Armenia, Georgia, Kazakhstan, Montenegro, Poland, Latvia, Serbia, Spain, Portugal, UAE, Israel, Turkey, Thailand, Indonesia, Japan, Hong Kong, Australia and many other locations.

In August 2024, we achieved Great Place to Work US Certification™! This achievement reflects our core belief that a truly exceptional workplace is built on trust, pride, and camaraderie—not just great perks.

Your main tasks will be:

  • Develop and optimize all components of the Zabbix monitoring instance, including provisioning High Availability at different levels.
  • Ensure SLA compliance through effective monitoring and timely incident response.
  • Configure monitoring end-to-end (metrics, triggers, alerts, escalations).
  • Create and modify monitoring templates and Grafana dashboards.
  • Ensure PostgreSQL backups and high availability for monitoring data.
  • Improve monitoring quality via retrospective analysis of trigger precision/recall and false-positive reduction.
  • Generate periodic reports based on data from monitoring systems.

We expect from you:

  • 3+ years as a Senior Zabbix Administrator and Prometheus Administrator (mandatory).
  • 1+ year working with PostgreSQL (mandatory).
  • Strong hands-on with Zabbix, Prometheus, Grafana (required).
  • Practical experience with Ansible, Git/GitLab/CI/CD, RPM-based Linux (CentOS/AlmaLinux/RHEL).
  • Scripting skills in bash and/or Python; PowerShell is a plus.
  • Ability to own monitoring configuration and continuously raise quality and reliability.

What do we offer:

  • REMOTE OPPORTUNITY to work full time;
  • Vacation 28 calendar days per year;
  • 7 wellness days per year (time off) that can be used to deal with household issues, to lie down and recover without taking sick leave;
  • Bonuses up to $5000 for recommending successful applicants for positions in the company;
  • Full payment for professional training, international conferences and meetings;
  • Corporate discount for English lessons;
  • Health benefits. According to the paychecks, if you are not eligible for corporate medical insurance, the company will compensate you with up to $ 1,000 gross per year per employee. This can be spent on self-purchase of health insurance or on doctor’s fees for yourself and close relatives (spouse, children);
  • Workplace organization. The company provides all employees with an equipped workplace and all the necessary equipment (table, armchair, wifi, etc.) in our offices or co-working locations. In the other locations, the company provides reimbursement of workplace costs up to $ 1000 gross once every 3 years, according to the paychecks. This money can be spent on the rent of the co-working room, on equipping the working place at home (desk, chair, Internet, etc.) during those 3 years;
  • Internal gamified gratitude system: receive bonuses from colleagues and exchange them for our merchandise, team building activities, massage certificates, etc.

Sounds good? Join us now!

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in undefined

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Social Discovery Group(SDG) is the 3rd largest social discovery company in the world, uniting 60+ brands with 500 million users. We solve the problems of loneliness, isolation, and disconnection by transforming virtual intimacy into the new normal. Our portfolio includes online communication platforms focusing on AI, game mechanics, and video streaming - Dating.com, DateMyAge, Cupid Media, Dil Mil, Kiseki, and others.
View All Jobs

Get notified when new jobs are added by Social Discovery Ventures

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug
Contact Us
hello@outscal.com
Made in INDIA 💛💙