Data Reliability Engineer II

zeta

Job Summary

Zeta is a Next-Gen Banking Tech company empowering banks and fintechs with its cloud-native processing platform, Zeta Tachyon. With over 1700 employees globally, Zeta is transforming customer experience for multi-million card portfolios. The Data Reliability Engineer II will proactively monitor and tune PostgreSQL RDS instances, troubleshoot data pipeline issues with Debezium, Kafka Connect, Nifi, and Airflow, and ensure data integrity and security. This role involves developing automation scripts, participating in on-call rotations, and collaborating on query optimization and schema design.

Must Have

  • Proactively monitor PostgreSQL RDS instances for performance, availability, and resource utilization.
  • Assist in identifying and applying basic performance tuning techniques for PostgreSQL RDS.
  • Monitor and troubleshoot Debezium and Kafka Connect connectors for data capture and delivery issues.
  • Monitor Apache Nifi data flows for errors, backpressure, and performance issues, assisting in resolution.
  • Provide support for data related issues and participate in root cause analysis.
  • Monitor the execution of Apache Airflow DAGs, identify failed tasks, and perform troubleshooting and re-runs.
  • Develop and maintain automation scripts and infrastructure as code (IAC) templates.
  • Participate in on-call rotations to respond to database-related incidents and perform troubleshooting.
  • Assist in implementing and maintaining security best practices for cloud databases.
  • Regularly audit and assess database security configurations.
  • Configure and manage database backup and recovery strategies.
  • Analyse database query performance and collaborate with developers to optimize SQL queries and schemas.
  • Participate in continuous improvement initiatives to enhance reliability, scalability, and performance.
  • Assist in the design and optimization of database schemas for cloud environments.

Good to Have

  • Python scripting
  • Bash scripting
  • AWS Certified Database - Specialty certification

Job Description

About Zeta

Zeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015.

Our flagship processing platform - Zeta Tachyon - is the industry’s first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing, lending, core banking, fraud & risk, and many more capabilities as a single-vendor stack. 20M+ cards have been issued on our platform globally.

Zeta is actively working with the largest Banks and Fintechs in multiple global markets transforming customer experience for multi-million card portfolios.

Zeta has over 1700+ employees - with over 70% roles in R&D - across locations in the US, EMEA, and Asia. We raised $340 million at a $2 billion valuation from Softbank, Mastercard, and other investors in 2021.

Learn more @ www.zeta.tech, careers.zeta.tech, Linkedin, Twitter

Responsibilities

  • Proactively monitor PostgreSQL RDS instances for performance, availability, and resource utilization (CPU, memory, storage, connections) using established monitoring tools (e.g., CloudWatch, Prometheus).
  • Assist in identifying performance bottlenecks in PostgreSQL RDS. Apply basic performance tuning techniques like reviewing query execution plans, adding missing indexes, and recommending parameter adjustments.
  • Monitor the health and performance of Debezium and Kafka Connect connectors, identifying and troubleshooting basic issues related to data capture and delivery.
  • Monitor Apache Nifi data flows for errors, backpressure, and performance issues. Assist in troubleshooting and resolving common Nifi flow failures.
  • Provide support for data related issues and participate in root cause analysis.
  • Monitor the execution of Apache Airflow DAGs, identify failed tasks, and troubleshooting and re-runs.
  • Develop and maintain automation scripts and infrastructure as code (IAC) templates (e.g., using Crossplane, Terraform) to automate routine database tasks, deployments, and updates.
  • Participate in on-call rotations to respond to database-related incidents and perform troubleshooting and root cause analysis.
  • Assist in implementing and maintaining security best practices for cloud databases, including access controls, encryption, and compliance with regulatory requirements.
  • Regularly audit and assess database security configurations.
  • Configure and manage database backup and recovery strategies to ensure data integrity and availability in case of failures or data loss.
  • Analyse database query performance and collaborate with developers to optimize SQL queries and schemas.
  • Participate in continuous improvement initiatives to enhance the reliability, scalability, and performance of cloud databases.
  • Assist in the design and optimization of database schemas for cloud environments.

Skills

  • Familiarity with data pipeline concepts and technologies like Debezium, Kafka Connect, Apache Nifi.
  • Basic understanding of Amazon Redshift and S3.
  • Exposure to Apache Spark for data processing.
  • Basic understanding of Apache Airflow for workflow orchestration.
  • Strong SQL scripting skills for querying and basic data manipulation.
  • Familiarity with scripting languages (e.g., Python, Bash) is a plus.
  • Knowledge of database security best practices, including access controls, encryption, and compliance with regulatory requirements (e.g., GDPR, HIPAA).
  • Having ‘AWS Certified Database - Specialty' certification is a plus

Experience and Qualifications

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 3-5 years of experience in database administration, with a focus on PostgreSQL.
  • 1-2 years of hands-on experience with PostgreSQL RDS.

Equal Opportunity

Zeta is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We encourage applicants from all backgrounds, cultures, and communities to apply and believe that a diverse workforce is key to our success

10 Skills Required For This Role

Problem Solving Game Texts Postgresql Aws Prometheus Terraform Spark Python Sql Bash

Similar Jobs