Infrastructure Engineer – Clips/ML Data

Medal Esports

Job Summary

At Medal, we are redefining how people capture and share gameplay experiences. This role involves taking full ownership of our Clips/ML data infrastructure, building scalable, real-time pipelines that power user-facing discovery and machine learning research. You will lead the architecture, operations, and performance of data systems, influencing content indexing and model training, with a focus on petabyte-scale video pipelines and low-latency systems for real-time gaming insights.

Must Have

  • Architect and operate petabyte-scale ingestion pipelines
  • Design automated QA guard-rails (schema validation, anomaly detection, deduplication)
  • Build high-performance ETL and feature-extraction jobs to process and index hundreds of millions of clips
  • Own the end-to-end data ingestion stack (desktop, web & mobile clients)
  • Establish real-time monitoring, lineage, and “five-nines” SLAs
  • Partner with research and product to curate high-signal data slices and accelerate model experimentation
  • Champion security, privacy, and governance
  • 5+ years of experience in data engineering, backend systems
  • Deep knowledge of ETL/ELT pipelines, distributed systems, and streaming data architectures
  • Strong proficiency with Python, Java, Go, or similar languages
  • Experience with cloud infrastructure (AWS, GCP), provisioning tools (terraform, pulumi) and modern data stack tools (dbt, Airflow, Parquet, Arrow)
  • Track record of designing systems with extreme scale and performance requirements
  • Experience consolidating data from diverse sources into unified data models
  • Deep understanding of data QA methodologies, anomaly detection, and automated testing in production systems
  • Passion for mentorship and team development
  • A bias toward ownership, urgency

Good to Have

  • Experience with video data or ML infrastructure

Perks & Benefits

  • Work on cutting-edge tech and help shape the future of gaming
  • Passionate team that values ownership and innovation
  • Competitive salary
  • Equity options
  • Comprehensive health insurance
  • 401k

Job Description

The Role

At Medal, we’re redefining how people capture and share gameplay experiences. Every day, our platform ingests gameplay video that is raw, unfiltered, and packed with insights. We're looking for a seasoned Data Engineer to take full ownership of our Clips/ML data infrastructure, building the next generation of scalable, real-time pipelines that power everything from user-facing discovery to machine learning research. You'll lead the architecture, operations, and performance of data systems that sit at the heart of our product, influencing everything from content indexing to model training.

If you're passionate about building petabyte-scale video pipelines, love working on low-latency systems, and are excited to help define the future of real-time gaming insights, we want to hear from you.

You Will

  • Architect and operate petabyte-scale ingestion pipelines
  • Design automated QA guard-rails (schema validation, anomaly detection, deduplication)
  • Build high-performance ETL and feature-extraction jobs to process and index hundreds of millions of clips into columnar/video-native formats
  • Own the end-to-end data ingestion stack (desktop, web & mobile clients)
  • Establish real-time monitoring, lineage, and “five-nines” SLAs, driving continuous improvement across storage, compute, and network layers
  • Partner with research and product to curate high-signal data slices, data-health metrics, and accelerate model experimentation
  • Champion security, privacy, and governance: implement robust RBAC, audit trails, and compliant retention policies for sensitive gameplay footage and user inputs
  • Mentor and uplevel engineers (including internal Medal platform talent), fostering a culture of craftsmanship, documentation, and ruthless focus on data excellence

You Need

  • 5+ years of experience in data engineering, backend systems, or related roles. Experience with video data or ML infrastructure is a plus.
  • Deep knowledge of ETL/ELT pipelines, distributed systems, and streaming data architectures (e.g., Kafka, Spark, Flink, etc.)
  • Strong proficiency with Python, Java, Go, or similar languages used in data-intensive environments
  • Experience with cloud infrastructure (e.g., AWS, GCP), provisioning tools (e.g. terraform, pulumi) and modern data stack tools (e.g., dbt, Airflow, Parquet, Arrow)
  • Track record of designing systems with extreme scale and performance requirements
  • Experience consolidating data from diverse sources into unified data models
  • Deep understanding of data QA methodologies, anomaly detection, and automated testing in production systems
  • Passion for mentorship and team development; able to upskill engineers and advocate for engineering excellence
  • A bias toward ownership, urgency, and a desire to build systems that just work, even at scale

Why Join Us

  • Work on cutting-edge tech and help shape the future of gaming
  • Passionate team that values ownership and innovation
  • Competitive salary, equity options, comprehensive health insurance, 401k

Compensation Range: $170K - $275K

Apply for this Job

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

9 Skills Required For This Role

Game Texts Quality Control Automated Testing Aws Terraform Spark Python Java Machine Learning

Similar Jobs