Staff/Senior Software Engineer, Machine Learning Platform (Ad Cloud)

3 Months ago • 4 Years + • Devops

Job Summary

Job Description

The Staff/Senior Machine Learning Platform Engineer will be responsible for shaping the architecture and core components of the ML platform, including batch (Spark), streaming (Flink), job orchestration (Argo on Kubernetes), and infrastructure tools. The role involves designing and scaling pipelines, building and maintaining API servers and developer tools, ensuring high availability and observability, and collaborating with various teams to deliver reliable and efficient ML platform capabilities. The engineer will also actively adopt and promote the use of LLM-based tools to accelerate development and mentor junior engineers.
Must have:
  • 4+ years of experience in data systems/ML infrastructure.
  • Strong coding proficiency in Python/Java.
  • Experience with Spark, Flink, Kubernetes (GKE).
  • Experience with Terraform and Helm.

Job Details

About Appier

Appier is a software-as-a-service (SaaS) company that uses artificial intelligence (AI) to power business decision-making. Founded in 2012 with a vision of democratizing AI, Appier’s mission is turning AI into ROI by making software intelligent. Appier now has 17 offices across APAC, Europe and U.S., and is listed on the Tokyo Stock Exchange (Ticker number: 4180). Visit www.appier.com for more information.

 

The Impact You’ll Make at Appier 

We’re looking for a Staff/Senior Machine Learning Platform Engineer to join our Machine Learning Platform Team, which powers end-to-end infrastructure for model training, evaluation, deployment, and monitoring at scale. Our platform supports daily execution of hundreds of ML models and processes billions of data records across batch and streaming pipelines.
In this role, you’ll shape the architecture and core components of our ML platform—covering batch (Spark), streaming (Flink), job orchestration (Argo on Kubernetes), and infrastructure tools—while ensuring the platform remains robust, scalable, and developer-friendly. You’ll also champion best practices and modern development tools including LLM-based programming assistants.

 

What You’ll Work On

  • Architect, implement, and scale batch (Spark) and streaming (Flink) pipelines that process billions of records daily for ML training and evaluation.
  • Design and operate robust ML job execution frameworks for training, inference, and post-processing.
  • Build and maintain internal API servers and developer tools to orchestrate ML jobs on Kubernetes (via Argo Workflows, Helm, Terraform).
  • Design and monitor data infrastructure using ClickHouse and PostgreSQL.
  • Ensure high availability and observability through monitoring tools like Prometheus and Grafana.
  • Collaborate with data scientists, product managers, and engineers to deliver reliable and efficient ML platform capabilities.
  • Actively adopt and promote the use of LLM-based tools (e.g., GitHub Copilot, ChatGPT) to accelerate development, documentation, and debugging.
  • Mentor junior engineers and help evolve team engineering culture and standards.

 

What We’re Looking For

  • Bachelor’s degree in Computer Science, Engineering, or a related field; Master’s preferred.
  • 4+ years of hands-on experience in data systems, machine learning infrastructure, or platform engineering.
  • Strong coding proficiency in Python and/or Java, with experience building large-scale production systems.
  • Practical experience with Spark, Flink, Kubernetes (GKE), and infrastructure-as-code tools such as Terraform and Helm.
  • Experience managing high-throughput data infrastructure using ClickHouse, PostgreSQL, or similar systems.
  • Deep understanding of ML pipelines and distributed job execution in production environments.
  • Proven ability to apply LLM-based tools (e.g., Copilot, ChatGPT) to boost engineering productivity.
  • Strong ownership, architectural thinking, and ability to lead cross-functional platform projects.

 

 

#LI-AK1

Similar Jobs

Square - Technical Consultant

Square

Orlando, Florida, United States (Remote)
3 Weeks ago
Glean - Solutions Engineer - Central

Glean

(Remote)
3 Months ago
Granicus - Software Engineer 3 - Ruby/PHP

Granicus

Bengaluru, Karnataka, India (Remote)
2 Months ago
Wrike - Inside Sales Representative (German)

Wrike

Ireland (Remote)
1 Month ago
Autodesk - Field Marketing Manager

Autodesk

Mumbai, Maharashtra, India (Hybrid)
2 Months ago
bytedance - Software Engineer - Service Platform Intern - 2025 Start

bytedance

Singapore (On-Site)
4 Months ago
upwork - Principal ML Infrastructure Engineer

upwork

(Remote)
3 Months ago
bytedance - Senior Software Engineer, Traffic Platform

bytedance

San Jose, California, United States (On-Site)
9 Months ago
luxsoft - Senior/Lead DevOps Engineer

luxsoft

Zaragoza, Aragon, Spain (On-Site)
1 Month ago
Workato - Senior Infrastructure Engineer

Workato

Nicosia, Nicosia, Cyprus (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Ramp - Channel Partner Manager | Global Systems Integrator

Ramp

New York, New York, United States (Hybrid)
3 Weeks ago
Figma - Account Executive, Federal

Figma

San Francisco, California, United States (Remote)
2 Months ago
Glean - Solutions Architect

Glean

Seattle, Washington, United States (On-Site)
3 Months ago
Veeam Software - Platform Engineer, SaaS

Veeam Software

Warsaw, Masovian Voivodeship, Poland (Remote)
3 Months ago
Hudl - Senior Marketing Manager

Hudl

Lincoln, Nebraska, United States (On-Site)
1 Month ago
NCR Voyix - App Dev Engineer I

NCR Voyix

Gurugram, Haryana, India (On-Site)
2 Months ago
Capgemini - Application Consultant

Capgemini

Noida, Uttar Pradesh, India (On-Site)
2 Months ago
bytedance - Strategy Manager - BytePlus

bytedance

Singapore (On-Site)
9 Months ago
Varonis  - Regional Sales Director

Varonis

Minneapolis, Minnesota, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Taipei City, Taiwan

GoMotive - Embedded Engineer

GoMotive

Taipei City, Taiwan (Remote)
2 Months ago
fluence - Field Services Engineer

fluence

Taipei City, Taiwan (Remote)
3 Weeks ago
Canonical - Silicon Alliances Ecosystem Development Manager - APAC

Canonical

Taipei City, Taiwan (Hybrid)
3 Months ago
appier - Customer Service Specialist , Ad.Creative

appier

Taipei City, Taiwan (On-Site)
3 Weeks ago
Trend Micro - Senior Cloud Developer

Trend Micro

Taipei City, Taiwan (On-Site)
1 Month ago
Canonical - Senior Ubuntu Embedded IoT System Engineer

Canonical

Taipei City, Taiwan (On-Site)
3 Months ago
Qualcomm - Hardware Baseband Engineer

Qualcomm

Taipei City, Taiwan (On-Site)
2 Months ago
rivos - Field Application Engineer (Senior)

rivos

Hsinchu, Hsinchu City, Taiwan (Hybrid)
5 Months ago
appier - Sales Operations Associate Manager

appier

Taipei City, Taiwan (On-Site)
1 Month ago
WongDoody - PRODUCT SERVICE DESIGNER, TAIWAN

WongDoody

Taipei City, Taiwan (On-Site)
9 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Electronic Arts - Site Reliability Engineer III

Electronic Arts

Vancouver, British Columbia, Canada (Hybrid)
1 Month ago
Mistral AI - DevOps Engineer, HPC Services

Mistral AI

Paris, Île-de-France, France (Hybrid)
2 Months ago
Wargaming - Infrastructure Engineer

Wargaming

Nicosia, Nicosia, Cyprus (Hybrid)
2 Months ago
Trellix - DevOps/Software Engineer

Trellix

Cork, County Cork, Ireland (On-Site)
2 Months ago
Fractal - DevOps - Lead

Fractal

Mumbai, Maharashtra, India (On-Site)
9 Months ago
Zazz - IoT Solutions Architect

Zazz

(Remote)
6 Months ago
Nagarro - SAP SuccessFactors Solution Architect with German

Nagarro

Romania (Remote)
10 Months ago
appier - Senior Software Engineer, Backend Development (Ad Cloud Serving Services)

appier

Taipei City, Taiwan (On-Site)
1 Month ago
Salesforce - Principal, AgentForce Solution Engineer - Consumer Business Service

Salesforce

San Francisco, California, United States (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded