Staff Data Infrastructure Engineer

1 Month ago • All levels • Data Analyst • DevOps

Job Summary

Job Description

Character.AI seeks a highly skilled Data Infrastructure Engineer to design and manage large-scale (5+TB/day), fault-tolerant data architectures. Responsibilities include architecting and managing data architectures using Hive, Spark, and Trino; utilizing GCP (BigQuery, GCS, Pub/Sub) and open-source technologies (Iceberg, Parquet/ORC); implementing GDPR and CCPA compliant data governance; applying SRE principles for continuous uptime; and designing and managing real-time data flows using Kafka, Pub/Sub, Flink, or Spark Streaming. The ideal candidate possesses expertise in distributed systems, cloud technologies, big data tools, and data governance, with proven experience in performance tuning and cost optimization strategies.
Must have:
  • Distributed systems & data architecture experience
  • Java (Spark, Trino, Iceberg) expertise
  • Cloud technologies & big data tools familiarity
  • Data governance & compliance knowledge
  • SRE principles & practices application
  • Performance tuning & cost optimization
  • Real-time data streaming pipeline design

Job Details

Overview

We are seeking a highly skilled Data Infrastructure Engineer with deep knowledge of distributed systems and extensive experience in designing and managing large-scale (5+TB/day), fault-tolerant data architectures. The ideal candidate will have expertise in cloud and big data technologies, as well as a strong understanding of compliance and privacy regulations.

Key Responsibilities

  • Design and Management: Architect and manage large-scale, fault-tolerant data architectures using technologies such as Hive, Spark, and Trino.

  • Cloud & Big Data Expertise: Utilize cloud platforms (GCP, including BigQuery, GCS, and Pub/Sub) and open-source data lake technologies (Iceberg, Parquet/ORC) to build scalable data solutions.

  • Compliance & Privacy: Implement data governance frameworks that comply with GDPR and CCPA, including developing data retention policies, access controls, and privacy-by-design principles.

  • Site Reliability Engineering: Apply SRE principles to ensure continuous uptime, including monitoring, alerting, incident response, and conducting postmortems for rapid issue resolution.

  • Performance & Cost Optimization: Configure partitioning, clustering, and compression strategies; tune queries and cluster resources to ensure low-latency queries and cost efficiency.

  • Design & Manage Streaming Pipelines: Architect and operate real-time data flows using technologies such as Kafka, Pub/Sub, Flink, or Spark Streaming to handle high-volume event streams with low latency.

Qualifications

  • Proven experience in distributed systems and data architecture.

  • Expertise in Java (Spark, Trino, Iceberg)

  • Strong familiarity with cloud technologies and big data tools.

  • Knowledge of data governance and compliance frameworks.

  • Experience in applying SRE principles and practices.

  • Expertise in performance tuning and cost optimization strategies.

  • Proficient in designing and managing real-time data streaming pipelines.

Location: SF Bay Area Preferred, NYC OK

About Character.AI

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventures.


In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Year—a testament to our innovative technology and visionary approach.


Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!

At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.

Compensation Range: $150K - $275K

Similar Jobs

Microsoft - Software Engineer 2 (Core Search platform)

Microsoft

Beijing, Beijing, China (On-Site)
3 Months ago
The Walt Disney Company - Lead Software Engineer, Ad Platforms

The Walt Disney Company

California, United States (On-Site)
2 Months ago
Egnyte - Senior DevOps Engineer - Azure

Egnyte

India (Remote)
1 Month ago
PENN Interactive - Senior Software Developer, Pricing Engine

PENN Interactive

Philadelphia, Pennsylvania, United States (Hybrid)
2 Months ago
Anavation - Software Developer 4

Anavation

Chantilly, Virginia, United States (On-Site)
5 Months ago
Demonware - Principal Software Engineer (Distributed Systems/Data)

Demonware

Vancouver, British Columbia, Canada (On-Site)
2 Months ago
ByteDance - Senior Data Scientist

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
The Walt Disney Company - Digital Insights Analyst

The Walt Disney Company

Richmond, Victoria, Australia (On-Site)
2 Months ago
ComeOn Group - Responsible Gaming Analyst

ComeOn Group

St. Julian's, Malta (Hybrid)
1 Month ago
Lionsgate Games - Coordinator, Research & Digital Insights

Lionsgate Games

Santa Monica, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Nagarro - QA-AUTOMATION

Nagarro

Cairo, Cairo Governorate, Egypt (On-Site)
5 Months ago
Highspot - Sr. Full Stack Engineer, Training & Coaching

Highspot

Hyderabad, Telangana, India (Hybrid)
6 Months ago
Microsoft - Senior Software Engineer - CTJ - TS/SCI

Microsoft

Redmond, Washington, United States (On-Site)
3 Months ago
Tesla - Controls Engineer Paint

Tesla

Brandenburg, Germany (On-Site)
2 Months ago
Snowprint Studios - Server Developer

Snowprint Studios

Berlin, Berlin, Germany (Hybrid)
1 Month ago
Rovio Entertainment Corporation - Lead/Principal Data Engineer

Rovio Entertainment Corporation

Uusimaa, Finland (Hybrid)
1 Month ago
Google - Senior Software Engineer, Full Stack

Google

(On-Site)
4 Months ago
ION - Lead Software Engineer, Italy

ION

Pisa, Tuscany, Italy (On-Site)
6 Months ago
Nagarro - Associate Staff Engineer, QA Automation

Nagarro

Cebu City, Central Visayas, Philippines (On-Site)
5 Months ago
Meta - Production Engineering

Meta

Seattle, Washington, United States (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Menlo Park, California, United States

Penumbra - Life Sciences Counsel

Penumbra

Alameda, California, United States (On-Site)
4 Months ago
Evolution - Equipment Support Specialist

Evolution

Atlantic City, New Jersey, United States (On-Site)
1 Month ago
Mattel  Inc  - Associate Digital Gaming Designer

Mattel Inc

El Segundo, California, United States (On-Site)
1 Month ago
Samsung Semiconductor - Intern, Visualization Engineer

Samsung Semiconductor

San Jose, California, United States (Hybrid)
3 Months ago
Riot Games - Senior Manager, Technical Product Management - League of Legends

Riot Games

Los Angeles, California, United States (On-Site)
4 Months ago
Netflix - Analytics Engineer (L5) - Product

Netflix

United States (Remote)
2 Months ago
The Walt Disney Company - Supervising Producer - Unannounced Kids CG Series

The Walt Disney Company

Glendale, California, United States (On-Site)
4 Weeks ago
Samsung Semiconductor - Intern, Logic Design Engineer

Samsung Semiconductor

San Jose, California, United States (Hybrid)
4 Weeks ago
The Walt Disney Company - Managing Producer - Graphics

The Walt Disney Company

Bristol, Connecticut, United States (Hybrid)
2 Months ago
Blind Squirrel Games - Sr. Level Designer

Blind Squirrel Games

California, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

Meta - Data Science Director

Meta

Menlo Park, California, United States (Remote)
5 Months ago
Wargaming - Game Data Analyst (World of Tanks)

Wargaming

Prague, Prague, Czechia (On-Site)
1 Month ago
Next Level Business Services - Information Management Architect (Full Time)

Next Level Business Services

Santa Clara, California, United States (On-Site)
5 Months ago
Activision - Lead Analytics Engineer

Activision

Santa Monica, California, United States (On-Site)
5 Months ago
Wolters Kluwer - Lead Product Software Engineer -  Lead Cloud Data Engineer

Wolters Kluwer

Coppell, Texas, United States (Hybrid)
6 Months ago
SOFTGAMES - Head of Analytics - Fully Remote

SOFTGAMES

Berlin, Berlin, Germany (Remote)
1 Month ago
Next Level Business Services - SAP Master Data Standards Consultant

Next Level Business Services

King Of Prussia, Pennsylvania, United States (On-Site)
5 Months ago
Sinch - Data Analyst

Sinch

Stockholm, Stockholm County, Sweden (Hybrid)
5 Months ago
The Walt Disney Company - Hulu Analytics Experimentation Intern, Summer 2025

The Walt Disney Company

Santa Monica, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Character is one of the world's leading personal AI platforms. Founded in 2021 by AI pioneers Noam Shazeer and Daniel De Freitas, Character is a full-stack AI company with a globally scaled direct-to-consumer platform. 

Menlo Park, California, United States (Remote)

San Francisco, California, United States (On-Site)

Menlo Park, California, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

Menlo Park, California, United States (On-Site)

Menlo Park, California, United States (On-Site)

Menlo Park, California, United States (On-Site)

Menlo Park, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Character.AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug