Staff Data Infrastructure Engineer

1 Hour ago • All levels • Data Analyst • DevOps

Job Summary

Job Description

Character.AI seeks a highly skilled Data Infrastructure Engineer to design and manage large-scale (5+TB/day), fault-tolerant data architectures. Responsibilities include architecting and managing data architectures using Hive, Spark, and Trino; utilizing GCP (BigQuery, GCS, Pub/Sub) and open-source technologies (Iceberg, Parquet/ORC); implementing GDPR and CCPA compliant data governance; applying SRE principles for continuous uptime; and designing and managing real-time data flows using Kafka, Pub/Sub, Flink, or Spark Streaming. The ideal candidate possesses expertise in distributed systems, cloud technologies, big data tools, and data governance, with proven experience in performance tuning and cost optimization strategies.
Must have:
  • Distributed systems & data architecture experience
  • Java (Spark, Trino, Iceberg) expertise
  • Cloud technologies & big data tools familiarity
  • Data governance & compliance knowledge
  • SRE principles & practices application
  • Performance tuning & cost optimization
  • Real-time data streaming pipeline design

Job Details

Overview

We are seeking a highly skilled Data Infrastructure Engineer with deep knowledge of distributed systems and extensive experience in designing and managing large-scale (5+TB/day), fault-tolerant data architectures. The ideal candidate will have expertise in cloud and big data technologies, as well as a strong understanding of compliance and privacy regulations.

Key Responsibilities

  • Design and Management: Architect and manage large-scale, fault-tolerant data architectures using technologies such as Hive, Spark, and Trino.

  • Cloud & Big Data Expertise: Utilize cloud platforms (GCP, including BigQuery, GCS, and Pub/Sub) and open-source data lake technologies (Iceberg, Parquet/ORC) to build scalable data solutions.

  • Compliance & Privacy: Implement data governance frameworks that comply with GDPR and CCPA, including developing data retention policies, access controls, and privacy-by-design principles.

  • Site Reliability Engineering: Apply SRE principles to ensure continuous uptime, including monitoring, alerting, incident response, and conducting postmortems for rapid issue resolution.

  • Performance & Cost Optimization: Configure partitioning, clustering, and compression strategies; tune queries and cluster resources to ensure low-latency queries and cost efficiency.

  • Design & Manage Streaming Pipelines: Architect and operate real-time data flows using technologies such as Kafka, Pub/Sub, Flink, or Spark Streaming to handle high-volume event streams with low latency.

Qualifications

  • Proven experience in distributed systems and data architecture.

  • Expertise in Java (Spark, Trino, Iceberg)

  • Strong familiarity with cloud technologies and big data tools.

  • Knowledge of data governance and compliance frameworks.

  • Experience in applying SRE principles and practices.

  • Expertise in performance tuning and cost optimization strategies.

  • Proficient in designing and managing real-time data streaming pipelines.

Location: SF Bay Area Preferred, NYC OK

About Character.AI

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventures.


In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Year—a testament to our innovative technology and visionary approach.


Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!

At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.

Compensation Range: $150K - $275K

Similar Jobs

Anavation - Software Developer 4

Anavation

Chantilly, Virginia, United States (On-Site)
• 4 Months ago
Scientific Games  - Technical Software Release Engineer

Scientific Games

Warwick, Rhode Island, United States (Hybrid)
• 1 Month ago
Visa - Staff Site Reliability Engineer - IT Disaster Recovery (ITDR)

Visa

Highlands Ranch, Colorado, United States (On-Site)
• 4 Months ago
Glean - Software Engineer, Machine Learning (Infrastructure)

Glean

Palo Alto, California, United States (On-Site)
• 3 Months ago
PwC - Analytics & AI Engineering Senior Associate

PwC

Athens, Greece (Hybrid)
• 1 Month ago
Tencent - Data Science Intern

Tencent

(On-Site)
• 14 Hours ago
ION - Data Engineer

ION

Budapest, Hungary (On-Site)
• 4 Months ago
Dun & Bradstreet - Data Engineer I (R-16802)

Dun & Bradstreet

Hyderabad, Telangana, India (Hybrid)
• 4 Months ago
Nielsen Holdings - Data Analyst

Nielsen Holdings

Mexico City, Mexico City, Mexico (Hybrid)
• 1 Day ago
The Walt Disney Company - Manager Data Science

The Walt Disney Company

New York, New York, United States (On-Site)
• 3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Nagarro - Associate Staff Engineer, QA Automation

Nagarro

South Africa (On-Site)
• 3 Months ago
NinjaVan - Staff Data Engineer

NinjaVan

Hyderabad, Telangana, India (On-Site)
• 4 Months ago
Assystems - DĂ©veloppeur Junior - H/F

Assystems

Lyon, Auvergne-RhĂ´ne-Alpes, France (Hybrid)
• 3 Months ago
Zuora - Software Engineer III

Zuora

Chennai, Tamil Nadu, India (Hybrid)
• 3 Months ago
The Walt Disney Company - Senior Software Engineer

The Walt Disney Company

Seattle, Washington, United States (On-Site)
• 3 Weeks ago
Futurum Technology  - Junior Java Developer

Futurum Technology

KrakĂłw, Lesser Poland Voivodeship, Poland (On-Site)
• 1 Month ago
Crunchyroll - Director, Software Engineering, Android

Crunchyroll

Culver City, California, United States (Remote)
• 3 Months ago
Sinch - Mid-Senior Full Stack Engineer

Sinch

Stockholm, Stockholm County, Sweden (Hybrid)
• 1 Month ago
PwC - IN_Manager_Tech Lead Payments_FS  tech _Advisory _Mumbai

PwC

Mumbai, Maharashtra, India (On-Site)
• 4 Months ago
ByteDance - Backend Engineer (Machine Learning System) Intern - 2025 Start

ByteDance

Singapore (On-Site)
• 4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Menlo Park, California, United States

The Walt Disney Company - Content Planning Manager

The Walt Disney Company

Glendale, California, United States (On-Site)
• 3 Months ago
Universal Music - Manager, Revenue Recognition

Universal Music

Santa Monica, California, United States (Hybrid)
• 2 Weeks ago
Patreon - Frontend Engineer

Patreon

New York, New York, United States (Hybrid)
• 3 Weeks ago
PTW - Team Leader - Player Support

PTW

Charleston, South Carolina, United States (On-Site)
• 8 Months ago
Crunchyroll - Senior Software Engineer, Machine Learning, Recommendations

Crunchyroll

Culver City, California, United States (On-Site)
• 3 Months ago
Tencent - Production Director

Tencent

Palo Alto, California, United States (On-Site)
• 3 Months ago
Anthology  Inc  - Senior Manager, Finance, HR & Payroll Implementation

Anthology Inc

United States (Remote)
• 2 Weeks ago
Zoox - Software Engineer - Simulaton Scenario Automation

Zoox

Foster City, California, United States (Hybrid)
• 4 Months ago
Microsoft - Senior Researcher – Artificial Intelligence

Microsoft

Redmond, Washington, United States (On-Site)
• 1 Month ago
Mattel  Inc  - Development Program Management Associate II

Mattel Inc

California, United States (On-Site)
• 2 Months ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

Onward Search - Data Analyst II

Onward Search

San Jose, California, United States (On-Site)
• 1 Month ago
Epic Games - Senior Data Analyst, Game Platform

Epic Games

(On-Site)
• 1 Month ago
ByteDance - LLM Training Operation (Language and Creative) - Analyst

ByteDance

Singapore (On-Site)
• 3 Months ago
Netflix - Data Analyst, Finance & Strategy, MarComms

Netflix

Los Angeles, California, United States (On-Site)
• 4 Weeks ago
Warner Bros Games - Director, Market Intelligence

Warner Bros Games

Burbank, California, United States (Hybrid)
• 2 Weeks ago
Feld Entertainment - Data Engineer

Feld Entertainment

Ellenton, Florida, United States (On-Site)
• 3 Months ago
ION - Data Associate - KYC6

ION

Budapest, Hungary (On-Site)
• 4 Months ago
Conjointly - Quantitative Market Researcher

Conjointly

Bogotá, Bogota, Colombia (Remote)
• 1 Month ago
N-iX - Senior Business Analyst

N-iX

Poland (Remote)
• 2 Days ago

Get notifed when new similar jobs are uploaded

About The Company

Character is one of the world's leading personal AI platforms. Founded in 2021 by AI pioneers Noam Shazeer and Daniel De Freitas, Character is a full-stack AI company with a globally scaled direct-to-consumer platform. 

California, United States (On-Site)

New York, New York, United States (On-Site)

Menlo Park, California, United States (On-Site)

Menlo Park, California, United States (On-Site)

Menlo Park, California, United States (On-Site)

New York, New York, United States (On-Site)

Menlo Park, California, United States (On-Site)

Menlo Park, California, United States (On-Site)

Menlo Park, California, United States (On-Site)

Menlo Park, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Character.AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug