Senior Research Engineer (Data)

2 Months ago • 3 Years + • Artificial Intelligence • Data Analyst • $175,000 PA - $250,000 PA

Job Summary

Job Description

This Senior Research Engineer (Data) role focuses on spearheading data acquisition and management systems for advanced AI research. Responsibilities include architecting and maintaining efficient data pipelines for sourcing, processing, and organizing large datasets used in generative AI models. The role requires partnering with research teams to improve model performance by identifying and leveraging novel data sources, developing robust data pipelines (including deduplication and filtering), collaborating with annotation teams to enhance dataset quality, applying advanced methodologies like self-supervised active learning, and leading research projects to improve data quality for video generation models. The ideal candidate will have 3+ years of experience managing large-scale datasets in fields like computer vision or NLP, strong Python and PyTorch skills, and experience with large-scale data processing tools like SQL or Spark.
Must have:
  • 3+ years experience managing large datasets
  • Strong Python & PyTorch proficiency
  • Experience with SQL or Spark
  • Expertise in designing distributed systems
  • Data pipeline development & maintenance
Perks:
  • Competitive equity packages
  • Comprehensive benefits plan

Job Details

We are seeking a Senior Software Engineer to spearhead our data acquisition and management systems, critical to our advanced AI research. In this role, you will architect and maintain efficient pipelines for sourcing, processing, and organizing the extensive datasets that fuel our generative AI models. Your expertise will have a direct and transformative impact on the quality and capabilities of our technology.

Responsibilities

  • Partner with research teams to understand and address model performance gaps by identifying and leveraging novel data sources.
  • Develop and implement robust data pipelines for acquisition, deduplication, filtering, and pre-training dataset preparation.
  • Collaborate with annotation operations teams to design innovative data filtering strategies and enhance dataset quality.
  • Apply and integrate advanced methodologies such as self-supervised active learning to scale data systems.
  • Lead research projects to improve data quality and drive advancements in video generation models.

Qualifications

  • Education: Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.
  • Experience: 3+ years of experience in managing and curating large-scale datasets, particularly in fields like computer vision, NLP, robotics, or self-driving technologies.

Key Skills:

  • Strong proficiency in Python and familiarity with deep learning frameworks such as PyTorch.
  • Experience with large-scale data processing tools, such as SQL or Spark.
  • Hands-on expertise in designing and working with distributed systems.
  • Proven ability to thrive in a fast-paced, research-focused environment and deliver end-to-end project solutions.

Note: This position is not intended for recent graduates.

Compensation

The salary range for this role in California is $175,000–$250,000 per year. Actual base pay may vary based on factors such as job-related expertise, skills, experience, and candidate location. Additionally, we provide competitive equity packages through stock options and a comprehensive benefits plan.

Similar Jobs

Zazz - Data Engineer

Zazz

(Remote)
1 Month ago
Zuora - Senior Data Scientist

Zuora

Chennai, Tamil Nadu, India (Hybrid)
3 Months ago
Rackspace Technology - R-19462 Data Engineer III - VN

Rackspace Technology

Vietnam (Remote)
1 Month ago
Appier - Software Engineer, Machine Learning Platform

Appier

Taipei City, Taiwan (On-Site)
3 Months ago
Mobileum - Senior Software Quality Engineer

Mobileum

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Microsoft - Senior Data Science Manager

Microsoft

Hyderabad, Telangana, India (On-Site)
1 Month ago
Microsoft - Principal Product Manager, AI

Microsoft

Redmond, Washington, United States (On-Site)
1 Week ago
Krafton  - [Global Strategy & BD Div.] Strategy Manager(AI Ethics) (4년 ~ 7년)

Krafton

Seoul, South Korea (On-Site)
2 Months ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

Vienna, Vienna, Austria (Remote)
3 Weeks ago
Microsoft - Member of Technical Staff, Health AI

Microsoft

London, England, United Kingdom (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Match Group - Sr. Product Manager, Safety Experience

Match Group

San Francisco, California, United States (Hybrid)
4 Months ago
Whatnot - Software Engineer, Recommendation Systems

Whatnot

San Francisco, California, United States (Remote)
3 Months ago
Rackspace Technology - Data Solutions Director

Rackspace Technology

United States (Hybrid)
2 Months ago
Verve - Senior Backend Engineer (Java, Go)

Verve

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Epic Games - Principal Data Analyst, Ecosystem Economy & UGC

Epic Games

(On-Site)
1 Month ago
Netflix - Marketing Manager, Mexico

Netflix

Mexico City, Mexico City, Mexico (On-Site)
1 Month ago
Luxoft - Senior DevOps Engineer

Luxoft

Toronto, Ontario, Canada (On-Site)
2 Months ago
PwC - IN_Senior Associate _Java Developer _Data & Analytics _Advisory _PAN India

PwC

Kolkata, West Bengal, India (On-Site)
4 Months ago
Unity - Senior Software Developer

Unity

Vancouver, British Columbia, Canada (On-Site)
3 Months ago
Next Level Business Services - Hadoop Architect (Full Time)

Next Level Business Services

Groton, Connecticut, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Palo Alto, California, United States

The Walt Disney Company - Lead Data Engineer

The Walt Disney Company

Lake Buena Vista, Florida, United States (On-Site)
1 Week ago
Fabric - Applied Researcher, Cryptography Proof Systems

Fabric

New York, New York, United States (Remote)
4 Months ago
The Walt Disney Company - Photo Editor, Digital/Social

The Walt Disney Company

Washington, District Of Columbia, United States (Hybrid)
1 Month ago
Next Level Business Services - Product Development Manager

Next Level Business Services

Bentonville, Arkansas, United States (On-Site)
4 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Speech Understanding) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Hypixel Studios - Platform Engineering Manager

Hypixel Studios

Seattle, Washington, United States (Remote)
5 Months ago
Mattel  Inc  - Sr Manager, Brand Marketing (Action Figures)

Mattel Inc

California, United States (On-Site)
2 Months ago
Salesforce - Named Account Executive

Salesforce

Texas, United States (Remote)
1 Month ago
Match Group - Product Design Manager, Design Systems and Accessibility

Match Group

Palo Alto, California, United States (Hybrid)
4 Months ago
Flow - Senior/Staff Web Engineer

Flow

Miami, Florida, United States (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Netflix - Machine Learning Platform Product Manager, Training

Netflix

Los Gatos, California, United States (Hybrid)
1 Month ago
Nagarro - Principal Engineer, AI / ML

Nagarro

Sri Lanka (Remote)
4 Months ago
ByteDance - Cloud Native Engineer, ARK Large Model Platform (Singapore)

ByteDance

Singapore (On-Site)
3 Months ago
ByteDance - Research Scientist in Foundation Model, Speech & Audio Graduates - 2024 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Firstsource - AI Content Creation - Science & Technology

Firstsource

United States (Remote)
6 Months ago
Zoox - Sensor Software Developer

Zoox

Foster City, California, United States (On-Site)
4 Months ago
ByteDance - Research Scientist, Foundation Model, Vision

ByteDance

Singapore (On-Site)
3 Months ago
Match Group - Senior ML Platform Engineer

Match Group

New York, New York, United States (Hybrid)
4 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Generative AI) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

An idea-to-video platform that brings your creativity to motion.

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Pika

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug