Senior Research Engineer (Data)

2 Months ago • 3 Years + • Artificial Intelligence • Data Analyst • $175,000 PA - $250,000 PA

Job Summary

Job Description

This Senior Research Engineer (Data) role focuses on spearheading data acquisition and management systems for advanced AI research. Responsibilities include architecting and maintaining efficient data pipelines for sourcing, processing, and organizing large datasets used in generative AI models. The role requires partnering with research teams to improve model performance by identifying and leveraging novel data sources, developing robust data pipelines (including deduplication and filtering), collaborating with annotation teams to enhance dataset quality, applying advanced methodologies like self-supervised active learning, and leading research projects to improve data quality for video generation models. The ideal candidate will have 3+ years of experience managing large-scale datasets in fields like computer vision or NLP, strong Python and PyTorch skills, and experience with large-scale data processing tools like SQL or Spark.
Must have:
  • 3+ years experience managing large datasets
  • Strong Python & PyTorch proficiency
  • Experience with SQL or Spark
  • Expertise in designing distributed systems
  • Data pipeline development & maintenance
Perks:
  • Competitive equity packages
  • Comprehensive benefits plan

Job Details

We are seeking a Senior Software Engineer to spearhead our data acquisition and management systems, critical to our advanced AI research. In this role, you will architect and maintain efficient pipelines for sourcing, processing, and organizing the extensive datasets that fuel our generative AI models. Your expertise will have a direct and transformative impact on the quality and capabilities of our technology.

Responsibilities

  • Partner with research teams to understand and address model performance gaps by identifying and leveraging novel data sources.
  • Develop and implement robust data pipelines for acquisition, deduplication, filtering, and pre-training dataset preparation.
  • Collaborate with annotation operations teams to design innovative data filtering strategies and enhance dataset quality.
  • Apply and integrate advanced methodologies such as self-supervised active learning to scale data systems.
  • Lead research projects to improve data quality and drive advancements in video generation models.

Qualifications

  • Education: Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.
  • Experience: 3+ years of experience in managing and curating large-scale datasets, particularly in fields like computer vision, NLP, robotics, or self-driving technologies.

Key Skills:

  • Strong proficiency in Python and familiarity with deep learning frameworks such as PyTorch.
  • Experience with large-scale data processing tools, such as SQL or Spark.
  • Hands-on expertise in designing and working with distributed systems.
  • Proven ability to thrive in a fast-paced, research-focused environment and deliver end-to-end project solutions.

Note: This position is not intended for recent graduates.

Compensation

The salary range for this role in California is $175,000–$250,000 per year. Actual base pay may vary based on factors such as job-related expertise, skills, experience, and candidate location. Additionally, we provide competitive equity packages through stock options and a comprehensive benefits plan.

Similar Jobs

PwC - Power BI Developer| Senior Associate [tag01]

PwC

Barueri, São Paulo, Brazil (On-Site)
1 Month ago
The Walt Disney Company - Lead Data Scientist

The Walt Disney Company

Burbank, California, United States (On-Site)
3 Months ago
Homa games - Gaming Tech Java BackEnd Engineer

Homa games

(Remote)
1 Month ago
Appier - Senior Software Engineer, Data Backend(CrossX)

Appier

Taipei City, Taiwan (On-Site)
2 Months ago
Netflix - Software Engineer (L5), Content Engineering

Netflix

Los Gatos, California, United States (On-Site)
3 Months ago
Microsoft - AI Platform Engineer

Microsoft

Mountain View, California, United States (Hybrid)
2 Weeks ago
Axinous - Sr. Staff Machine Learning Engineer

Axinous

San Jose, California, United States (Hybrid)
1 Month ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
ByteDance - Software Development Engineer - Large Language Models, AML

ByteDance

San Jose, California, United States (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Zoox - Software Engineer - Simulation Workload Orchestration

Zoox

Foster City, California, United States (Hybrid)
3 Months ago
Warner Bros Games - Senior Manager, Analytics Engineering

Warner Bros Games

Hyderabad, Telangana, India (Hybrid)
3 Weeks ago
N-iX - Senior Data Engineer

N-iX

Ukraine (Remote)
2 Weeks ago
Saarthee - Talent Acquisition Executive

Saarthee

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
EXUSIA - Ab Initio Technical Leads ( US - H1B Visa Holders)

EXUSIA

India (Remote)
4 Months ago
N-iX - Middle Support Data Engineer (#2521)

N-iX

Ukraine (Remote)
2 Months ago
Nielsen Holdings - Senior Software Engineer ( Java , Python , SQL , AWS / Oracle)

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Acceldata - Resident Solutions Architect

Acceldata

United States (Remote)
3 Months ago
Keywords Studios (Player Support) - Software Data Engineer II

Keywords Studios (Player Support)

Pune, Maharashtra, India (Hybrid)
1 Month ago
Zuora - Data Scientist III

Zuora

Bengaluru, Karnataka, India (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Palo Alto, California, United States

Microsoft - Research Intern - Finetuning for Post Training Quantization

Microsoft

Redmond, Washington, United States (On-Site)
3 Weeks ago
DraftKings - Lottery Fulfillment Supervisor

DraftKings

North Andover, Massachusetts, United States (On-Site)
3 Weeks ago
On Location - Marketing Associate, B2C Marketing – FIFA World Cup 26™

On Location

New York, New York, United States (On-Site)
2 Days ago
Captions - Performance Marketing Manager (3+ years of experience)

Captions

New York, New York, United States (On-Site)
2 Months ago
Meta - Marketing Science Partner (Financial Services)

Meta

Los Angeles, California, United States (On-Site)
3 Months ago
Rackspace Technology - Engagement Manager

Rackspace Technology

United States (Remote)
1 Month ago
Saviynt - Technical Lead, Professional Services

Saviynt

Atlanta, Georgia, United States (Remote)
3 Months ago
Mashgin - Deployment Engineer - North Carolina

Mashgin

Charlotte, North Carolina, United States (Remote)
3 Months ago
Netflix - Lead Technical Game Designer, Games Innovation

Netflix

Los Gatos, California, United States (Hybrid)
1 Month ago
Zones - IT Hardware Technician

Zones

New York, New York, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Microsoft - Senior Software Engineer

Microsoft

Mountain View, California, United States (Remote)
2 Weeks ago
Microsoft - Machine Learning Engineer II

Microsoft

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
PwC - Manager_Conversational AI Developer_Advisory Corporate_Advisory_Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Microsoft - Member of Technical Staff, AI - Pre-Training Platform

Microsoft

London, England, United Kingdom (On-Site)
1 Month ago
Intel Corporation - Full Stack Software Developer & Machine Learning Engineer

Intel Corporation

San José, San José Province, Costa Rica (Hybrid)
2 Months ago
Microsoft - Data and Applied Scientist II

Microsoft

Hyderabad, Telangana, India (On-Site)
3 Weeks ago
Virtuos - R&D Machine Learning Engineer

Virtuos

China (On-Site)
1 Hour ago
Unity - Principal Machine Learning Engineer

Unity

San Francisco, California, United States (On-Site)
2 Months ago
Zoox - Senior/Staff Software Engineer, ML Performance Optimization

Zoox

Foster City, California, United States (On-Site)
3 Months ago
Google - Senior Software Engineer, Machine Learning, Google Cloud Compute

Google

Sunnyvale, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

An idea-to-video platform that brings your creativity to motion.

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Pika

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug