Senior Research Engineer (Data)

2 Weeks ago • 3 Years + • Artificial Intelligence • Data Analyst

About the job

Job Description

This Senior Research Engineer (Data) role focuses on spearheading data acquisition and management systems for advanced AI research. Responsibilities include architecting and maintaining efficient data pipelines for sourcing, processing, and organizing large datasets used in generative AI models. The role requires partnering with research teams to improve model performance by identifying and leveraging novel data sources, developing robust data pipelines (including deduplication and filtering), collaborating with annotation teams to enhance dataset quality, applying advanced methodologies like self-supervised active learning, and leading research projects to improve data quality for video generation models. The ideal candidate will have 3+ years of experience managing large-scale datasets in fields like computer vision or NLP, strong Python and PyTorch skills, and experience with large-scale data processing tools like SQL or Spark.
Must have:
  • 3+ years experience managing large datasets
  • Strong Python & PyTorch proficiency
  • Experience with SQL or Spark
  • Expertise in designing distributed systems
  • Data pipeline development & maintenance
Perks:
  • Competitive equity packages
  • Comprehensive benefits plan

We are seeking a Senior Software Engineer to spearhead our data acquisition and management systems, critical to our advanced AI research. In this role, you will architect and maintain efficient pipelines for sourcing, processing, and organizing the extensive datasets that fuel our generative AI models. Your expertise will have a direct and transformative impact on the quality and capabilities of our technology.

Responsibilities

  • Partner with research teams to understand and address model performance gaps by identifying and leveraging novel data sources.
  • Develop and implement robust data pipelines for acquisition, deduplication, filtering, and pre-training dataset preparation.
  • Collaborate with annotation operations teams to design innovative data filtering strategies and enhance dataset quality.
  • Apply and integrate advanced methodologies such as self-supervised active learning to scale data systems.
  • Lead research projects to improve data quality and drive advancements in video generation models.

Qualifications

  • Education: Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.
  • Experience: 3+ years of experience in managing and curating large-scale datasets, particularly in fields like computer vision, NLP, robotics, or self-driving technologies.

Key Skills:

  • Strong proficiency in Python and familiarity with deep learning frameworks such as PyTorch.
  • Experience with large-scale data processing tools, such as SQL or Spark.
  • Hands-on expertise in designing and working with distributed systems.
  • Proven ability to thrive in a fast-paced, research-focused environment and deliver end-to-end project solutions.

Note: This position is not intended for recent graduates.

Compensation

The salary range for this role in California is $175,000–$250,000 per year. Actual base pay may vary based on factors such as job-related expertise, skills, experience, and candidate location. Additionally, we provide competitive equity packages through stock options and a comprehensive benefits plan.

View Full Job Description
$175.0K - $250.0K/yr (Outscal est.)
$212.5K/yr avg.
Palo Alto, California, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

An idea-to-video platform that brings your creativity to motion.

Palo Alto, California, United States (On-Site)

California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Pika

Similar Jobs

Paypal - Principal Machine Learning Engineer - AI

Paypal, United States (On-Site)

Hashlist - Senior Data Engineer

Hashlist, India (Hybrid)

Highspot - Sr. Account Manager (Strategic) - Remote

Highspot, United States (Remote)

Nielsen Holdings - Senior Full Stack Developer - Mumbai / Bangalore

Nielsen Holdings, India (Hybrid)

Intel Corporation - Full Stack Software Developer & Machine Learning Engineer

Intel Corporation, Costa Rica (Hybrid)

CharacterAI - Software Engineer, Core Product

CharacterAI, Canada (On-Site)

Stemuli - AI Engineer - Core Education, Seattle

Stemuli, United States (Hybrid)

SatSure - Senior Machine Learning Researcher

SatSure, India (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Oceaneering - Principal Data Scientist

Oceaneering, India (Hybrid)

Dream Game Studios - Senior Security Engineer - Security Operations

Dream Game Studios, India (On-Site)

Intel Corporation - Federal Proposal Manager

Intel Corporation, United States (Hybrid)

Luxoft - Data Analyst

Luxoft, India (On-Site)

Unity - Senior Data Developer

Unity, Canada (On-Site)

Razer - Solutions Architect

Razer, Singapore (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in Palo Alto, California, United States

Zynga - Lead Product Manager

Zynga, United States (On-Site)

Warner Bros Discovery - T&O Operating Model, Sr. Program Manager

Warner Bros Discovery, United States (On-Site)

Axinous - Account Executive, Enterprise

Axinous, United States (Remote)

Next Level Business Services - Documentum D2 Developer

Next Level Business Services, United States (On-Site)

Warner Bros Discovery - Compression Engineer

Warner Bros Discovery, United States (On-Site)

Redhorse Corp - Program Manager

Redhorse Corp, United States (On-Site)

Kokotree - Full Stack Developers

Kokotree, United States (On-Site)

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Get notifed when new similar jobs are uploaded