Research Engineer - Computer Vision ML

2 Months ago • All levels • $190,000 PA - $320,000 PA
Research Development

Job Description

Sesame is seeking a Research Engineer specializing in Computer Vision and Machine Learning to develop lifelike computer capabilities. The role focuses on vision understanding as a critical component of conversational AI, bridging speech with the physical world. Responsibilities include developing ML models for 3D computer vision problems, working across the ML stack (architectures, data handling, infrastructure, experimentation), and collaborating with hardware engineers for embedded deployment. The ideal candidate will independently tackle complex challenges, leverage literature, and create novel approaches for unique goals, contributing to next-generation wearables.
Good To Have:
  • Master's / Ph.D. desired.
  • Experience deploying models in products.
  • Experience in a startup environment.
  • Incorporate geometric/physical/structural priors into data-driven approaches.
Must Have:
  • Experience with autonomy in ambiguous environments.
  • Develop machine learning and computer vision models.
  • Familiar with state-of-the-art computer vision.
  • Proficient in deep learning frameworks (PyTorch/Jax).
  • Handle large-scale datasets (multi-camera).
  • Excellent communication and collaboration skills.
  • Bachelor's degree in CS, CV, Math, ML, or related.
Perks:
  • 401k matching
  • 100% employer-paid health, vision, and dental benefits
  • Unlimited PTO and sick time
  • Flexible spending account matching (medical FSA)

Add these skills to join the top 1% applicants for this job

communication
pytorch
deep-learning
computer-vision
machine-learning

About Sesame

Sesame believes in a future where computers are lifelike - with the ability to see, hear, and collaborate with us in ways that feel natural and human. With this vision, we're designing a new kind of computer, focused on making voice companions part of our daily lives. Our team brings together founders from Oculus and Ubiquity6, alongside proven leaders from Meta, Google, and Apple, with deep expertise spanning hardware and software. Join us in shaping a future where computers truly come alive.

About the Role

Vision understanding is a critical addition to conversational AI, bridging the gap between speech and the physical world. We’re looking for an engineer or researcher who lives at the intersection of 3D computer vision and machine learning. You’ll tackle problems ranging from gaze tracking to SLAM, embedding physical constraints (e..g. refraction, light transport) into data-driven models. Working cross-functionally with research, hardware, and product teams, you’ll turn cutting-edge vision techniques into features that power our next-generation wearables.

Responsibilities:

  • Contribute to the development of our ML models across various flavors of 3D computer vision problems.

  • Work across the ML stack, including model architectures, data capture, data curation, model evaluation, training & inference infrastructure, research, and experimentation.

  • Collaborate with firmware and hardware engineers to deploy models onto embedded devices.

  • Pick promising approaches from the literature to bet on, and create new approaches where necessary to achieve our unique goals.

Required Qualifications:

  • Experience working with a high degree of autonomy in ambiguous environments.

  • Proven experience in developing machine learning and computer vision models.

  • Familiar with state-of-the-art in computer vision.

  • Strong proficiency in deep learning frameworks such as PyTorch or Jax.

  • Familiarity with large-scale dataset handling, including multi-camera datasets.

  • Excellent communication skills and the ability to work collaboratively across disciplines.

  • Bachelor’s degree or higher in computer science, computer vision, applied mathematics, machine learning, or a related field.

Preferred Qualifications:

  • Master’s / Ph.D. desired.

  • Experience deploying models in products.

  • Experience in a startup environment.

  • Experience incorporating geometric, physical, and/or structural priors into data-driven approaches.

Sesame is committed to a workplace where everyone feels valued, respected, and empowered. We welcome all qualified applicants, embracing diversity in race, gender, identity, orientation, ability, and more. We provide reasonable accommodations for applicants with disabilities—contact careers@sesame.com for assistance.

Full-time Employee Benefits: 

  • 401k matching

  • 100% employer-paid health, vision, and dental benefits 

  • Unlimited PTO and sick time 

  • Flexible spending account matching (medical FSA) 

Benefits do not apply to contingent/contract workers

Set alerts for more jobs like Research Engineer - Computer Vision ML
Set alerts for new jobs by Sesame
Set alerts for new Research Development jobs in United States
Set alerts for new jobs in United States
Set alerts for Research Development (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙