Lead Machine Learning Engineer/Scientist, Algorithms and Research

1 Month ago • All levels • Research Development • $175,500 PA - $277,500 PA

Job Summary

Job Description

Upwork is seeking a Lead Machine Learning Engineer/Scientist to join their Algorithms and Research Team. This role involves building foundational models and reinforcement learning (RL) systems for Upwork's platform, including Search & Recommendations and their AI assistant, Uma. The primary focus will be on designing, developing, and deploying cutting-edge RL systems for reasoning and planning, integrating RAG with advanced RL-based planning methods. Responsibilities include designing and implementing RL approaches using techniques like Monte Carlo Tree Search (MCTS) and AlphaGo-like algorithms, integrating knowledge graphs and vector databases, and collaborating with trust and safety teams on explainability. The engineer will also mentor junior members, conduct code reviews, and drive the transition of RL models from research to production, ensuring performance, scalability, and robustness.
Must have:
  • Design and deploy RL systems (MCTS, policy/value frameworks)
  • Experience with vector databases, graph databases, GNNs in RL
  • Familiarity with search, recommendation, decision-making systems
  • Passion for intelligent agent systems with LLMs
Good to have:
  • Experience with Monte Carlo Tree Search (MCTS)
  • Experience with AlphaGo-like planning algorithms
  • Integration of knowledge graphs and vector databases
  • Collaboration with trust and safety teams on explainability
Perks:
  • Comprehensive medical coverage
  • Unlimited Paid Time Off (PTO)
  • 401(k) plan with matching
  • 12 weeks paid parental leave
  • Employee Stock Purchase Plan

Job Details

Upwork ($UPWK) is the world’s largest work marketplace, connecting businesses with highly skilled professionals worldwide. From entrepreneurs to Fortune 100 enterprises, companies trust Upwork’s platform to access expert talent, leverage AI-powered work solutions, and drive meaningful business outcomes.

Upwork’s AI-powered platform has facilitated over $20 billion in economic opportunity for professionals worldwide. With professionals spanning 10,000+ skills, including AI and machine learning, software development, sales and marketing, customer support, finance and accounting, and more, Upwork empowers businesses of all sizes to scale, innovate, and build agile teams.


We're looking for a Lead Machine Learning Engineer/Scientist to join our Algorithms and Research Team within the ML & AI organization. This team builds foundational models and reinforcement learning (RL) systems that power Upwork’s vertical teams, including Search & Recommendations and Uma, our AI assistant. In this role, you'll focus on designing, developing, and deploying cutting-edge RL systems for reasoning and planning.

You'll contribute to building scalable decision-making and information retrieval systems utilizing structured and unstructured data, and drive innovation through RAG integration combined with advanced RL-based planning  methods.

You’ll partner closely with engineering, research, and trust and safety teams to transition reinforcement learning models from research prototypes into robust, production-ready solutions, enhancing the performance and reliability of Upwork’s evolving platform.

Responsibilities

  • Design and implement reinforcement learning approaches for reasoning & planning, e.g. leveraging Monte Carlo Tree Search (MCTS) and AlphaGo-like planning algorithms.
  • Integrate knowledge graphs and vector databases to enhance context-awareness and decision-making capabilities of RL systems.
  • Collaborate with trust and safety stakeholders to investigate explainability and interpretability in reinforcement learning systems, minimizing risks and unintended behaviors.
  • Lead cross-functional efforts in advancing RL models from research stages to production, ensuring optimal performance, scalability, and robustness.
  • Stay at the forefront of reinforcement learning advancements and champion integration of promising RL innovations into Upwork’s production environment.
  • Mentor junior engineers, conduct detailed code reviews, and reinforce best practices in RL and software engineering.
  • Deliver high-quality, measurable outcomes aligned with team and organizational strategic goals.

What it takes to catch our eye

  • Proven track record designing and deploying reinforcement learning systems, especially using Monte Carlo Tree Search (MCTS) and policy/value network frameworks.
  • Deep understanding and practical experience with vector databases, graph databases, graph neural networks (GNNs), and their applications in RL.
  • Familiarity with search, recommendation, and decision-making systems within complex, two-sided marketplaces.
  • Passion for developing intelligent, autonomous agent systems utilizing advanced reinforcement learning paradigms and large language models (LLMs).

Come change how the world works.

At Upwork, you’ll shape the future of work for a global, remote-first workforce, creating economic opportunities for professionals worldwide. While we have a physical office in Palo Alto, we currently hire full-time employees in 21 U.S. states, making it easier than ever to join our mission from wherever you call home.

Our culture is built on trust, risk-taking, customer focus, and excellence, all in service of our core mission: to create economic opportunities so people have better lives. We embrace authenticity and inclusion, encouraging everyone to bring their whole selves to work. Personal and professional growth is a priority here, supported through development programs, mentorship, and our Upwork Belonging Communities.

We’re proud to offer benefits that go beyond the basics, including comprehensive medical coverage for you and your family, unlimited PTO, a 401(k) plan with matching, 12 weeks of paid parental leave, and an Employee Stock Purchase Plan. Visit our Life at Upwork page to learn more about our values, working principles, and the overall employee experience.

Ready to help shape the future of work? Check out our Careers page and follow us on LinkedIn, Facebook, Instagram, TikTok, and X to learn more about life at Upwork.

Upwork is an Equal Opportunity Employer committed to recruiting and retaining a diverse and inclusive workforce. We do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, or other legally protected characteristics under federal, state, or local law.

Please note that a criminal background check may be required once a conditional job offer is made. Qualified applicants with arrest or conviction records will be considered in accordance with applicable law, including the California Fair Chance Act and local Fair Chance ordinances.

The annual base salary range for this position  is displayed below. The range displayed reflects the minimum and maximum salary for this position, and individual base pay will depend on your skills, qualifications, experience, and location. Additionally, this position is eligible for the annual bonus plan or sales incentive plan and eligibility to participate in our long term equity incentive program.

Annual Base Compensation

$175,500 - $277,500 USD

To learn more about how Upwork processes and protects your personal information as part of the application process, please review our Global Job Applicant Privacy Notice

Similar Jobs

truecaller - Lead Product Designer

truecaller

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Western Digital - Intern - Digital Solutions Engineer (Studying Bachelor Degree)

Western Digital

Prachin Buri, Thailand (On-Site)
2 Weeks ago
Zelis  - Senior Director, Automation Engineering

Zelis

Morristown, New Jersey, United States (Hybrid)
1 Year ago
welevel  - Senior AI Gameplay Programmer

welevel

Munich, Bavaria, Germany (On-Site)
1 Week ago
Scopely - Senior 2D VFX Artist

Scopely

United Kingdom (Remote)
9 Months ago
Luma - Research Scientist

Luma

Palo Alto, California, United States (Hybrid)
5 Months ago
Apple - Senior Machine Learning Research Engineer (Computer Vision)

Apple

Seattle, Washington, United States (On-Site)
2 Months ago
Runway - Engineering Manager, Machine Learning

Runway

(Remote)
2 Weeks ago
Apple - Software Engineer, Machine Learning

Apple

Cupertino, California, United States (On-Site)
2 Months ago
Keywords Studios - Technical Research Associate - AI

Keywords Studios

(Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Critical mass - Vice President, Marketing Science

Critical mass

New York, United States (On-Site)
4 Weeks ago
Animoca Brands - Digital Asset Operations Associate

Animoca Brands

Hong Kong (On-Site)
6 Months ago
Trailmix - Senior Data Engineer

Trailmix

London, England, United Kingdom (Hybrid)
3 Weeks ago
cyara - Sales Operations Analyst – Data

cyara

Hyderabad, Telangana, India (Hybrid)
8 Months ago
Canva - Backend Software Engineer

Canva

Auckland, Auckland, New Zealand (Remote)
2 Months ago
Innerspace  - Python Developer

Innerspace

Innsbruck, Tyrol, Austria (On-Site)
2 Months ago
NCR Atleos - PS Software Engineer III, Financial Services

NCR Atleos

Hyderabad, Telangana, India (Hybrid)
3 Weeks ago
Roblox - Developer Engagement Representative - Vietnam

Roblox

Vietnam (Remote)
1 Month ago
Stake logic - Product Owner

Stake logic

Eindhoven, North Brabant, Netherlands (On-Site)
6 Months ago
Ness - Experienced Technical Support Engineer with German

Ness

Košice, Košice Region, Slovakia (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in United States

Roblox - Operations Analyst, Critical Harms

Roblox

San Mateo, California, United States (Hybrid)
1 Week ago
Snorkel AI - Staff Software Engineer  — AI Platform

Snorkel AI

San Francisco, California, United States (Hybrid)
3 Months ago
Highspot - Associate Engineer Internship

Highspot

Seattle, Washington, United States (Hybrid)
3 Months ago
Apple - Safety System Product Design Engineer

Apple

Sunnyvale, California, United States (On-Site)
2 Months ago
PermitFlow - Mid-Market Account Executive

PermitFlow

New York, United States (Hybrid)
2 Weeks ago
FlockSafety - Regional Account Executive

FlockSafety

Phoenix, Arizona, United States (Remote)
3 Weeks ago
Apple - ML Infrastructure Software Engineer

Apple

Austin, Texas, United States (On-Site)
3 Weeks ago
HCL Tech - Solution Architect with Teamcenter Functional, Teamcenter Support, TC Admin Modules

HCL Tech

Florida, United States (On-Site)
2 Months ago
Semgrep - Head of Customer Success

Semgrep

San Francisco, California, United States (Hybrid)
2 Weeks ago
Lilt - Technical Support Engineer

Lilt

United States (Remote)
1 Week ago

Get notifed when new similar jobs are uploaded

Research Development Jobs

Qualcomm - Machine Learning Technology Engineer

Qualcomm

San Diego, California, United States (On-Site)
1 Month ago
PhonePe - Decision Scientist

PhonePe

Bengaluru, Karnataka, India (On-Site)
4 Weeks ago
Marvell - Firmware Developer (Linux, Security, Crypto, HSM, AI, TPM)

Marvell

Bengaluru, Karnataka, India (On-Site)
1 Year ago
Vigaet - Market Research and Sales Intern

Vigaet

(Remote)
8 Months ago
Apple - Physical Design Engineer, Machine Learning

Apple

Sunnyvale, California, United States (On-Site)
2 Months ago
Apple - Machine Learning Applied Research Scientist

Apple

Cupertino, California, United States (On-Site)
2 Months ago
Apple - AIML Triage and Diagnostic Tooling Engineer, AIML Integration and Delivery

Apple

Seattle, Washington, United States (On-Site)
2 Months ago
world resource institute - Consultant as User Research Analyst

world resource institute

Jakarta, Indonesia (On-Site)
1 Week ago
Instawork - ML Engineer

Instawork

San Francisco, California, United States (On-Site)
2 Weeks ago
Philips - Senior Manager; Development Engineering - Transducer R&D

Philips

Reedsville, Pennsylvania, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded