Lead Machine Learning Infrastructure Engineer

3 Months ago • All levels • Devops • $185,500 PA - $293,750 PA

Job Summary

Job Description

As a Lead Machine Learning Infrastructure Engineer at Upwork, you will be responsible for designing and building systems to support machine learning models at scale. This includes designing, implementing, and optimizing distributed systems and infrastructure components, as well as developing tools and frameworks for the entire machine learning lifecycle. Collaboration with cross-functional teams and mentoring junior engineers are also key responsibilities. The role requires a deep understanding of ML workflows and a passion for innovation. Upwork offers a remote-first environment and provides comprehensive benefits to its employees.
Must have:
  • Expertise in designing and building scalable ML infrastructure.
  • Experience with distributed systems and cloud-based ML platforms.
  • Proficiency in programming languages like Python, Java, or Scala.
  • Understanding of ML workflows including data pipelines, model training, and deployment.
Perks:
  • Comprehensive medical insurance coverage for you and your family.
  • Unlimited paid time off.
  • 401(k) plan with matching contributions.
  • 12 weeks of paid parental leave.
  • Employee Stock Purchase Plan.

Job Details

Upwork ($UPWK) is the world’s work marketplace. We serve everyone from one-person startups to large, Fortune 100 enterprises with a powerful, trust-driven platform that enables companies and talent to work together in new ways that unlock their potential.

Last year, more than $3.8 billion of work was done through Upwork by skilled professionals who are gaining more control by finding work they are passionate about and innovating their careers.


The Machine Learning Infrastructure & Data team is responsible for architecting and building the foundational systems and tools that enable efficient development, deployment, and management of machine learning models at scale.

As a Lead Machine Learning Infrastructure Engineer, you will be pivotal in designing, developing, and maintaining robust and scalable infrastructure components to support Upwork’s machine learning initiatives. You will work closely with cross-functional teams—including machine learning researchers, data scientists, and software engineers—to build state-of-the-art platforms and tools that accelerate the development and deployment of machine learning models.

Responsibilities:

  • Design, implement, and optimize distributed systems and infrastructure components to support large-scale machine learning workflows, including data ingestion, feature engineering, model training, and serving.
  • Develop and maintain frameworks, libraries, and tools that streamline the end-to-end machine learning lifecycle, from data preparation and experimentation to model deployment and monitoring.
  • Architect and implement highly available, fault-tolerant, and secure systems that meet the performance and scalability requirements of production machine learning workloads.
  • Collaborate with machine learning researchers and data scientists to understand their requirements and translate them into scalable and efficient software solutions.
  • Stay current with advancements in machine learning infrastructure, distributed computing, and cloud technologies, integrating them into our platform to drive innovation.
  • Mentor junior engineers, conduct code reviews, and uphold engineering best practices to ensure the delivery of high-quality software solutions.

What it takes to catch our eye:

  • Strong technical expertise in designing and building scalable ML infrastructure.
  • Experience with distributed systems and cloud-based ML platforms.
  • Proficiency in programming languages such as Python, Java, or Scala.
  • Deep understanding of ML workflows, including data pipelines, model training, and deployment.
  • Passion for innovation and eagerness to implement the latest advancements in ML infrastructure.
  • Strong problem-solving skills and ability to optimize complex systems for performance and reliability.
  • Collaborative mindset with excellent communication skills to work across teams.
  • Ability to thrive in a fast-paced, dynamic environment with evolving technical challenges.

Come change how the world works.

At Upwork, you’ll shape talent solutions for how the world works today. We are a remote-first organization working together to create exciting remote work opportunities for a global community of professionals. While we have a physical office in Palo Alto, we currently hire full-time employees in 21 states in the United States.

At the core of our vibrant culture are shared values that form the foundation of our organization. These values revolve around trust, risk-taking, customer focus, and excellence. Our overarching mission is to create economic opportunities so that people have better lives. We foster an environment where individuals are encouraged to bring their authentic selves to work, nurturing personal and professional growth through development opportunities, mentorship programs, and participation in Upwork Belonging Communities.

We take pride in providing exceptional benefits to our employees. These include comprehensive medical insurance coverage for both you and your family, unlimited paid time off, a 401(k) plan with matching contributions, 12 weeks of paid parental leave, and an Employee Stock Purchase Plan. To explore these benefits in detail, as well as gain insights into our company values, working principles, and the overall employee experience, we invite you to visit our Life at Upwork page.

Check out our Careers page to learn more about the employee experience.

Upwork is proudly committed to recruiting and retaining a diverse and inclusive workforce. As an Equal Opportunity Employer, we never discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical condition), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

Additionally, a criminal background check may be run on a candidate after a conditional offer of employment is made. Qualified applicants with arrest or conviction records will be considered in accordance with applicable law, including the California Fair Chance Act and local Fair Chance ordinances.

The annual base salary range for this position  is displayed below. The range displayed reflects the minimum and maximum salary for this position, and individual base pay will depend on your skills, qualifications, experience, and location. Additionally, this position is eligible for the annual bonus plan or sales incentive plan and eligibility to participate in our long term equity incentive program.

Annual Base Compensation

$185,500 - $293,750 USD

To learn more about how Upwork processes and protects your personal information as part of the application process, please review our Global Job Applicant Privacy Notice

Similar Jobs

Canva - Senior Frontend Engineer - Developer Experience

Canva

Auckland, Auckland, New Zealand (Remote)
2 Months ago
Stake logic - Product Owner

Stake logic

Eindhoven, North Brabant, Netherlands (On-Site)
7 Months ago
Wolters Kluwer - Associate Director, Marketing

Wolters Kluwer

London, England, United Kingdom (On-Site)
3 Weeks ago
Plaid  - Experienced Software Engineer - Credit

Plaid

New York, United States (On-Site)
7 Months ago
Salesforce - Associate Manager (Events)

Salesforce

Mexico City, Mexico (On-Site)
2 Months ago
Safe security - Software Development Engineer III - Platform

Safe security

Bengaluru, Karnataka, India (On-Site)
3 Months ago
AppLovin - Backend Infrastructure Engineer, New Grad

AppLovin

Palo Alto, California, United States (On-Site)
3 Months ago
FICO - Site Reliability Engineering-Engineer II

FICO

Pune, Maharashtra, India (On-Site)
2 Months ago
Capgemini - SAP End to End Solution Architect

Capgemini

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Expedia - Technical Solutions Engineer

Expedia

Chicago, Illinois, United States (On-Site)
4 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

MiQ - Senior Vice President (SVP) of Product

MiQ

New York, New York, United States (Hybrid)
3 Months ago
Capgemini - Agile Coach

Capgemini

Bengaluru, Karnataka, India (On-Site)
3 Months ago
DraftKings - Account Management Operations Associate

DraftKings

Boston, Massachusetts, United States (On-Site)
3 Weeks ago
hogarth - Art Director

hogarth

Chennai, Tamil Nadu, India (On-Site)
2 Weeks ago
endava - Java Design Lead

endava

Buenos Aires, Buenos Aires, Argentina (On-Site)
1 Month ago
Games talent (Staffing and recruiting) - Senior Data Scientist

Games talent (Staffing and recruiting)

(Remote)
3 Months ago
Optiv - Federal Client Director

Optiv

Tampa, Florida, United States (Remote)
3 Months ago
JDA - Sr Project Manager

JDA

Barcelona, Catalonia, Spain (On-Site)
2 Months ago
Yahoo - Principal Product Designer, Partnerships

Yahoo

United States (Remote)
1 Month ago
Motorola solutions - Manager of Sales Operations & Inside Sales

Motorola solutions

San Diego, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Worldwide

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

LeoVegas - DevOps Engineer

LeoVegas

Denver, Colorado, United States (On-Site)
6 Months ago
Remedy Entertainment Plc - Senior/Lead DevOps Engineer

Remedy Entertainment Plc

Helsinki, Uusimaa, Finland (Hybrid)
5 Months ago
bytedance - Site Reliability Engineer - AML

bytedance

San Jose, California, United States (On-Site)
9 Months ago
Google - Senior Software Engineer, Infrastructure, Google Cloud Compute Infrastructure

Google

Seattle, Washington, United States (On-Site)
3 Months ago
Intel  - Sr. Infrastructure Engineer - Windows OS

Intel

Hillsboro, Oregon, United States (On-Site)
2 Months ago
Ziff Davis - Customer Solution Architect

Ziff Davis

United States (Remote)
1 Month ago
Ansys - Lead DevOps Engineer

Ansys

Waterloo, Ontario, Canada (On-Site)
1 Month ago
Hawkeye Innovations - DevOps Tech Lead

Hawkeye Innovations

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
bytedance - Software Engineer Intern (AI Platform)

bytedance

San Jose, California, United States (On-Site)
4 Months ago
binance - Mobile Developer - Cloud Team

binance

Taipei City, Taiwan (Remote)
11 Months ago

Get notifed when new similar jobs are uploaded