Staff Software Engineer, Machine Learning Infrastructure

4 Months ago • 8 Years + • Artificial Intelligence • $202,300 PA - $308,000 PA

Job Summary

Job Description

Thumbtack's Machine Learning Infrastructure team is seeking a Staff Software Engineer to build and improve their next-generation ML platform. Responsibilities include defining the technical vision and architecture, leading cross-functional initiatives, architecting critical ML infrastructure components (model serving, RAG systems), establishing technical standards, mentoring engineering teams, and partnering with leadership to align with business objectives. The ideal candidate possesses 8+ years of engineering experience with a focus on distributed systems, hands-on experience building ML infrastructure at scale, and expertise in at least one major programming language (Go/Python preferred).
Must have:
  • 8+ years engineering experience (distributed systems focus)
  • 4+ years building ML infrastructure at scale
  • Expertise in Go/Python
  • Strong architectural skills
  • ML workflow & operational challenge understanding
  • Mentoring and leadership experience
Good to have:
  • Experience with hundreds of production models
  • Expertise with PyTorch/TensorFlow and MLOps tools
  • Generative AI implementation experience
  • High-performing team building
  • Cloud-native architectures & AWS/GCP experience
  • Strategic technical decision-making
Perks:
  • Virtual-first working model
  • 20 company holidays
  • WiFi and cell phone reimbursements
  • Employee Assistance Program

Job Details

A home is the biggest investment most people make, and yet, it doesn’t come with a manual. That's why we’re building the only app homeowners need to effortlessly manage their homes —  knowing what to do, when to do it, and who to hire. With Thumbtack, millions of people care for what matters most, and pros earn billions of dollars through our platform. And as one of the fastest-growing companies in a $600B+ industry — we must be doing something right. 

We are driven by a common goal and the deep satisfaction that comes from knowing our work supports local economies, helps small businesses grow, and brings homeowners peace of mind. We’re seeking people who continually put our purpose first: advocating for pros and customers, embracing change, and choosing teamwork every day.

At Thumbtack, we're creating a new era of home care. If making an impact and the chance to do good inspires you, join us. Imagine what we’ll build together. 

Thumbtack by the Numbers

  • Available nationwide in every U.S. county
  • Over 85 million projects started on Thumbtack
  • More than 11 million 5-star reviews and counting
  • Pros earn billions on our platform
  • 1000+ employees 
  • $3.2 billion valuation (June, 2021) 

About the Machine Learning Infrastructure Team

At Thumbtack, we're solving complex technical challenges across search, ranking, recommendations, pricing optimization, and spam detection. Our ML Infrastructure team leads the architectural vision and implementation of enterprise-wide machine learning capabilities, enabling teams to effectively experiment with and deploy ML models at scale. We're building next-generation infrastructure that powers Thumbtack's AI-first future. For insights into our engineering challenges, visit our engineering blog.

Challenge 

As a Principal ML Infrastructure Engineer, you'll drive the technical vision and strategic direction of Thumbtack's machine learning platform. You'll architect solutions that democratize ML capabilities across the organization while establishing best practices and technical standards. Working closely with senior leadership, you'll shape our technical roadmap for generative AI adoption, feature platform evolution, and ML operational excellence.

Responsibilities

  • Define and drive the technical vision and architecture for Thumbtack's next-generation ML infrastructure
  •  Lead cross-functional initiatives spanning engineering, data science, and product teams to build scalable, enterprise-grade ML systems
  •  Architect and oversee implementation of critical ML infrastructure components including model serving systems and RAG systems that can scale. 
  •  Establish technical standards and best practices for ML engineering across the organization
  •  Mentor and provide technical leadership to engineering teams on ML infrastructure best practices
  •  Partner with senior leadership to align ML infrastructure capabilities with business objectives

What you’ll need

If you don't think you meet all of the criteria below but still are interested in the job, please apply. Nobody checks every box, and we're looking for someone excited to join the team.

  •  8+ years of engineering experience with significant focus on distributed systems
  •  4+ years of hands-on experience building ML infrastructure or ML platforms at scale
  •  Deep expertise in at least one major programming language; proficiency in our core stack (Go, Python) preferred
  •  Proven track record of technical leadership on complex, cross-functional projects
  •  Strong architectural skills with experience designing scalable, reliable distributed systems
  •  Deep understanding of ML workflows, common frameworks, and operational challenges
  •  Experience mentoring teams and driving engineering excellence
  •  Track record of making strategic technical decisions with organization-wide impact

Bonus points if you have

  •  Experience building AI platforms that support hundreds of models in production
  •  Deep expertise with modern ML frameworks (PyTorch, TensorFlow) and MLOps tools
  •  Experience implementing generative AI capabilities at enterprise scale
  •  Track record of building high-performing technical teams
  •  Expertise with cloud-native architectures and major cloud providers (AWS, GCP)
  •  Experience driving technical strategy at fast-growing technology companies

Thumbtack is a virtual-first company, meaning you can live and work from any one of our approved locations across the United States, Canada or the Philippines.* Learn more about our virtual-first working model here.

For candidates living in San Francisco / Bay Area, New York City, or Seattle metros, the expected salary range for the role is currently $238,000 - $308,000. Actual offered salaries will vary and will be based on various factors, such as calibrated job level, qualifications, skills, competencies, and proficiency for the role.

For candidates living in all other US locations, the expected salary range for this role is currently $202,300 - $261,800. Actual offered salaries will vary and will be based on various factors, such as calibrated job level, qualifications, skills, competencies, and proficiency for the role.

#LI-Remote

Benefits & Perks
  • Virtual-first working model coupled with in-person events
  • 20 company-wide holidays including a week-long end-of-year company shutdown
  • Library (optional use collaboration & connection hub) in San Francisco
  • WiFi reimbursements 
  • Cell phone reimbursements (North America) 
  • Employee Assistance Program for mental health and well-being 

Learn More About Us

Thumbtack embraces diversity. We are proud to be an equal opportunity workplace and do not discriminate on the basis of sex, race, color, age, pregnancy, sexual orientation, gender identity or expression, religion, national origin, ancestry, citizenship, marital status, military or veteran status, genetic information, disability status, or any other characteristic protected by federal, provincial, state, or local law. We also will consider for employment qualified applicants with arrest and conviction records, consistent with applicable law. 

Thumbtack is committed to working with and providing reasonable accommodation to individuals with disabilities. If you would like to request a reasonable accommodation for a medical condition or disability during any part of the application process, please contact: recruitingops@thumbtack.com

If you are a California resident, please review information regarding your rights under California privacy laws contained in Thumbtack’s Privacy policy available at https://www.thumbtack.com/privacy/ .

Similar Jobs

Meta - Research Intern, Computer Vision for Egocentric Representation Learning (PhD)

Meta

Redmond, Washington, United States (On-Site)
4 Months ago
Match Group - Staff Software Engineer, Machine Learning

Match Group

Palo Alto, California, United States (Hybrid)
5 Months ago
Tencent - Game Research & Development Intern, Engine Research

Tencent

Irvine, California, United States (On-Site)
1 Month ago
Canva - Senior Applied Scientist - AI Research

Canva

Surry Hills, New South Wales, Australia (Remote)
2 Weeks ago
ByteDance - Research Scientist - Multimodal Foundation Model - 2025 Start

ByteDance

Singapore (On-Site)
4 Months ago
Meta - Software Engineer, Machine Learning

Meta

Burlingame, California, United States (On-Site)
4 Months ago
ByteDance - Researcher - Large Language Models, Applied Machine Learning

ByteDance

Seattle, Washington, United States (On-Site)
1 Week ago
PlayStation Global - Senior Director, AI Governance

PlayStation Global

Aliso Viejo, California, United States (Hybrid)
1 Month ago
NVIDIA - Solutions Architect, Generative AI

NVIDIA

Canada (On-Site)
1 Month ago
Krafton  - Strategy Manager (AI Ethics)

Krafton

Seoul, South Korea (On-Site)
6 Days ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Seedify - AI Product Manager

Seedify

Philippines (Remote)
2 Months ago
DraftKings - Director of Data Science

DraftKings

Boston, Massachusetts, United States (On-Site)
2 Weeks ago
ByteDance - Applied Scientist Intern (Computational Modeling & Optimization)

ByteDance

San Jose, California, United States (On-Site)
1 Week ago
Ubisoft - Lead R&D Scientist

Ubisoft

Shanghai, Shanghai, China (On-Site)
6 Days ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
6 Days ago
CloudHire - ML Engineer

CloudHire

Telangana, India (Remote)
2 Weeks ago
NVIDIA - Senior Technical Marketing Engineer - AI Infrastructure

NVIDIA

Canada (On-Site)
1 Month ago
N-iX - Senior Data Scientist (#2665)

N-iX

Ukraine (Remote)
3 Months ago
ByteDance - Research Scientist, Code Generation

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Twitch - Sr. Applied Scientist

Twitch

San Francisco, California, United States (On-Site)
5 Days ago

Get notifed when new similar jobs are uploaded

Jobs in United States

Cyara - Account Executives (East)

Cyara

United States (Remote)
4 Months ago
Life church - Senior Program Manager

Life church

Edmond, Oklahoma, United States (On-Site)
5 Months ago
Canva - Business Development Representative

Canva

Austin, Texas, United States (Remote)
1 Month ago
Epic Games - Senior DevOps Programmer

Epic Games

Cary, North Carolina, United States (On-Site)
1 Month ago
Epic Games - Senior Technical Product Manager, UE Rendering

Epic Games

Cary, North Carolina, United States (On-Site)
1 Week ago
The Walt Disney Company - Lead Software Engineer

The Walt Disney Company

New York, New York, United States (On-Site)
1 Week ago
Nintendo - Manager, Communications Strategy - PDR

Nintendo

Redmond, Washington, United States (Hybrid)
8 Months ago
Next Level Business Services - Web SDLC

Next Level Business Services

Redmond, Washington, United States (On-Site)
5 Months ago
PlayStation Global - Creator Platform Planning Manager

PlayStation Global

United States (Hybrid)
6 Days ago
Gupta Media - Media Coordinator

Gupta Media

Boston, Massachusetts, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

AI Fund - Curriculum Product Manager

AI Fund

United States (Remote)
5 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
ByteDance - Software Development Engineer - Large Language Models, AML

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
DNEG - Head of Machine Learning

DNEG

London, England, United Kingdom (Remote)
1 Week ago
ByteDance - AI Security Researcher - Security - San Jose

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Pune, Maharashtra, India (Hybrid)
1 Month ago
Inkittt - Senior Machine Learning Engineer, Recommendations

Inkittt

San Francisco, California, United States (Hybrid)
2 Months ago
Codeninja - Graduate Trainee - AI/ML

Codeninja

Punjab, Pakistan (On-Site)
5 Days ago
Zoox - Director of Perception

Zoox

Foster City, California, United States (Hybrid)
5 Months ago
NVIDIA - Global Developer Relations Account Manager – Ansys

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded