Staff Software Engineer, Machine Learning Infrastructure

4 Months ago • 8 Years + • Artificial Intelligence

Job Summary

Job Description

As a Staff Software Engineer on the Machine Learning Infrastructure team at Thumbtack, you will contribute to the design, implementation, and maintenance of scalable ML systems. Responsibilities include defining and driving the technical vision for Thumbtack's next-generation ML infrastructure, leading cross-functional initiatives, architecting critical ML infrastructure components (model serving and RAG systems), establishing technical standards and best practices, mentoring engineering teams, and partnering with senior leadership to align ML capabilities with business objectives. The role involves working with technologies like Go, Python, and modern ML frameworks (PyTorch, TensorFlow).
Must have:
  • 8+ years of engineering experience in distributed systems
  • 4+ years building ML infrastructure at scale
  • Expertise in Go or Python
  • Strong architectural skills
  • Experience mentoring teams
  • Deep understanding of ML workflows
Good to have:
  • Experience with hundreds of production models
  • Expertise with PyTorch/TensorFlow and MLOps tools
  • Generative AI implementation experience
  • High-performing team building experience
  • Cloud-native architectures expertise (AWS, GCP)
  • Experience in fast-growing tech companies
Perks:
  • Virtual-first working model
  • 20 company holidays
  • WiFi reimbursement
  • Cell phone reimbursement
  • Employee Assistance Program

Job Details

A home is the biggest investment most people make, and yet, it doesn’t come with a manual. That's why we’re building the only app homeowners need to effortlessly manage their homes —  knowing what to do, when to do it, and who to hire. With Thumbtack, millions of people care for what matters most, and pros earn billions of dollars through our platform. And as one of the fastest-growing companies in a $600B+ industry — we must be doing something right. 

We are driven by a common goal and the deep satisfaction that comes from knowing our work supports local economies, helps small businesses grow, and brings homeowners peace of mind. We’re seeking people who continually put our purpose first: advocating for pros and customers, embracing change, and choosing teamwork every day.

At Thumbtack, we're creating a new era of home care. If making an impact and the chance to do good inspires you, join us. Imagine what we’ll build together. 

Thumbtack by the Numbers

  • Available nationwide in every U.S. county
  • Over 85 million projects started on Thumbtack
  • More than 11 million 5-star reviews and counting
  • Pros earn billions on our platform
  • 1000+ employees 
  • $3.2 billion valuation (June, 2021) 

About the Machine Learning Infrastructure Team

At Thumbtack, we're solving complex technical challenges across search, ranking, recommendations, pricing optimization, and spam detection. Our ML Infrastructure team leads the architectural vision and implementation of enterprise-wide machine learning capabilities, enabling teams to effectively experiment with and deploy ML models at scale. We're building next-generation infrastructure that powers Thumbtack's AI-first future. For insights into our engineering challenges, visit our engineering blog.

Challenge 

As a Principal ML Infrastructure Engineer, you'll drive the technical vision and strategic direction of Thumbtack's machine learning platform. You'll architect solutions that democratize ML capabilities across the organization while establishing best practices and technical standards. Working closely with senior leadership, you'll shape our technical roadmap for generative AI adoption, feature platform evolution, and ML operational excellence.

Responsibilities

  • Define and drive the technical vision and architecture for Thumbtack's next-generation ML infrastructure
  •  Lead cross-functional initiatives spanning engineering, data science, and product teams to build scalable, enterprise-grade ML systems
  •  Architect and oversee implementation of critical ML infrastructure components including model serving systems and RAG systems that can scale. 
  •  Establish technical standards and best practices for ML engineering across the organization
  •  Mentor and provide technical leadership to engineering teams on ML infrastructure best practices
  •  Partner with senior leadership to align ML infrastructure capabilities with business objectives

What you’ll need

If you don't think you meet all of the criteria below but still are interested in the job, please apply. Nobody checks every box, and we're looking for someone excited to join the team.

  •  8+ years of engineering experience with significant focus on distributed systems
  •  4+ years of hands-on experience building ML infrastructure or ML platforms at scale
  •  Deep expertise in at least one major programming language; proficiency in our core stack (Go, Python) preferred
  •  Proven track record of technical leadership on complex, cross-functional projects
  •  Strong architectural skills with experience designing scalable, reliable distributed systems
  •  Deep understanding of ML workflows, common frameworks, and operational challenges
  •  Experience mentoring teams and driving engineering excellence
  •  Track record of making strategic technical decisions with organization-wide impact

Bonus points if you have

  •  Experience building AI platforms that support hundreds of models in production
  •  Deep expertise with modern ML frameworks (PyTorch, TensorFlow) and MLOps tools
  •  Experience implementing generative AI capabilities at enterprise scale
  •  Track record of building high-performing technical teams
  •  Expertise with cloud-native architectures and major cloud providers (AWS, GCP)
  •  Experience driving technical strategy at fast-growing technology companies

Thumbtack is a virtual-first company, meaning you can live and work from any one of our approved locations across the United States, Canada or the Philippines.* Learn more about our virtual-first working model here.

#LI-Remote

Benefits & Perks
  • Virtual-first working model coupled with in-person events
  • 20 company-wide holidays including a week-long end-of-year company shutdown
  • Library (optional use collaboration & connection hub) in San Francisco
  • WiFi reimbursements 
  • Cell phone reimbursements (North America) 
  • Employee Assistance Program for mental health and well-being 

Learn More About Us

Thumbtack embraces diversity. We are proud to be an equal opportunity workplace and do not discriminate on the basis of sex, race, color, age, pregnancy, sexual orientation, gender identity or expression, religion, national origin, ancestry, citizenship, marital status, military or veteran status, genetic information, disability status, or any other characteristic protected by federal, provincial, state, or local law. We also will consider for employment qualified applicants with arrest and conviction records, consistent with applicable law. 

Thumbtack is committed to working with and providing reasonable accommodation to individuals with disabilities. If you would like to request a reasonable accommodation for a medical condition or disability during any part of the application process, please contact: recruitingops@thumbtack.com

If you are a California resident, please review information regarding your rights under California privacy laws contained in Thumbtack’s Privacy policy available at https://www.thumbtack.com/privacy/ .

Similar Jobs

Meta - Research Scientist, Computer Vision for Generative AI (PhD)

Meta

New York, New York, United States (On-Site)
4 Months ago
Match Group - Sr. Software Engineer, Generative AI

Match Group

Palo Alto, California, United States (Hybrid)
5 Months ago
ByteDance - Lead Research Scientist, Foundation Model, Speech & Audio

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Trendyol - Data Science Professionals (Dolap)

Trendyol

İstanbul, İstanbul, Türkiye (Hybrid)
3 Months ago
Seedify - AI Product Manager (Web3 Games)

Seedify

Philippines (Remote)
1 Week ago
Meta - AI Research Scientist, Language - Generative AI

Meta

Bellevue, Washington, United States (On-Site)
4 Months ago
Nextbrain - Computer Vision Engineer

Nextbrain

Bengaluru, Karnataka, India (On-Site)
4 Months ago
HP - Machine Learning Engineer

HP

Palo Alto, California, United States (On-Site)
6 Months ago
Virtuos - R&D Machine Learning Engineer

Virtuos

China (On-Site)
1 Week ago
Social Discovery Group - Senior NLP Engineer

Social Discovery Group

Serbia (Remote)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Senior Software Engineer - Distributed Inference

NVIDIA

California, United States (Remote)
2 Weeks ago
ByteDance - Research Scientist, Foundation Model, Music Intelligence

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
ByteDance - AI Security Researcher - Security - San Jose

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
PlayStation Global - Staff Machine Learning Engineer, Anomaly Detection

PlayStation Global

London, England, United Kingdom (Hybrid)
4 Months ago
CloudHire - ML Engineer

CloudHire

Telangana, India (Remote)
2 Weeks ago
Egnyte - Machine Learning Engineer - AI

Egnyte

India (Remote)
6 Days ago
Tencent - Game Research & Development Intern, Engine Research

Tencent

Los Angeles, California, United States (On-Site)
9 Months ago
ByteDance - Senior Research Scientist, Foundation Model, Speech Understanding

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
ByteDance - Senior Research Scientist, Foundation Model, Speech Understanding

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Meta - Research Scientist, Computer Vision for Generative AI (PhD)

Meta

New York, New York, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Ontario, Canada

Gamemode One  Inc  - QA Tester

Gamemode One Inc

Halifax, Nova Scotia, Canada (Hybrid)
5 Days ago
Intrepid Studios,  Inc  - Helpdesk Support Technician

Intrepid Studios, Inc

Canada (On-Site)
7 Months ago
NVIDIA - Senior Emulation Power Engineer

NVIDIA

Canada (On-Site)
1 Month ago
Epic Games - Senior Application Programmer

Epic Games

Montreal, Quebec, Canada (On-Site)
1 Week ago
Ubisoft - Data Product Manager

Ubisoft

Montreal, Quebec, Canada (On-Site)
6 Days ago
PwC - PwC Private, High Net Worth Tax, Manager

PwC

Toronto, Ontario, Canada (On-Site)
6 Months ago
Rockstar Games - Senior Animation Systems Programmer

Rockstar Games

Oakville, Ontario, Canada (On-Site)
6 Days ago
Maxis Studios - Core Software Engineer

Maxis Studios

Vancouver, British Columbia, Canada (On-Site)
2 Months ago
NVIDIA - Mixed Signal Design Engineer

NVIDIA

Canada (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

NVIDIA - Engineering Manager, AI Developer Technology

NVIDIA

Westford, Massachusetts, United States (On-Site)
3 Weeks ago
Wargaming - Gen AI Business Development Manager

Wargaming

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Flutter Entertainment - Lead Data Scientist

Flutter Entertainment

Hyderabad, Telangana, India (Hybrid)
4 Months ago
NVIDIA - Senior Field Application Engineer

NVIDIA

Westford, Massachusetts, United States (On-Site)
1 Month ago
Egnyte - Senior Product Manager

Egnyte

Mountain View, California, United States (Remote)
4 Months ago
Pika - Summer Research Internship

Pika

Palo Alto, California, United States (On-Site)
1 Month ago
Scale AI - Software Engineer - New Grad

Scale AI

San Francisco, California, United States (On-Site)
5 Months ago
Sinch - Mid-level Fullstack Developer

Sinch

São Paulo, State Of São Paulo, Brazil (Remote)
5 Days ago
Zoox - Software Engineer - Simulaton Scenario Automation

Zoox

Seattle, Washington, United States (Hybrid)
5 Months ago

Get notifed when new similar jobs are uploaded