Senior Machine Learning Engineer - Machine Learning Infrastructure

3 Months ago • 3 Years + • Devops

Job Summary

Job Description

As a Senior Machine Learning Engineer at Flip.shop, you will design, build, and optimize the infrastructure for machine learning systems. This includes ensuring the efficient deployment, scaling, and monitoring of machine learning models. You will be responsible for developing scalable, production-level systems that support real-time recommendations and drive business growth. The role involves designing and implementing infrastructure for deploying and maintaining machine learning models, optimizing model training and serving, building tools for automation, optimizing performance, collaborating with various teams, and ensuring security and compliance. You will be working with cutting-edge infrastructure that powers personalized shopping experiences for millions of users, contributing directly to scaling our machine learning systems.
Must have:
  • Experience in building scalable systems with 3+ years of experience.
  • Proficiency in one or two programming languages (C/C++, Golang).
  • Solid understanding of GPU hardware architecture.
  • Experience in deep model inference/training, debugging, and tuning.
  • Familiarity with mainstream machine learning frameworks (e.g., TensorFlow, PyTorch, MxNet).
Good to have:
  • Familiarity with MLOps practices.
  • Experience with big data frameworks (e.g., Spark, Hadoop, Flink).
  • Experience in using or designing open-source machine learning lifecycle management systems like TFX.

Job Details

Senior Machine Learning Engineer - Machine Learning Infrastructure
Location: based in NYC or US remote

Welcome to Flip.shop, where innovation meets the social commerce revolution! Fresh off our Series C funding round, we've raised $144 million, propelling our valuation to an impressive $1.05 billion. We’re redefining the shopping experience by giving consumers a voice in a space dominated by tech giants. Join us on this exhilarating journey where your technical skills will play a pivotal role in shaping the future of social commerce!

Why Join Us?
At Flip.shop, you’ll be at the forefront of innovation in social commerce. This isn’t just a job—it’s a chance to build infrastructure that empowers our AI-driven platform to scale and deliver personalized shopping experiences. You will have the opportunity to directly partner, work with and learn from the very best engineers and scientists who joined us from some of the leading big-tech companies! 
If you thrive in a fast-paced, collaborative environment where you can develop high-performance systems, we want to hear from you!

Role Overview:
We are seeking a Senior Machine Learning Engineer - Machine Learning Infrastructure to design, build, and optimize the infrastructure that powers our machine learning systems. You’ll ensure the efficient deployment, scaling, and monitoring of machine learning models, and will help streamline the development lifecycle. This role offers the opportunity to create scalable, production-level systems that support real-time recommendations and drive business growth.

Responsibilities:

    • Infrastructure Development: Design and implement scalable infrastructure for deploying, monitoring, and maintaining machine learning models in production environments. Design and implement machine learning systems for feeds, ads, and search ranking models.
    • Training Infrastructure: Optimize the serving and training infrastructure of machine learning models.
    • Model Training: Enhance the workflow for model training and serving, data pipelines, storage systems, and resource management within multi-tenancy machine learning systems.
    • Tooling & Automation: Build tools to automate workflows for model training, testing, and deployment, ensuring that machine learning models can move quickly from development to production.
    • Performance Optimization: Ensure the infrastructure supports high-performance model inference at scale, with a focus on minimizing latency and maximizing throughput.
    • Collaboration: Work closely with data scientists, machine learning engineers, and DevOps teams to create seamless integration between development and production environments.
    • Monitoring & Maintenance: Build robust monitoring systems to track model performance and infrastructure health, ensuring reliability and uptime of machine learning services.
    • Security & Compliance: Implement best practices in infrastructure security, data privacy, and compliance, particularly when handling sensitive user data.

Requirements:

    • Education: Bachelor's degree or higher in Computer Science or a related field, with 3+ years of experience in building scalable systems.
    • Technical Skills: Proficiency in one or two programming languages (C/C++, Golang) within a Linux environment.
    • Solid understanding of GPU hardware architecture, GPU software stack (CUDA, cuDNN), and experience in GPU performance analysis.
    • Experience in deep model inference/training, debugging, and tuning.
    • ML Workflow Knowledge: Familiarity with mainstream machine learning frameworks (e.g., TensorFlow, PyTorch, MxNet).
    • Familiarity with MLOps practices.
    • Experience with big data frameworks (e.g., Spark, Hadoop, Flink) and resource management and task scheduling for large-scale distributed systems.
    • Open-source: Experience in using or designing open-source machine learning lifecycle management systems like TFX.

Key Skills

    • Excellent logical analysis and problem-solving skills with the ability to abstract and decompose complex business logic.
    • Strong sense of responsibility, good learning ability, communication skills, and self-motivation, with the ability to respond and act quickly.
    • Good working document habits, with timely writing and updating of workflow and technical documentation.
Why You’ll Love Working Here:
At Flip.shop, you’ll have the opportunity to build the backbone of our AI-driven platform, working on cutting-edge infrastructure that powers personalized shopping experiences for millions of users. Your work will directly contribute to scaling our machine learning systems, ensuring they run efficiently in a high-performance production environment. This is your chance to have a lasting impact and help Flip.shop shape the future of social commerce.

Ready to Build the Future?
If you're passionate about building scalable infrastructure and driving innovation in machine learning at scale, join us at Flip.shop! Let’s redefine the future of online shopping together.

Compensation & Benefits:
Base salary and total compensation will vary based on factors including but not limited to location, experience, and performance. Please note the base salary is just one component of the company’s total rewards package for exempt employees. Other rewards may include equity, bonuses, long term incentives, a PTO policy, and other progressive benefits.

Similar Jobs

hogarth - Account Director/ Shopper Lead

hogarth

Tokyo, Japan (On-Site)
2 Months ago
eBay - Staff Program Manager, Design Systems

eBay

Portland, Oregon, United States (Hybrid)
1 Year ago
Ubisoft - Technical Designer

Ubisoft

Taguig, Metro Manila, Philippines (On-Site)
7 Months ago
Illumina - Senior Business Process Analyst

Illumina

Bengaluru, Karnataka, India (On-Site)
1 Month ago
ISG - Consulting Manager in Benchmarking & Sourcing Strategy

ISG

Boulogne-Billancourt, Île-de-France, France (Hybrid)
1 Month ago
Aera Technology - Senior Infrastructure Platform Engineer

Aera Technology

Mountain View, California, United States (Hybrid)
1 Year ago
Regent craft - Senior Software Infrastructure Engineer

Regent craft

North Kingstown, Rhode Island, United States (On-Site)
1 Month ago
London stock Exchange - Senior Engineer, Site Reliability Engineering

London stock Exchange

Colombo, Western Province, Sri Lanka (Hybrid)
1 Month ago
Aspire - Senior Software Architect

Aspire

Bengaluru, Karnataka, India (Hybrid)
2 Years ago
Site Core - Solution Architect

Site Core

Dubai, Dubai, United Arab Emirates (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Sailpoint - Digital Sales Representative

Sailpoint

Austin, Texas, United States (Hybrid)
2 Months ago
easygo - Senior Software Development Engineer - Engagement

easygo

Melbourne, Victoria, Australia (On-Site)
2 Months ago
Nintendo - Product Tester I

Nintendo

Redmond, Washington, United States (On-Site)
11 Months ago
Take-Two Interactive - JD Edwards Business Analyst

Take-Two Interactive

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
IMC - Python Software Engineer

IMC

Sydney, New South Wales, Australia (On-Site)
3 Months ago
bytedance - Research Engineer Graduate (Vision AI Platform)

bytedance

Seattle, Washington, United States (On-Site)
5 Months ago
Rippling - Senior Security Engineer, Offensive Security

Rippling

United States (Remote)
1 Month ago
Valeo - Quality Leader UAP

Valeo

Mondeville, Normandy, France (On-Site)
4 Months ago
HoYoverse - Product Manager, AI-Powered Services

HoYoverse

Singapore, Singapore (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in New York, United States

Aisera - Multimedia Producer

Aisera

Palo Alto, California, United States (On-Site)
3 Months ago
Rackspace Technology - Sales Development Specialist

Rackspace Technology

San Antonio, Texas, United States (Hybrid)
2 Months ago
HappyRobot - Senior Telephony Engineer

HappyRobot

San Francisco, California, United States (Remote)
2 Weeks ago
Apple - Digital Forensic Investigator

Apple

Cupertino, California, United States (On-Site)
2 Months ago
CGS Carrers - Consultant II

CGS Carrers

United States (Remote)
3 Weeks ago
Visa - Staff Site Reliability Engineer

Visa

Ashburn, Virginia, United States (Hybrid)
1 Week ago
Nintendo - Contract - Sr Engineer, Cloud (NTD)

Nintendo

Redmond, Washington, United States (On-Site)
3 Months ago
Coherent corp. - Manufacturing Technician - Crystal Growth Area

Coherent corp.

Mount Olive, New Jersey, United States (On-Site)
1 Month ago
DataVisor - Business Operations Analyst

DataVisor

Mountain View, California, United States (Hybrid)
3 Weeks ago
BioFire - Senior Buyer: Indirect - Professional Services

BioFire

Durham, North Carolina, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Next Level Business Services - Pivotal cloud Architect

Next Level Business Services

Dearborn, Michigan, United States (On-Site)
9 Months ago
Thousand Eyes - Senior Site Reliability Engineer II, Efficiency and Performance

Thousand Eyes

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Simcorp - Software Architect - Deployment and Observability

Simcorp

Manila, Metro Manila, Philippines (On-Site)
1 Month ago
Nium - DevOps Engineer II

Nium

Malta (Hybrid)
2 Months ago
C3 IoT - Solution Engineer

C3 IoT

New York, United States (On-Site)
3 Weeks ago
Zuora - Sr Enterprise Solution Architect-Zuora Billing & CPQ

Zuora

United States (Remote)
2 Months ago
GoTo Group - Senior DevOps Engineer

GoTo Group

Jakarta, Indonesia (On-Site)
4 Months ago
Fireworks AI - Partnerships Solutions Architect, Applied AI

Fireworks AI

Redwood City, California, United States (On-Site)
3 Weeks ago
Adyen - Senior Partner Solutions Engineer

Adyen

Chicago, Illinois, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded