Senior Machine Learning Engineer

1 Hour ago • 4-8 Years • Artificial Intelligence • Research & Development

About the job

Job Description

Microsoft seeks a Senior Machine Learning Engineer to design, train, and tune ML models and their infrastructure. The role involves incubating new ideas with researchers, collaborating with product engineers and partners, and applying sound engineering principles while pushing the boundaries of current conventions. Responsibilities include partnering with researchers to implement novel ML algorithms, interacting with product teams and open-source communities, engineering high-performance ML training jobs, and reducing infrastructure complexities. This includes simplifying onboarding, debugging issues, scaling training jobs, and maintaining high GPU utilization.
Must have:
  • Bachelor's degree in relevant field
  • 4+ years ML engineering experience
  • Experience with Python, PyTorch, TensorFlow
  • Experience building and optimizing ML training jobs
Good to have:
  • Deep understanding of deep learning
  • Experience with CUDA, Triton, DirectML
  • Experience with LLMs
  • Experience with ML infrastructure
  • Experience in academic research
  • CI/CD pipeline experience
  • Open-source contributions
Perks:
  • Industry-leading healthcare
  • Educational resources
  • Product and service discounts
  • Savings and investments
  • Maternity/paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Overview

We are looking for a Senior Machine Learning Engineer with analytical and developer skills to join our team to develop industry leading machine learning solutions. Successful candidates will have several years of experience designing, training and tuning machine learning (ML) models and their supporting infrastructure. A large part of the role will be spent incubating new ideas with researchers and working with product team engineers and/or partners who leverage the research. As a result, candidates should be comfortable learning new approaches and pushing the boundaries of current conventions while applying sound engineering principles.   
 
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.  

 

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

Qualifications

Required Qualifications

  • Bachelor's Degree in Computer Science, Engineering, Statistics, Mathematics, or related field AND 4+ years technical engineering experience with coding in languages including, but not limited to, Python, C, C++, C#, or Java
    • OR  equivalent experience.
  • 4+ years of industry or academic experience in applying, implementing, and/or developing algorithms for machine learning, artificial intelligence, or statistics. 
  • 2+ years of industry or academic experience with common ML engineering programming languages and platforms such as Python, Python numerical libraries, PyTorch, TensorFlow and/or HuggingFace. 
  • 2+ years of industry or academic experience with building, debugging and optimizing large, high performance ML training jobs. 

Preferred Qualifications 

  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience. 
  • Deep understanding of deep learning model architectures.
  • Experience working in or with a commercial software product team.
  • Experience with low-level, high performance ML frameworks such as CUDA, Triton, DirectML or similar frameworks  
  • Experience with large language models (LLM) architectures, inference and fine-tuning  
  • Experience with using and supporting ML infrastructure including job schedulers, big data storage, low latency interconnects and large clusters of GPUs 
  • Experience working in an academic research environment and communicating effectively withing such an environment 
  • Ability to engineer industry solutions and achieve/maintain target service level agreements  
  • Ability to work independently and ramp-up quickly on complex, unfamiliar code  
  • Experience with engineering practices, continuous integration and continuous delivery/continuous deployment (CI/CD) pipelines and Git
  • Experience contributing and/or maintaining open-source projects  

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

Microsoft will accept applications for the role until January 13, 2025.

 

 

#Research

Responsibilities

  •  You'll be a member of a cross-disciplinary team of engineers, data scientists and researchers working in a research-oriented environment.   
  • You’ll partner with researchers to quickly learn, implement and evolve novel ML algorithms and approaches.  
  • You’ll interact with Microsoft product teams, Microsoft Research (MSR) partners and open-source communities to bring research implementations to broader audiences.
  • You’ll engineer complex, high performant, efficient ML training jobs using well-known and cutting-edge tools and frameworks.
  • You’ll reduce ML infrastructure complexities across research. projects by simplifying onboarding, debugging infrastructures issues, scaling training jobs while maintaining high Graphics Processing Unit (GPU) utilization, contributing to open-source tooling/frameworks and keeping ML infrastructure documentation simple and up to date.
  • Embody our and .

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description
$153.6K - $250.2K/yr (Outscal est.)
$201.9K/yr avg.
New York, New York, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Seoul, South Korea (On-Site)

New York, New York, United States (On-Site)

Texas, United States (Hybrid)

Dublin, County Dublin, Ireland (On-Site)

Hyderabad, Telangana, India (On-Site)

Sydney, New South Wales, Australia (Hybrid)

Bengaluru, Karnataka, India (On-Site)

Hyderabad, Telangana, India (On-Site)

London, England, United Kingdom (On-Site)

Beijing, Beijing, China (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

Moon Active - Automation Engineer (Java)

Moon Active, Israel (On-Site)

LeoVegas - Backend Engineer - Payments

LeoVegas, Poland (Hybrid)

Salesforce - Software Engineer, Android - Slack (Senior/Staff)

Salesforce, United States (On-Site)

Next Level Business Services - API Developer

Next Level Business Services, United States (On-Site)

Next Level Business Services - Sr. Cassandra Architect

Next Level Business Services, United States (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Paypal - Staff Engineer, Backend (Java)

Paypal, United States (Hybrid)

Rush Street Interactive - Senior Full Stack Engineer

Rush Street Interactive, Canada (On-Site)

ION - Technical Consultant - Endur

ION, United States (On-Site)

Head Digital Works - Sr Full Stack Developer

Head Digital Works, India (On-Site)

 Sagecor Solutions - Application Developer 2 (NRN - 009)

Sagecor Solutions, United States (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in New York, New York, United States

Zoox - Sensor Software Developer

Zoox, United States (On-Site)

The Walt Disney Company - PM Housekeeping Room Attendant - Part Time

The Walt Disney Company, United States (On-Site)

Tencent - Senior Technical Artist

Tencent, United States (On-Site)

ByteDance - Senior Software Engineer, Edge Cloud Platform

ByteDance, United States (On-Site)

Netflix - Administrative Assistant, Experience Design (XD)

Netflix, United States (On-Site)

Next Level Business Services - Mobile Architect(Native Mobile Apps)

Next Level Business Services, United States (On-Site)

Blizzard Entertainment - MBA Summer Internship – General Management

Blizzard Entertainment, United States (On-Site)

Unity - IT Operations Specialist

Unity, United States (On-Site)

ION - Senior Technical Consultant - Endur

ION, United States (On-Site)

ION - Technical Consultant - Endur

ION, United States (On-Site)

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

FTF Studios - FTF Senior Programmer

FTF Studios, (Remote)

Brillio - AI GTM Leader - R01541139

Brillio, United States (Hybrid)

CharacterAI - Software Engineer, Core Product

CharacterAI, United States (On-Site)

Microsoft - Senior Research Software Engineer

Microsoft, United States (On-Site)

Google - Technical Writer, Artificial Intelligence

Google, United States (On-Site)

Interface AI - SDE III - Backend

Interface AI, India (Remote)

Microsoft - Senior Applied Scientist

Microsoft, China (On-Site)

Get notifed when new similar jobs are uploaded