Senior Machine Learning Research Scientist

1 Month ago • 3 Years + • DevOps

Job Summary

Job Description

Microsoft's Azure Hardware Systems & Infrastructure (AHSI) seeks a Senior Machine Learning Research Scientist to drive model/hardware co-design, analyze novel LLM architectures, and invent efficient model architectures (e.g., sparse LLMs). Responsibilities include developing low-precision data formats, collaborating with data scientists and hardware/software teams, and optimizing CNN/transformer architectures. The ideal candidate possesses a Master's/PhD in a related field, 3+ years of ML systems experience, and expertise in frameworks like PyTorch/TensorFlow/TensorRT.
Must have:
  • Master's/PhD in ML, Computer Architecture, or related field
  • 3+ years ML systems/model optimization experience
  • Experience with PyTorch/TensorFlow/TensorRT
  • Deep knowledge of CNN/transformer architecture and optimization
  • Strong programming skills in Python/C/C++
Good to have:
  • Experience implementing low-level linear algebra kernels
  • Knowledge of GPU/TPU accelerator architecture
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Do you want to be at the forefront of innovating the latest hardware designs to propel Microsoft’s cloud growth? Are you seeking a unique career opportunity that combines both technical capabilities, cross team collaboration, with business insight and strategy?  

 

Join our Strategic Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems & Infrastructure (AHSI) organization and be a part of the organization behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission.   

Microsoft delivers more than 200 online services to more than one billion individuals worldwide and AHSI is the team behind our expanding cloud infrastructure. We deliver the core infrastructure and foundational technologies for Microsoft's cloud businesses including Microsoft Azure, Bing, MSN, Office 365, OneDrive, Skype, Teams and Xbox Live.   

 

The SPARC organization manages Azure’s hardware roadmap from architecture concept through production for all of Microsoft’s current and future on-line servicesThis role is for a highly motivated Machine Learning Engineer with a strong background in neural networks and hardware implementation. You will be involved with both model development, data type analysis, ML/HW co-design. 

Qualifications

Master's Degree/PhD in Machine learning, Computer Architecture/Systems, High-Performance Computing or related areas. 

3+ years of experience in ML systems/Model optimizations/Efficient model architecture 

Track record of original research and delivering novel results in ML systems area 

Hands on experience with frameworks such as PyTorch/TensorFlow/TensorRT 

Deep knowledge of CNN/transformer architecture and optimization strategies – quantization, sparsity, NAS, sharding, KV Cache, Flash Attention 

Strong programming skills in Python/C/C++ 

Experience in implementing low-level linear algebra/BLAS kernels and performance optimisations 

Knowledge of GPU, TPU or similar NPU accelerator architecture   

Outstanding communication skills 

 

#SPARCjobs

 

Responsibilities

  • Driving model/hardware codesign 
  • Developing and analysing novel LLM architectures 
  • Inventing novel low-precision data/number formats for training/inference SOTA LLMs  
  • Inventing novel efficient model architectures (e.g., sparse LLMs, attention architecture) 
  • Collaborating with data scientists and ML researchers 
  • Interfacing with HW architecture teams 
  • Interfacing with SW framework teams 
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Similar Jobs

ByteDance - Machine Learning Engineer - Global Payment - 2025 Start

ByteDance

Singapore (On-Site)
4 Weeks ago
Axinous - Sr. Staff Machine Learning Engineer

Axinous

San Jose, California, United States (Hybrid)
1 Month ago
ByteDance - Machine Learning Research Scientist, AI for Science

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
Hitachi - Artificial Intelligence - JBU

Hitachi

Chennai, Tamil Nadu, India (On-Site)
3 Months ago
Playrix - Feature Owner (LiveOps)

Playrix

Portugal (Remote)
3 Months ago
Sinch - DevOps Engineer (Email)

Sinch

Uttar Pradesh, India (Hybrid)
1 Month ago
WorldWinner - Senior DevOps Engineer

WorldWinner

(Remote)
1 Week ago
Luxoft - Senior Infrastructure Engineer

Luxoft

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)
1 Month ago
Next Level Business Services - IIB, DP, ODM Admin

Next Level Business Services

Burbank, California, United States (On-Site)
3 Months ago
Ubisoft - Senior Software Engineer - RUST Backend (W/M/NB)

Ubisoft

Saint-Mandé, Île-de-France, France (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Video Analysis and Quality Algorithm Engineer - 2023 Start (MS)

ByteDance

San Diego, California, United States (On-Site)
3 Months ago
G5 Games - 2D UI/UX Artist (Hidden objects project)

G5 Games

Yerevan, Yerevan, Armenia (Remote)
3 Weeks ago
ByteDance - Machine Learning Engineer Intern (Global E-commerce Risk Control) - 2025 Summer (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Sinch - Machine Learning Engineer (LLMs)

Sinch

Flanders, Belgium (Hybrid)
1 Month ago
Playrix - Feature Owner (LiveOps)

Playrix

Ireland (Remote)
3 Months ago
Samsung Semiconductor - Intern, Machine Learning Engineer - VLMs

Samsung Semiconductor

San Jose, California, United States (Hybrid)
2 Weeks ago
NXP - Junior Developer of Systems Testing Infrastructure

NXP

Brno, South Moravian Region, Czechia (On-Site)
4 Months ago
G5 Games - 2D UI/UX Artist (match-3 project)

G5 Games

Yerevan, Yerevan, Armenia (Remote)
3 Months ago
ByteDance - Machine Learning Research Scientist, AI for Science

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Cambridge, England, United Kingdom

ESL FACEIT Group - EFG - Senior Software Engineer - Platform and Game Servers

ESL FACEIT Group - EFG

London, England, United Kingdom (Remote)
4 Months ago
Jumar - QA Tester

Jumar

London, England, United Kingdom (Hybrid)
6 Months ago
Team17 - Senior Brand Manager

Team17

England, United Kingdom (Hybrid)
2 Weeks ago
Fanatics - Assistant Merchandiser

Fanatics

Manchester, England, United Kingdom (Hybrid)
3 Months ago
Build A Rocket Boy - Senior Online Programmer

Build A Rocket Boy

Edinburgh, Scotland, United Kingdom (On-Site)
2 Weeks ago
Climax Studios - Senior Technical Designer

Climax Studios

Liverpool, England, United Kingdom (On-Site)
3 Months ago
Climax Studios - Technical Artist

Climax Studios

London, England, United Kingdom (On-Site)
1 Week ago
DNEG - Department Manager (DNEG VFX) Maternity Cover

DNEG

London, England, United Kingdom (On-Site)
1 Week ago
Build A Rocket Boy - Technical Designer

Build A Rocket Boy

Edinburgh, Scotland, United Kingdom (On-Site)
1 Day ago
BULKHEAD - Senior Gameplay Programmer

BULKHEAD

Derby, England, United Kingdom (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

ByteDance - Tech Lead (SRE) - Cloud Infrastructure

ByteDance

Singapore (On-Site)
2 Months ago
PwC - ETIC, Cloud Infrastructure - Senior Associate

PwC

Cairo, Cairo Governorate, Egypt (On-Site)
2 Months ago
Rackspace Technology - Software Developer III (Python with Linux Automation)

Rackspace Technology

India (Remote)
2 Months ago
GoTo Group - Lead Software Engineer - Engineering Platform

GoTo Group

Gurugram, Haryana, India (On-Site)
2 Months ago
Nagarro - Principal Engineer, QA Automation

Nagarro

India (Remote)
3 Months ago
Interactive Brokers - Senior Platform Engineer - Design

Interactive Brokers

Fort Lauderdale, Florida, United States (Hybrid)
3 Months ago
Saviynt - Principal Engineer – SRE

Saviynt

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Blazesoft - DevOps engineer

Blazesoft

Vaughan, Ontario, Canada (On-Site)
1 Month ago
Litera - Site Reliability Engineer

Litera

Ahmedabad, Gujarat, India (On-Site)
3 Months ago
ESL FACEIT Group - EFG - Site Reliability Engineer - Remote

ESL FACEIT Group - EFG

(Remote)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Milan, Lombardy, Italy (On-Site)

Gurugram, Haryana, India (On-Site)

Prague, Prague, Czechia (On-Site)

Montreal, Quebec, Canada (On-Site)

Dublin, County Dublin, Ireland (On-Site)

London, England, United Kingdom (On-Site)

Virginia, United States (On-Site)

Hyderabad, Telangana, India (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug