Cambridge Internship in ML Model Optimization

1 Hour ago • Upto 1 Years • DevOps • Undisclosed

About the job

Job Description

This internship within Microsoft's Azure Hardware Systems & Infrastructure (AHSI) organization, specifically the Strategic Planning and Architecture (SPARC) team, focuses on model compression and optimization for Large Language Models (LLMs). The intern will research and develop quantization flows for LLM inference and training, design, implement, and evaluate the performance of quantized SOTA LLMs, and present findings in technical documents and presentations. The role involves hands-on experience with quantization of LLMs, model compression, low-precision data types, and potentially PyTorch and Python. The internship is located in Cambridge, UK and is open to Masters/PhD students in Computer Science/Machine Learning or related fields.
Must have:
  • Masters/PhD in CS/ML
  • Experience in LLM quantization and model compression
  • Knowledge of low-precision data types
  • Research and development of quantization flow
  • Performance evaluation of quantized LLMs
  • Technical document and presentation writing
Good to have:
  • PyTorch
  • Python
  • SW Tool development experience
  • Excellent communication skills
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Overview

Join our Strategic Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems & Infrastructure (AHSI) organization and be a part of the organization behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. We are seeking a masters/PhD student to join us in Cambridge winter/spring 2025 to work on model compression and optimization for LLMs, covering topics such as post training quantization and quantization aware training. You will be joining a welcoming and highly interdisciplinary team and work on creative and challenging problems during your internship. 

Qualifications

Required/Minimum Qualifications: 

  • Be enrolled in Masters/PhD program in Computer Science/Machine Learning or related discipline  
  • Substantial experience quantization of LLMs, model compression  
  • Substantial knowledge in low-precision data type such as floating point, integer formats, block floats 

 

Other Requirements: 

  • Cloud Background Check 

 

Preferred/Additional Qualifications: 

  • PyTorch, Python, Hands-on experience in SW Tool development  
  • Outstanding communication skills 

 

Responsibilities

  • Research and develop quantization flow for LLM inference and training  
  • Design, implement and evaluate performance of quantized SOTA LLMs  
  • Write and present your findings in technical documents or presentations 
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Redmond, Washington, United States (On-Site)

Cambridge, England, United Kingdom (On-Site)

Bengaluru, Karnataka, India (On-Site)

Redmond, Washington, United States (On-Site)

Vancouver, British Columbia, Canada (On-Site)

Texas, United States (On-Site)

Redmond, Washington, United States (On-Site)

Stockholm, Stockholm County, Sweden (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

Blizzard Entertainment - Principal Automation Engineer - Unannounced Project

Blizzard Entertainment, United States (Hybrid)

Wipro - Azure AD

Wipro, India (On-Site)

Saviynt - Senior Engineer

Saviynt, India (Hybrid)

Netradyne - Site Reliability Engineer (SRE)

Netradyne, India (On-Site)

Okta - Senior Software Engineer

Okta, India (On-Site)

Microsoft - Software Engineer - Security Focused

Microsoft, Romania (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

CloudBees - Senior Frontend Developer

CloudBees, India (Hybrid)

LeoVegas - Cloud Security Engineer

LeoVegas, Sweden (Hybrid)

Bounteous - Product Manager, B2B

Bounteous, United States (Hybrid)

LeoVegas - Senior System Engineer

LeoVegas, Brazil (On-Site)

Microsoft - Solution Area Specialist - Data & AI

Microsoft, Kuwait (On-Site)

Microsoft - Sr. AI HW Quality Engineer

Microsoft, Taiwan (On-Site)

Adobe - Senior Computer Scientist

Adobe, India (On-Site)

Bentley Systems - Software Quality Analyst I

Bentley Systems, India (Hybrid)

Ajmera Infotech - Sr. Accountant

Ajmera Infotech, India (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in Cambridge, England, United Kingdom

Trek - Sales Associate

Trek, United Kingdom (On-Site)

Playground Games - Lead Character Artist

Playground Games, United Kingdom (Hybrid)

Creative Assembly - Brand Development Manager

Creative Assembly, United Kingdom (On-Site)

Rockstar Games - Iconographer

Rockstar Games, United Kingdom (On-Site)

Zones - Delivery Manager

Zones, United Kingdom (Hybrid)

Fabric - Applied Researcher, Cryptography Hardware

Fabric, United Kingdom (Remote)

Rockstar Games - Software Engineer, C#/Java (All Levels)

Rockstar Games, United Kingdom (On-Site)

Assystems - Project Lead Engineer

Assystems, United Kingdom (Hybrid)

Get notifed when new similar jobs are uploaded

DevOps Jobs

Keywords Studios (Player Support) - Solutions Architect

Keywords Studios (Player Support), Canada (Remote)

ION - Lead Python Engineer, New York

ION, United States (Hybrid)

Valvoline Global Operations - Senior IT Release Manager

Valvoline Global Operations, India (On-Site)

Microsoft - Senior System Electrical Engineer

Microsoft, Taiwan (On-Site)

Glean - Site Reliability Engineer (India)

Glean, India (On-Site)

Rockstar Games - Systems Engineer, Automation

Rockstar Games, United Kingdom (On-Site)

DraftKings - Lead Software Engineer

DraftKings, Bulgaria (Hybrid)

Get notifed when new similar jobs are uploaded