Research Intern - Finetuning for Post Training Quantization

1 Month ago • 1 Years + • Research & Development • $78,600 PA - $154,560 PA

Job Summary

Job Description

This Research Internship at Microsoft focuses on finetuning optimizations for post-training quantization of large language models (LLMs). The intern will explore research ideas to achieve lower bit quantization (~2 bits/param and below) using novel encoding and tuning methods. The internship requires hands-on experience with deep learning tools like PyTorch, DeepSpeed, and FSDP. Responsibilities include developing and prototyping original research agendas, collaborating with researchers and product teams, and presenting findings. The internship is 12 weeks long and based in Redmond, Washington. Interns will work alongside experienced researchers, contributing to advancements in LLM inference efficiency.
Must have:
  • PhD in CS or related field
  • 1+ year experience in deep learning systems optimizations
  • Finetuning and quantization expertise
  • Proficiency in PyTorch, DeepSpeed, FSDP
Good to have:
  • Experience with novel encoding and tuning methods
  • Strong research and collaboration skills
Perks:
  • Industry-leading healthcare
  • Educational resources
  • Product and service discounts
  • Savings and investment programs
  • Maternity and paternity leave
  • Generous time off
  • Giving programs
  • Networking opportunities

Job Details

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

Finetuning optimizations for driving extreme datatypes and quantization for LLM inference. This project would explore research ideas pushing towards lower bits (~2 bits/param and below) with novel encoding and tuning methods.

Qualifications

Required Qualifications

  • Currently enrolled in a PhD program in Computer Science or a related STEM field.
  • At least 1 year of experience with systems optimizations for deep learning training and finetuning.

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Hands-on expertise with deep learning tools and frameworks, such as Pytorch, DeepSpeed, and FSDP.
  • Demonstrated ability to develop and prototype original research agendas.
  • Ability to collaborate effectively with other researchers and product development teams.
  • Proficient interpersonal skills, cross-group, and cross-culture collaboration.

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.

 

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Similar Jobs

ByteDance - Researcher Graduate (Applied Machine Learning - Enterprise) -2025 Start (BS/MS)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Arkose Labs - Senior Machine Learning Researcher

Arkose Labs

Pune, Maharashtra, India (Hybrid)
4 Months ago
ByteDance - High-Performance Computing Research Scientist (Algorithm Acceleration)

ByteDance

Seattle, Washington, United States (On-Site)
5 Days ago
NVIDIA - VLSI Timing Methodology Intern - Summer 2025

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Google - Student Researcher, BS/MS, Winter/Summer 2025

Google

Ann Arbor, Michigan, United States (On-Site)
3 Months ago
Cadence - Sr Principal Product Validation Engineer

Cadence

Noida, Uttar Pradesh, India (On-Site)
4 Months ago
ByteDance - Research Engineer- Foundation Model AI Platform- Seattle

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
NVIDIA - Senior Digital Design Verification Engineer - Hardware

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
1 Month ago
Riot Games - Staff Software Engineer, Generalist - Unreal Ecosystem

Riot Games

Dublin, County Dublin, Ireland (On-Site)
3 Months ago
Pixar Animation Studios - Software Engineer, Platform

Pixar Animation Studios

Emeryville, California, United States (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Recro - Automatic speech Recognition

Recro

Gurugram, Haryana, India (On-Site)
4 Months ago
Microsoft - Applied Scientist II

Microsoft

Bengaluru, Karnataka, India (On-Site)
1 Month ago
NVIDIA - Director, AI Software

NVIDIA

Taipei City, Taiwan (On-Site)
1 Month ago
NVIDIA - Global Developer Relations Account Manager – Ansys

NVIDIA

Santa Clara, California, United States (On-Site)
2 Weeks ago
Spell Brush - AI Anime Researcher

Spell Brush

Tokyo, Japan (On-Site)
4 Months ago
Adept Global - 3D Geometry Engineer

Adept Global

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Onward Search - API Developer

Onward Search

North Arlington, New Jersey, United States (Remote)
5 Days ago
NVIDIA - Senior Tegra System Performance Architect

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
ByteDance - Senior Software Engineer - Serverless Compute Infrastructure

ByteDance

San Jose, California, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Samsung Semiconductor - Staff Engineer, SOC Design

Samsung Semiconductor

Folsom, California, United States (Hybrid)
2 Weeks ago
Passive Logic - Technical Engineering Program Manager

Passive Logic

Salt Lake City, Utah, United States (On-Site)
4 Months ago
The Walt Disney Company - Senior Content Distribution Engineer

The Walt Disney Company

New York, New York, United States (On-Site)
1 Month ago
Twitch - Senior Data Scientist - ML

Twitch

New York, New York, United States (On-Site)
2 Months ago
Google - Software Engineer III, Infrastructure, Google TV

Google

San Jose, California, United States (On-Site)
3 Months ago
Microsoft - Research Intern - Algorithms Group: Reasoning Abilities of LLMs

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
ByteDance - Backend Software Engineer - Customer Service Platform - Seattle

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Crunchyroll - Director of Product, Curation & Personalization

Crunchyroll

San Francisco, California, United States (On-Site)
1 Week ago
The Walt Disney Company - Senior Portfolio Manager

The Walt Disney Company

Orlando, Florida, United States (On-Site)
2 Weeks ago
Lionsgate Games - Intern, Business & Legal Affairs - Marketing

Lionsgate Games

Santa Monica, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

NVIDIA - Machine Learning Engineer Intern - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago
Samsung Semiconductor - Intern, Color Scientist

Samsung Semiconductor

San Jose, California, United States (On-Site)
1 Month ago
Rivos - Silicon Microarchitecture & Logic Design - Intern

Rivos

Santa Clara, California, United States (On-Site)
4 Months ago
ByteDance - Senior Research Scientist- Foundation Model, Vision and Language

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Rambus - SMTS CAD Engineering

Rambus

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
Nielsen Holdings - Staff Machine learning Engineer

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Alstom - Engineering Tools Deployment Manager

Alstom

Bengaluru, Karnataka, India (On-Site)
4 Months ago
NVIDIA - Senior Mixed-Signal Design Verification Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Ubisoft - Architecte de Stockage

Ubisoft

Montreal, Quebec, Canada (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

New York, New York, United States (Hybrid)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

London, England, United Kingdom (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Mountain View, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug