Research Intern - Finetuning for Post Training Quantization

3 Months ago • 1+ Years • Research & Development • $78,600 PA - $154,560 PA

Job Summary

Job Description

This Research Internship at Microsoft focuses on finetuning optimizations for post-training quantization of large language models (LLMs). The intern will explore research ideas for reaching lower-bit quantization (~2 bits/param and below) using novel encoding and tuning methods. Hands-on experience with deep learning tools such as PyTorch, DeepSpeed, and FSDP is preferred. Responsibilities include developing and prototyping original research agendas, collaborating with researchers and product teams, and presenting findings. The internship is 12 weeks long and based in Redmond, Washington. Interns work alongside experienced researchers and contribute to advances in LLM inference efficiency.
Must have:
  • Current enrollment in a PhD program in Computer Science or a related STEM field
  • 1+ year of experience with systems optimizations for deep learning training and finetuning
Good to have:
  • Hands-on expertise with PyTorch, DeepSpeed, and FSDP
  • Finetuning and quantization expertise
  • Experience with novel encoding and tuning methods
  • Strong research and collaboration skills
Perks:
  • Industry-leading healthcare
  • Educational resources
  • Product and service discounts
  • Savings and investment programs
  • Maternity and paternity leave
  • Generous time off
  • Giving programs
  • Networking opportunities

Job Details

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

This project focuses on finetuning optimizations that enable extreme datatypes and quantization for LLM inference. It will explore research ideas that push towards lower bit widths (~2 bits/param and below) using novel encoding and tuning methods.

Qualifications

Required Qualifications

  • Currently enrolled in a PhD program in Computer Science or a related STEM field.
  • At least 1 year of experience with systems optimizations for deep learning training and finetuning.

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Hands-on expertise with deep learning tools and frameworks, such as PyTorch, DeepSpeed, and FSDP.
  • Demonstrated ability to develop and prototype original research agendas.
  • Ability to collaborate effectively with other researchers and product development teams.
  • Proficient interpersonal skills and the ability to collaborate across groups and cultures.

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.

 

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
  • Industry-leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect





About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
