Research Intern - Finetuning for Post Training Quantization

21 Minutes ago • 1 Years + • Research & Development • $78,600 PA - $154,560 PA

About the job

Job Description

This Research Internship at Microsoft focuses on finetuning optimizations for post-training quantization of large language models (LLMs). The intern will explore research ideas to achieve lower bit quantization (~2 bits/param and below) using novel encoding and tuning methods. The internship requires hands-on experience with deep learning tools like PyTorch, DeepSpeed, and FSDP. Responsibilities include developing and prototyping original research agendas, collaborating with researchers and product teams, and presenting findings. The internship is 12 weeks long and based in Redmond, Washington. Interns will work alongside experienced researchers, contributing to advancements in LLM inference efficiency.
Must have:
  • PhD in CS or related field
  • 1+ year experience in deep learning systems optimizations
  • Finetuning and quantization expertise
  • Proficiency in PyTorch, DeepSpeed, FSDP
Good to have:
  • Experience with novel encoding and tuning methods
  • Strong research and collaboration skills
Perks:
  • Industry-leading healthcare
  • Educational resources
  • Product and service discounts
  • Savings and investment programs
  • Maternity and paternity leave
  • Generous time off
  • Giving programs
  • Networking opportunities

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

Finetuning optimizations for driving extreme datatypes and quantization for LLM inference. This project would explore research ideas pushing towards lower bits (~2 bits/param and below) with novel encoding and tuning methods.

Qualifications

Required Qualifications

  • Currently enrolled in a PhD program in Computer Science or a related STEM field.
  • At least 1 year of experience with systems optimizations for deep learning training and finetuning.

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Hands-on expertise with deep learning tools and frameworks, such as Pytorch, DeepSpeed, and FSDP.
  • Demonstrated ability to develop and prototype original research agendas.
  • Ability to collaborate effectively with other researchers and product development teams.
  • Proficient interpersonal skills, cross-group, and cross-culture collaboration.

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.

 

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description
$78.6K - $154.6K/yr (Outscal est.)
$116.6K/yr avg.
Redmond, Washington, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Barcelona, Catalonia, Spain (On-Site)

Atlanta, Georgia, United States (Hybrid)

Redmond, Washington, United States (On-Site)

Reston, Virginia, United States (On-Site)

Charlotte, North Carolina, United States (On-Site)

New York, New York, United States (On-Site)

Redmond, Washington, United States (On-Site)

Redmond, Washington, United States (Remote)

Redmond, Washington, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

Tencent - Senior Staff Researcher

Tencent, United States (On-Site)

Samsung Semiconductor - Intern, Compiler Engineer

Samsung Semiconductor, United States (Hybrid)

ByteDance - Senior Site Reliability Engineer, ML System

ByteDance, United States (On-Site)

GoDaddy - Senior Software Engineer

GoDaddy, India (On-Site)

Meta - Design Verification Engineer

Meta, United States (On-Site)

Ceragon Networks - Verification Team Lead

Ceragon Networks, India (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Software Engineer in ML Engineering Platform

ByteDance, United States (On-Site)

Rackspace Technology - Principal MLOPs Engineer

Rackspace Technology, United States (Remote)

Enterprise Bot - Data Scientist

Enterprise Bot, India (On-Site)

ByteDance - Research Scientist, Reinforcement Learning

ByteDance, United States (On-Site)

ByteDance - Research Scientist, Multimodality

ByteDance, United States (On-Site)

ByteDance - Research Scientist, Reinforcement Learning

ByteDance, United States (On-Site)

Krafton  - [AI] AI Engineer - NLP/Chatbot (3년 이상)

Krafton , South Korea (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Joyride Games - UI/UX Designer

Joyride Games, United States (Remote)

Fluence - Internal Audit Manager

Fluence, United States (On-Site)

Broadcast Music,  Inc  (BMI) - Sr. Director/Director, Creative

Broadcast Music, Inc (BMI), United States (Hybrid)

Ziff Davis - Creative Strategy Lead

Ziff Davis, United States (Hybrid)

Critical mass - Creative Director, Design

Critical mass, United States (On-Site)

Barbaricum - Test Engineer

Barbaricum, United States (Hybrid)

Epoch Games - Unreal Engine Level Designer

Epoch Games, United States (Remote)

Paypal - Senior Product Manager - Growth

Paypal, United States (Hybrid)

Onward Search - Compliance Analyst V

Onward Search, United States (On-Site)

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Get notifed when new similar jobs are uploaded