Cambridge Research Internship on Compiler-like Tools for LLM Efficiency

1 Day ago • Upto 1 Years • Research & Development

About the job

Job Description

This internship at Microsoft Research Cambridge focuses on developing compiler-like tools to enhance the efficiency of Large Language Models (LLMs). The intern will research and develop novel tools addressing quantization and parallelization techniques. Responsibilities include designing, implementing, and evaluating these tools, and presenting findings in technical documents and presentations. The ideal candidate will possess substantial experience in static analysis, verification, constraint solving, and working with LLMs, inference, parallelization, and quantization. Proficiency in PyTorch and CUDA is preferred. The internship requires a physical presence in Cambridge, UK during winter/spring 2025.
Must have:
  • Masters/PhD in CS/ML
  • Static analysis, verification, constraint solving experience
  • LLMs, inference, parallelization, quantization experience
Good to have:
  • Strong technical communication
  • PyTorch, CUDA, tooling experience
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Overview

The Future AI Infrastructure team at Microsoft Research Cambridge (UK) is dedicated to developing technologies for AI data centers of the future. We are seeking a masters/PhD student to join us in Cambridge winter/spring 2025 to work on automatic, compiler-like tools for LLM efficiency, covering topics such as quantization and parallelization. You will joining a welcoming and highly collaborative environment and work on creative and challenging problems during your internship. 

Qualifications

 

Required/Minimum Qualifications: 

  • Be enrolled in Masters/PhD program in Computer Science/Machine Learning or related discipline 
  • Substantial experience across static analysis, verification, constraint solving 
  • Substantial experience across LMs, inference, parallelization, quantization 

Other Requirements: 

  • Interns are expected to be physically located in Cambridge, UK 

Preferred/Additional Qualifications: 

  • Strong technical/scientific communication 
  • PyTorch, CUDA, tooling 

 

Responsibilities

  • Research and develop novel compiler-like tools for LLM efficiency 
  • Design, implement and evaluate such tools 
  • Write and present your findings in technical documents or presentations 
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Noida, Uttar Pradesh, India (On-Site)

Paris, Île-de-France, France (On-Site)

Hyderabad, Telangana, India (On-Site)

Hyderabad, Telangana, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Noida, Uttar Pradesh, India (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Get notifed when new similar jobs are uploaded

Jobs in Cambridge, England, United Kingdom

Alpha Sense - Account Director, Corporate

Alpha Sense, United Kingdom (On-Site)

Silent Games - Programmer - [3-6 Month Contract] UK ONLY

Silent Games, United Kingdom (On-Site)

PlayStation Global - Research Assistants - Contract Roles

PlayStation Global, United Kingdom (On-Site)

Activision - Expert Animator

Activision, United Kingdom (Hybrid)

Alphasense - Account Executive, Financial Services

Alphasense, United Kingdom (On-Site)

Scopely - Senior 2D VFX Artist

Scopely, United Kingdom (Remote)

Playground Games - Pipeline Technical Artist - Contract

Playground Games, United Kingdom (Hybrid)

Axinous - Senior Sales Engineer - Major Accounts UK

Axinous, United Kingdom (Remote)

Nexus Studios - Senior Technical Artist - Unity

Nexus Studios, United Kingdom (Hybrid)

PlayStation Global - Technical Researcher

PlayStation Global, United Kingdom (Hybrid)

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Matic Robots - Systems  Engineer (Embedded Linux)

Matic Robots, Canada (On-Site)

CLO Virtual Fashion  Inc  - C++ Developer

CLO Virtual Fashion Inc , India (On-Site)

Microsoft - Research Intern - Cloud Competitive Intelligence

Microsoft, United States (On-Site)

Pattern® - Senior Software Engineer - NodeJS

Pattern®, India (On-Site)

MediaTek - CPU Verification

MediaTek, India (On-Site)

The Walt Disney Company - Disney Research Intern

The Walt Disney Company, Switzerland (On-Site)

Nielsen Holdings - Software Engineer - Java PL/SQL

Nielsen Holdings, India (Hybrid)

Daybreak Game Company LLC - Software Development Engineer (Cardset)

Daybreak Game Company LLC, United States (Remote)

Valeo - Junior Engineer / Engineer - CAD

Valeo, India (On-Site)

Get notifed when new similar jobs are uploaded