Research Intern - Memory & Orchestration in Large Language Models

1 Hour ago • Upto 1 Years • Research & Development • Artificial Intelligence

About the job

Job Description

Microsoft's Societal Resilience team seeks Research Interns to advance AI capabilities in large language models (LLMs) and multimodal models. The internship focuses on memory and orchestration within these models, involving developing new embedding methods (graph and multimodal), creating advanced RAG systems, and conducting context-specific fine-tuning. Interns will conduct hands-on research, investigate embedding techniques, develop RAG systems, specialize in context-specific fine-tuning, and collaborate with interdisciplinary teams. The 12-week internship includes mentorship, collaboration, presentation of findings, and contributions to the research community. The research aims to build more resilient and adaptive AI systems.
Must have:
  • PhD in relevant STEM field
  • Experience with LLMs and multimodal models
  • Strong Python programming skills
  • Familiarity with AI/ML frameworks
  • Conducting hands-on research
Good to have:
  • Experience with graph and multimodal embeddings
  • Familiarity with generative model architectures
  • Experience building RAG systems
  • Experience publishing research
  • Experience in interdisciplinary teams
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment. 

 

Our Societal Resilience team is seeking Research Interns to join us in pushing the boundaries of AI capabilities with large language models (LLMs) and multimodal models. Our mission is to prepare for the unknown challenges of the future by developing resilient systems and technologies that can support societal and individual resilience during times of crisis. 

 

The research focuses on memory and orchestration within LLMs and multimodal models. We are particularly interested in developing new methods for leveraging and training various types of embeddings, such as graph embeddings and multimodal embeddings, creating advanced retrieval augmented generation (RAG) systems, and conducting specialized context-specific fine-tuning to build more capable and adaptive models. We believe that these capabilities will play a critical role in building resilience by amplifying the ability of individuals and organizations to respond to uncertainty. 

Qualifications

Required Qualifications

  • Currently enrolled in a PhD program in Computer Science, Machine Learning, Artificial Intelligence, or a related STEM field. 

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Experience with large language models and multimodal models, including training and fine-tuning. 
  • Familiarity with embeddings, including graph embeddings and multimodal embeddings.
  • Familiarity with the architecture of generative models, such as variational autoencoders and diffusion models.
  • Experience building and deploying retrieval augmented generation systems. 
  • Experience working in interdisciplinary teams, with a focus on AI research and development. 
  • Strong programming skills in Python and familiarity with AI/ML frameworks such as PyTorch or TensorFlow. 
  • Previous experience publishing academic research in top-tier conferences or journals. 

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Additional Responsibilities

  • Conducting hands-on research into systems for memory and orchestration of LLMs and multimodal models. 
  • Investigating new embedding techniques, including graph embeddings and multimodal embeddings. 
  • Developing advanced retrieval augmented generation systems to enhance LLM capabilities. 
  • Specializing in context-specific fine-tuning for creating adaptable AI systems. 
  • Collaborating with interdisciplinary teams of researchers and engineers on challenging and impactful projects. 
  • Presenting research findings and participating in research discussions. 
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description
$78.6K - $154.6K/yr (Outscal est.)
$116.6K/yr avg.
Redmond, Washington, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

London, England, United Kingdom (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-Site)

San José, San José Province, Costa Rica (On-Site)

Prague, Prague, Czechia (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

ByteDance - Machine Learning Engineer - AML Algorithm

ByteDance, United States (On-Site)

Ushur - Data Science Manager

Ushur, India (Hybrid)

Google - SoC UPF Design Engineer, Google Cloud

Google, United States (On-Site)

Google - Software Engineering Intern, PhD, Summer 2025

Google, United States (On-Site)

Valve corporation - Software Engineer for HW

Valve corporation, United States (On-Site)

Samsung Semiconductor - Senior Engineer, AI

Samsung Semiconductor, United States (Hybrid)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Senior Machine Learning Engineer - AML Algorithm

ByteDance, United States (On-Site)

Ubisoft - Research Student - Ubisoft La Forge

Ubisoft, China (On_site)

ByteDance - Architect - AML Engine

ByteDance, United States (On-Site)

Novancy One | Digital Talent Recruitment - Expert data scientists/Researcher in Generative AI Ref. 005529

Novancy One | Digital Talent Recruitment, United States (On-Site)

Eightfold - Lead Engineer- Backend

Eightfold, India (Hybrid)

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

CloudHire - Microsoft /Inquoto Sales Specialist

CloudHire, United States (On-Site)

The Walt Disney Company - Sr Product Manager II - Identity Observability

The Walt Disney Company, United States (On-Site)

Samsung Semiconductor - Senior Manager, ASIC Design Enablement

Samsung Semiconductor, United States (On-Site)

Patel greene - Roadway Engineer Intern

Patel greene, United States (On-Site)

Jam City - Lead Narrative Designer

Jam City, United States (Remote)

Activision - Senior Cinematic Designer

Activision, United States (On-Site)

Naughty Dog - IT HELPDESK TECHNICIAN
CONTINGENT

Naughty Dog, United States (On-Site)

Axinous - Account Executive - Risk Management (Avalor)

Axinous, United States (Remote)

Grindr - Head of International

Grindr, United States (Hybrid)

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Hololight - Software Developer (m/f/d)

Hololight, (On-Site)

Ondezx - Python Developer (Research Expert)

Ondezx, India (On-Site)

Marvell - Analog Design Engineer, Senior Staff

Marvell, Italy (On-Site)

Intel Corporation - Component Debug Engineering Manager

Intel Corporation, Malaysia (Hybrid)

Intel Corporation - Senior Logic Design Verification Engineer

Intel Corporation, Malaysia (Hybrid)

Rivos - CPU Physical Design - Full time

Rivos, India (On-Site)

Intel Corporation - NAND Product Engineer

Intel Corporation, China (On-Site)

Get notifed when new similar jobs are uploaded