Research Intern - Multimodal AI Research

5 Days ago • 1 Years + • Artificial Intelligence • $78,600 PA - $154,560 PA

Job Summary

Job Description

Microsoft's AI Platform team seeks Research Interns for its Multimodal Intelligence team. The internship focuses on cutting-edge research in multimodal AI, encompassing video, image, and document understanding. Responsibilities include collaborating with researchers, presenting findings, and contributing to projects like video understanding, information retrieval, and leveraging LLMs for improved document/video/image understanding. The ideal candidate possesses a PhD background in a relevant field (Computer Vision, NLP, etc.), at least one year of hands-on deep learning experience, and proficiency in Python and relevant tools (PyTorch). Publication in top-tier conferences is a plus. The internship is a 12-week program based in Redmond, Washington.
Must have:
  • PhD in relevant field
  • 1+ year deep learning experience
  • Proficiency in Python
  • NLP/CV background
Good to have:
  • Publications in top-tier conferences
  • Experience with PyTorch
  • Familiarity with LLMs/VLMs

Job Details

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

The AI Platform team is on a mission to advance the state of the art in AI and deliver on our company’s vision for how intelligent cloud and intelligent edge will shape the next phase of innovation. The team includes top scientists and researchers from across Microsoft who are creating a center of excellence in speech, computer vision, and natural language.

 

Within the AI Platform, the Multi-modal Intelligence team (MMI) mission is to make fundamental contributions to advancing the state-of-the-art in AI technology related to Video, Image, Document, and other multimodality inputs. “Documents”, for example, stand at the intersection between NLP and Vision research. To fully understand a document, one needs to borrow from both language and visual (Layout) elements of the document. We explore both single and multimodality inputs – and their synergy - to conduct research on forward-looking topics such as Video Understanding, Information Retrieval, Key-Value extraction, few-shot Named Entity Recognition (NER), hierarchical layout analysis, and many others. 

 

We are looking for Research Interns to work on cutting edge research in Multimodal AI. We are particularly interested in Research Interns with background in AI, NLP, and/or CV, including topics like Video/image understanding, document layout analysis, chart understanding, multi-page multi-document question answering, novel ways of leveraging LLMs for document/video/image understanding and solving problems inherent to large language models (grounding, retrieval-based generation, etc.). Familiarity with modern LLMs/VLMs is a plus, but not required.  

 

Qualifications

Required Qualifications

  • Currently enrolled in a PhD program in Computer Vision, Natural Language Processing, Deep Learning, Machine Learning, AI, or a related field.
  • At least 1 year of experience in NLP, computer vision, Deep learning, or multimodal research with hands-on deep learning experience.

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Proficient algorithmic problem solving and software development skills (Python, C/C++, etc.).
  • Experience with open-source tools such as PyTorch, etc.
  • Publication(s) in top-tier conferences or journals in related fields (e.g., ACL, CVPR, ECCV, ICCV, EMNLP, NAACL, NIPS, ICML, ICLR, IJCV, PAMI, etc.). 

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Similar Jobs

GameChanger  - Director of Business Development

GameChanger

United States (Remote)
1 Month ago
Scanline VFX - Research Intern (Summer 2025)

Scanline VFX

Los Angeles, California, United States (Hybrid)
5 Months ago
Google - Senior Staff Software Engineer, AI/ML GenAI, Google Ads

Google

New York, New York, United States (On-Site)
3 Days ago
Hawk Eye Innovations - Computer Vision Engineer - Level 2

Hawk Eye Innovations

Budapest, Hungary (Hybrid)
3 Weeks ago
Trackman - Electronics Technician/Mechanic

Trackman

(On-Site)
3 Weeks ago
FTF Studios - FTF Senior Programmer

FTF Studios

(Remote)
1 Year ago
Google - Senior Software Engineer, Machine Learning, Google Cloud AI

Google

Kirkland, Washington, United States (On-Site)
4 Days ago
Genies - Machine Learning Infrastructure Engineer, 3D Model Inference & Deployment

Genies

Los Angeles, California, United States (On-Site)
1 Month ago
Rackspace Technology - Principal MLOps Engineer

Rackspace Technology

(Remote)
1 Month ago
GoMotive - Software Engineer, Machine Learning-Perception

GoMotive

India (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Software Engineer Intern (AI Platform)

ByteDance

San Jose, California, United States (On-Site)
1 Week ago
Razer - Solutions Architect

Razer

Singapore (On-Site)
6 Months ago
Trackman - Sales Representative - Vietnam (South, Central)

Trackman

Ho Chi Minh City, Ho Chi Minh City, Vietnam (Hybrid)
11 Months ago
ByteDance - Research Engineer- Foundation Model AI Platform- Seattle

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Trackman - Sim Operations Logistics Coordinator

Trackman

Phoenix, Arizona, United States (On-Site)
4 Months ago
Zazz - Machine Learning Engineer

Zazz

(Remote)
2 Months ago
Mashgin - Deployment Engineer - Georgia

Mashgin

Atlanta, Georgia, United States (Remote)
6 Months ago
Google - Software Engineer Tech Lead, Photos Reminiscing

Google

Bengaluru, Karnataka, India (On-Site)
4 Days ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Vision Generative AI)

ByteDance

San Jose, California, United States (On-Site)
4 Weeks ago
Trackman - Electronics Mechanic/Technician

Trackman

(On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Flow - Assistant Controller

Flow

New York, New York, United States (On-Site)
5 Months ago
Pattern® - Growth Marketing Manager

Pattern®

Lehi, Utah, United States (Hybrid)
7 Months ago
Google - UX Program Manager, Google Cloud

Google

Seattle, Washington, United States (On-Site)
4 Days ago
ByteDance - Software Development Engineer in Test

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
WebFX - Software Engineer

WebFX

Ann Arbor, Michigan, United States (On-Site)
4 Months ago
Highspot - Principal Frontend Web Engineer

Highspot

Seattle, Washington, United States (Hybrid)
6 Months ago
Google - Senior Staff Software Engineer, Site Reliability Engineering, Google Cloud

Google

Kirkland, Washington, United States (On-Site)
3 Days ago
Google - Senior Product Manager, Ads

Google

Mountain View, California, United States (On-Site)
5 Days ago
Google - Data Center Engineer, Power Systems Modeling

Google

Reno, Nevada, United States (On-Site)
4 Days ago
Tencent - Research Intern (NLP)

Tencent

Palo Alto, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Zazz - Machine Learning Engineer

Zazz

(Remote)
2 Months ago
NVIDIA - Solutions Architect for NCP

NVIDIA

Dubai, Dubai, United Arab Emirates (On-Site)
5 Days ago
AI Fund - Curriculum Developer

AI Fund

India (Remote)
6 Months ago
Google - Software Engineering Manager II, Data Center Orchestration

Google

Pittsburgh, Pennsylvania, United States (On-Site)
5 Days ago
NVIDIA - Solutions Architect, AI and ML

NVIDIA

Redmond, Washington, United States (On-Site)
2 Weeks ago
ByteDance - Software Development Engineer - Large Language Models, AML

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Terralogic - SOFTWARE ENGINEER – AIML QA

Terralogic

Bengaluru, Karnataka, India (On-Site)
8 Months ago
Google - Software Developer III, Applied AI, Google Cloud

Google

Waterloo, Ontario, Canada (On-Site)
4 Days ago
Krafton  - Applied Research Scientist/Engineer - LLM Game Agent

Krafton

Seoul, South Korea (On-Site)
3 Weeks ago
Inworld AI - Staff C++ Engineer

Inworld AI

Mountain View, California, United States (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug