Research Intern - Multimodal AI Research

3 Months ago • 1 Years + • Artificial Intelligence • $78,600 PA - $154,560 PA

Job Summary

Job Description

Microsoft's AI Platform team seeks Research Interns for its Multimodal Intelligence (MMI) team. The internship involves cutting-edge research in multimodal AI, focusing on video, image, and document understanding. Responsibilities include collaborating with researchers, presenting findings, and contributing to projects such as video understanding, information retrieval, and key-value extraction. Candidates should possess a PhD background in a relevant field (AI, NLP, CV) and at least one year of hands-on deep learning experience. Familiarity with LLMs/VLMs is a plus. The internship is a 12-week program, with interns paired with mentors and expected to contribute to the team's vibrant research community.
Must have:
  • PhD in relevant field
  • 1+ years deep learning experience
  • NLP/CV/AI background
  • Proficient in Python
  • Collaboration skills
Good to have:
  • LLM/VLM familiarity
  • Publications in top conferences
  • Experience with PyTorch
  • C/C++ proficiency
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

The AI Platform team is on a mission to advance the state of the art in AI and deliver on our company’s vision for how intelligent cloud and intelligent edge will shape the next phase of innovation. The team includes top scientists and researchers from across Microsoft who are creating a center of excellence in speech, computer vision, and natural language.

 

Within the AI Platform, the Multi-modal Intelligence team (MMI) mission is to make fundamental contributions to advancing the state-of-the-art in AI technology related to Video, Image, Document, and other multimodality inputs. “Documents”, for example, stand at the intersection between NLP and Vision research. To fully understand a document, one needs to borrow from both language and visual (Layout) elements of the document. We explore both single and multimodality inputs – and their synergy - to conduct research on forward-looking topics such as Video Understanding, Information Retrieval, Key-Value extraction, few-shot Named Entity Recognition (NER), hierarchical layout analysis, and many others. 

 

We are looking for Research Interns to work on cutting edge research in Multimodal AI. We are particularly interested in Research Interns with background in AI, NLP, and/or CV, including topics like Video/image understanding, document layout analysis, chart understanding, multi-page multi-document question answering, novel ways of leveraging LLMs for document/video/image understanding and solving problems inherent to large language models (grounding, retrieval-based generation, etc.). Familiarity with modern LLMs/VLMs is a plus, but not required.  

 

Qualifications

Required Qualifications

  • Currently enrolled in a PhD program in Computer Vision, Natural Language Processing, Deep Learning, Machine Learning, AI, or a related field.
  • At least 1 year of experience in NLP, computer vision, Deep learning, or multimodal research with hands-on deep learning experience.

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Proficient algorithmic problem solving and software development skills (Python, C/C++, etc.).
  • Experience with open-source tools such as PyTorch, etc.
  • Publication(s) in top-tier conferences or journals in related fields (e.g., ACL, CVPR, ECCV, ICCV, EMNLP, NAACL, NIPS, ICML, ICLR, IJCV, PAMI, etc.). 

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Similar Jobs

OUTFIT7 - Game Developer (C++)

OUTFIT7

Ljubljana, Ljubljana, Slovenia (On-Site)
7 Months ago
ByteDance - Procurement Manager - Professional Services, AMS

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Luxoft - C/C++ Lead Software Developer with ADAS, ASPICE, Korean speaker

Luxoft

Seoul, South Korea (On-Site)
5 Months ago
SLAY - Senior React Native Engineer

SLAY

Berlin, Berlin, Germany (On-Site)
5 Months ago
Google - Software Engineer, Early Career (For Women in Tech Candidates)

Google

State Of Minas Gerais, Brazil (On-Site)
3 Months ago
ByteDance - Research Scientist Graduate (Foundation Model - Generative AI) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
4 Months ago
Zoox - Software Engineer - Perception

Zoox

Boston, Massachusetts, United States (Hybrid)
6 Months ago
NVIDIA - Senior Research Engineer, ML Data Pipelines

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
PwC - Conversational AI Developer- Senior Associate

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Hedra - Research Scientist

Hedra

New York, New York, United States (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Behaviour Interactive - Senior/Principal AI Programmer  | Programmeur·euse Senior·e/Principal·e en IA

Behaviour Interactive

Montreal, Quebec, Canada (Hybrid)
7 Months ago
Meta - Production Engineer

Meta

Warsaw, Masovian Voivodeship, Poland (On-Site)
5 Months ago
DICE - Release Software Engineer

DICE

Stockholm, Stockholm County, Sweden (On-Site)
2 Months ago
Warhorse Studios - DevOps / C# Tools Programmer

Warhorse Studios

Prague, Prague, Czechia (On-Site)
1 Month ago
SmileGate - [Next Crossfire] UE5 엔진 클라이언트 담당

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
3 Months ago
ByteDance - Large Language Model Research Scientist Graduate (Doubao-Seed) - 2024 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Zoox - Software Engineering Manager: Operating Systems and Vehicle Configuration

Zoox

Foster City, California, United States (Hybrid)
6 Months ago
Fluence - Sr. Quality Assurance Engineer II

Fluence

Houston, Texas, United States (Hybrid)
6 Months ago
Equivalent Jobs - FPGA Engineer

Equivalent Jobs

(Remote)
2 Months ago
Blizzard Entertainment - Senior Test Analyst, Diablo IV | Austin, TX

Blizzard Entertainment

Irvine, California, United States (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

DraftKings - Senior Desktop Support Specialist

DraftKings

New York, New York, United States (On-Site)
1 Month ago
Activision - Player Support Team Lead

Activision

El Segundo, California, United States (On-Site)
1 Month ago
On Location - Sr. Coordinator - Design, Production & Hospitality

On Location

Raleigh, North Carolina, United States (Hybrid)
4 Months ago
PENN Interactive - Director, Human Resources Project Management and Change Management

PENN Interactive

Philadelphia, Pennsylvania, United States (Hybrid)
2 Months ago
Meta - Software Engineer, Product

Meta

Redmond, Washington, United States (Remote)
5 Months ago
Jam City - Narrative Designer

Jam City

San Francisco, California, United States (Hybrid)
2 Months ago
Inworld AI - Staff Cloud DevOps/Site Reliability Engineer (SRE) - USA

Inworld AI

Mountain View, California, United States (On-Site)
8 Months ago
Hasbro - Associate Fraud Analyst

Hasbro

Renton, Washington, United States (On-Site)
2 Months ago
Life church - Life.Church Central Internship

Life church

Edmond, Oklahoma, United States (On-Site)
6 Months ago
Mashgin - Deployment Engineer - Tennessee

Mashgin

Nashville, Tennessee, United States (Remote)
6 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

ANS Commerce - Data Scientist

ANS Commerce

Gurugram, Haryana, India (On-Site)
5 Months ago
Meta - Software Engineer, Machine Learning

Meta

Sunnyvale, California, United States (On-Site)
5 Months ago
ClinDCast - GenAI Application Lead

ClinDCast

Austin, Texas, United States (Remote)
9 Months ago
Nagarro - Associate Principal Consultant - Business Analyst

Nagarro

Colombia (Remote)
2 Months ago
Devrev - Engineering Leader - Applied AI Engineering

Devrev

Chennai, Tamil Nadu, India (On-Site)
4 Months ago
Google - Staff Software Engineer, Cloud ML Compute Services

Google

Sunnyvale, California, United States (On-Site)
3 Months ago
Google - Software Engineer, PhD, Early Career, Campus, AI/Machine Learning, 2025 Start

Google

Mountain View, California, United States (On-Site)
5 Months ago
CharacterAI - Research Engineer - Multimodal

CharacterAI

Menlo Park, California, United States (On-Site)
8 Months ago
Microsoft - Research Intern - AI-driven Hardware Design

Microsoft

Vancouver, British Columbia, Canada (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug