Research Intern - Multimodal AI Research

4 Weeks ago • 1 Years + • Artificial Intelligence • $78,600 PA - $154,560 PA

Job Summary

Job Description

Microsoft's AI Platform team seeks Research Interns for its Multimodal Intelligence (MMI) team. The internship involves cutting-edge research in multimodal AI, focusing on video, image, and document understanding. Responsibilities include collaborating with researchers, presenting findings, and contributing to projects such as video understanding, information retrieval, and key-value extraction. Candidates should possess a PhD background in a relevant field (AI, NLP, CV) and at least one year of hands-on deep learning experience. Familiarity with LLMs/VLMs is a plus. The internship is a 12-week program, with interns paired with mentors and expected to contribute to the team's vibrant research community.
Must have:
  • PhD in relevant field
  • 1+ years deep learning experience
  • NLP/CV/AI background
  • Proficient in Python
  • Collaboration skills
Good to have:
  • LLM/VLM familiarity
  • Publications in top conferences
  • Experience with PyTorch
  • C/C++ proficiency
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

The AI Platform team is on a mission to advance the state of the art in AI and deliver on our company’s vision for how intelligent cloud and intelligent edge will shape the next phase of innovation. The team includes top scientists and researchers from across Microsoft who are creating a center of excellence in speech, computer vision, and natural language.

 

Within the AI Platform, the Multi-modal Intelligence team (MMI) mission is to make fundamental contributions to advancing the state-of-the-art in AI technology related to Video, Image, Document, and other multimodality inputs. “Documents”, for example, stand at the intersection between NLP and Vision research. To fully understand a document, one needs to borrow from both language and visual (Layout) elements of the document. We explore both single and multimodality inputs – and their synergy - to conduct research on forward-looking topics such as Video Understanding, Information Retrieval, Key-Value extraction, few-shot Named Entity Recognition (NER), hierarchical layout analysis, and many others. 

 

We are looking for Research Interns to work on cutting edge research in Multimodal AI. We are particularly interested in Research Interns with background in AI, NLP, and/or CV, including topics like Video/image understanding, document layout analysis, chart understanding, multi-page multi-document question answering, novel ways of leveraging LLMs for document/video/image understanding and solving problems inherent to large language models (grounding, retrieval-based generation, etc.). Familiarity with modern LLMs/VLMs is a plus, but not required.  

 

Qualifications

Required Qualifications

  • Currently enrolled in a PhD program in Computer Vision, Natural Language Processing, Deep Learning, Machine Learning, AI, or a related field.
  • At least 1 year of experience in NLP, computer vision, Deep learning, or multimodal research with hands-on deep learning experience.

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Proficient algorithmic problem solving and software development skills (Python, C/C++, etc.).
  • Experience with open-source tools such as PyTorch, etc.
  • Publication(s) in top-tier conferences or journals in related fields (e.g., ACL, CVPR, ECCV, ICCV, EMNLP, NAACL, NIPS, ICML, ICLR, IJCV, PAMI, etc.). 

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Similar Jobs

ByteDance - Machine Learning Engineer-Model Training Infrastructure (AML-Engine)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Rockstar Games - C++ Software Engineer, FiveM (Mid/Senior)

Rockstar Games

London, England, United Kingdom (On-Site)
5 Months ago
Riot Games - Principal Software Engineer, Product Tech-Lead - Unpublished R&D Product

Riot Games

Dublin, County Dublin, Ireland (On-Site)
3 Months ago
Inworld AI - Senior Unreal Engine Developer - USA

Inworld AI

Mountain View, California, United States (Remote)
3 Months ago
Escape Velocity Entertainment - Principal Technical Designer | North America | Canada | Europe | Fully Remote

Escape Velocity Entertainment

(Remote)
3 Months ago
Ello - Tech Lead, Machine Learning

Ello

Canada (On-Site)
3 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model, Speech & Audio) - 2024 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Zoox - Senior/Staff Machine Learning Engineer - Prediction & Behavior ML

Zoox

Boston, Massachusetts, United States (Hybrid)
4 Months ago
Google - Software Engineer III, Machine Learning, Search

Google

Mountain View, California, United States (On-Site)
3 Months ago
Dolby Laboratories - Senior Foundational AI Researcher

Dolby Laboratories

Bengaluru, Karnataka, India (Hybrid)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Riot Games - Principal Software Engineer - VALORANT Experience Team

Riot Games

Los Angeles, California, United States (On-Site)
3 Months ago
Playrix - Senior C++ Software Engineer (Build System)

Playrix

Portugal (Remote)
3 Months ago
ByteDance - Senior Software Engineer, Global Payment Risk & Compliance

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
PlayStation Global - Senior Audio Programmer - 12 month contract

PlayStation Global

Guildford, England, United Kingdom (On-Site)
2 Months ago
Microsoft - Software Quality Engineer

Microsoft

Bengaluru, Karnataka, India (On-Site)
4 Weeks ago
Microsoft - Software Engineer (Taipei)

Microsoft

Taipei City, Taiwan (On-Site)
1 Month ago
ION - Principal Software Engineer, Italy

ION

Rome, Lazio, Italy (On-Site)
4 Months ago
NVIDIA - Senior Solutions Architect, CSP System

NVIDIA

Beijing, Beijing, China (On-Site)
1 Month ago
Mashgin - Senior Software Engineer, Infrastructure

Mashgin

Palo Alto, California, United States (Hybrid)
4 Months ago
ByteDance - Video Codec Algorithm Intern (Multimedia Streaming)

ByteDance

San Diego, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Nagarro - Staff Engineer, ETL

Nagarro

California, United States (On-Site)
4 Months ago
The Walt Disney Company - Engineering Services Manager (Attractions)

The Walt Disney Company

Lake Buena Vista, Florida, United States (On-Site)
2 Weeks ago
Netflix - Product Manager, CMP (Ads)

Netflix

New York, New York, United States (On-Site)
3 Weeks ago
WebMD - Implementation Manager

WebMD

Newark, New Jersey, United States (On-Site)
3 Months ago
On Location - Manager, Corporate Marketing – FIFA World Cup 26™

On Location

New York, New York, United States (On-Site)
1 Month ago
Paypal - Senior Staff Software Engineer, Mobile

Paypal

San Jose, California, United States (Hybrid)
3 Months ago
My Fitness Pal - Backend Software Engineer III

My Fitness Pal

United States (Remote)
1 Month ago
Zoox - Senior/Staff Software Engineer, ML Performance Optimization

Zoox

Seattle, Washington, United States (On-Site)
3 Months ago
Zones - Services Solutions Architect, Cloud

Zones

United States (Remote)
2 Months ago
Rockstar Games - Director of Human Resources

Rockstar Games

Carlsbad, California, United States (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Epic Games - Développeur sénior, Apprentissage automatique

Epic Games

Montreal, Quebec, Canada (On-Site)
1 Month ago
Salesforce - Lead Applied Research Scientist - Responsible AI

Salesforce

San Francisco, California, United States (On-Site)
3 Months ago
Meta - Research Scientist Intern, Smart Glasses in Wearables AI (PhD)

Meta

Menlo Park, California, United States (On-Site)
3 Months ago
CharacterAI - Research Engineer - Multimodal

CharacterAI

Menlo Park, California, United States (On-Site)
6 Months ago
Tencent - Senior Researcher: Artificial General Intelligence (Natural Language Processing)

Tencent

Bellevue, Washington, United States (On-Site)
6 Months ago
Meta - AI Research Scientist, Language - Generative AI

Meta

Bellevue, Washington, United States (On-Site)
3 Months ago
Mistplay - Senior Data Scientist II

Mistplay

Montreal, Quebec, Canada (Hybrid)
2 Weeks ago
Interface AI - Vice President of Engineering

Interface AI

United States (Remote)
6 Days ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

New York, New York, United States (Hybrid)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

London, England, United Kingdom (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Mountain View, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug