Multimodal Researcher: PhD Internship Opportunities - Brazil, Multiple Locations

1 week ago • Up to 1 year

About the job

Job Description

Microsoft's Multimodal Intelligence team seeks PhD students in Brazil for a remote Spring internship. The internship involves working on real-world projects with global teams, focusing on advancements in video processing, video anomaly detection, document understanding, and multimodal model enhancement (SMM/LMM). Responsibilities include analyzing model performance on large datasets, implementing scalable systems, and collaborating on development from prototype to production. The team pushes the boundaries of multimodality, offering interns a chance to contribute to cutting-edge technology and collaborate on projects related to Retrieval Augmented Generation (RAG) and orchestration between multiple SMMs/LMMs and other tools.
Must have:
  • Currently pursuing a PhD in CS, statistics, mathematics, or a related field
  • At least one semester/quarter of studies remaining after the internship
  • Analyze multimodal model performance
  • Implement scalable multimodal systems
  • Collaborate with team members
Perks:
  • Industry-leading healthcare
  • Educational resources
  • Product and service discounts
  • Savings and investments
  • Maternity and paternity leave
  • Generous time off
  • Giving programs
  • Networking opportunities

Overview

This opportunity is open to PhD students in Brazil. The internship will take place in the Spring and is a remote position.

 

Come build community, explore your passions and do your best work at Microsoft with thousands of university graduates from every corner of the world. This opportunity will allow you to bring your aspirations, talent, potential—and excitement for the journey ahead.   

 

The Multimodal Intelligence team is at the forefront of developing state-of-the-art solutions to novel problems in the fields of computer vision, machine learning, and document and natural language processing. Our team of experienced researchers and engineers collaborates to push the boundaries of what's possible with multimodality. We are looking for motivated and highly skilled research interns to join our dynamic team. Some of our current projects include developing new frameworks for video processing, video anomaly detection, and document understanding, and enhancing the capabilities of Small Multimodal Models (SMM) and Large Multimodal Models (LMM) through post-training, alignment, Retrieval Augmented Generation (RAG), and orchestration between multiple SMMs/LMMs and other tools.

 

At Microsoft, interns work on real-world projects in collaboration with teams across the world, while having fun along the way. You’ll be empowered to build community, explore your passions and achieve your goals. This is your chance to bring your solutions and ideas to life while working on cutting-edge technology.

 
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Qualifications

Required Qualifications  

  • Currently pursuing a PhD degree in computer science, statistics, mathematics, or related technical field.
  • Must have at least one additional quarter/semester of school remaining following the completion of the internship.  

Responsibilities

  • Analyze the performance of large and small multimodal models on large-scale datasets and in computer vision and document understanding applications.
  • Implement prototypes of scalable systems in multimodal applications. 
  • Collaborate closely with team members on developing systems from prototyping to production level. 

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

  • Industry-leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Dublin, County Dublin, Ireland (On-Site)

Beijing, Beijing, China (On-Site)

Taipei City, Taiwan (On-Site)

Redmond, Washington, United States (On-Site)

San José, San José Province, Costa Rica (On-Site)

Vancouver, British Columbia, Canada (On-Site)
