Research Intern - AI Systems and Architecture

3 Months ago • 1+ Years • DevOps • $78,600 PA - $154,560 PA

Job Summary

Job Description

This Research Internship in AI Systems and Architecture at Microsoft's Azure Hardware Systems & Infrastructure (AHSI) focuses on performance modeling of large-scale machine learning systems. Responsibilities include developing and contributing to an in-house performance modeling tool, evaluating performance improvements, conducting bottleneck analysis, building a framework for parallel performance simulations, and developing testing frameworks and testbenches. The intern will also integrate the performance model with power & TCO models, develop a cloud-based simulation database, create data analytics frameworks and visualization tools, and help improve software development practices. Collaboration with the larger team on product requirements and feature implementation is also expected. The internship is 12 weeks long and requires acceptance to or current enrollment in a PhD program in Computer Science or a related STEM field, along with at least one year of experience in performance analysis for AI accelerators.
Must have:
  • Accepted to or currently enrolled in a PhD program in Computer Science or a related STEM field
  • 1+ years experience with AI accelerator performance analysis
  • Performance modeling tool development
  • Bottleneck analysis and feature enhancement
  • Large-scale parallel performance simulations
Good to have:
  • Collaboration skills
  • Cross-group and cross-culture collaboration
  • Creative problem-solving
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

Join our Strategic Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems & Infrastructure (AHSI) organization, the team behind Microsoft’s expanding cloud infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. Microsoft delivers more than 200 online services to more than one billion individuals worldwide. We deliver the core infrastructure and foundational technologies for Microsoft's cloud businesses, including Microsoft Azure, Bing, MSN, Office 365, OneDrive, Skype, Teams and Xbox Live. The SPARC organization manages Azure’s hardware roadmap from architecture concept through production for all of Microsoft’s current and future online services.

Qualifications

Required Qualifications

  • Accepted to or currently enrolled in a PhD program in Computer Science or a related STEM field.
  • At least 1 year of experience with performance analysis for AI accelerators.

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Ability to collaborate effectively with other researchers and product development teams.
  • Proficient interpersonal skills, including cross-group and cross-culture collaboration.
  • Ability to think unconventionally to derive creative and innovative solutions.

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Additional Responsibilities

  • Develop and contribute to an in-house performance modeling tool for large-scale machine learning systems.
  • Evaluate ideas for performance improvement, along with bottleneck analysis and feature enhancement.
  • Build a framework for running large-scale parallel performance simulations using cloud-based compute infrastructure.
  • Develop a testing framework and testbenches that enable operator-level unit tests and end-to-end application tests for the performance model.
  • Integrate the performance model with the power & TCO models to project application-level Perf/W and Perf/$ metrics across workloads.
  • Develop a cloud-based performance simulation database for storing large-scale data from design-space exploration experiments.
  • Develop a data-analytics framework, along with debug tools and automation, for easier retrieval of performance data based on user queries.
  • Develop and maintain performance dashboards and visualization tools to improve the analysis framework.
  • Formalize and improve general software development practices, including codebase maintenance, code review, feature development and software design reviews.
  • Integrate a CI/CD pipeline into the Azure DevOps software development process.
  • Handle general troubleshooting and debugging, including identifying common performance bottlenecks, and develop performance comparison tools.
  • Collaborate with the larger team to define product requirements, feature improvements and implementation.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.