Research Intern - AI Systems and Architecture

1 Month ago • 1 Years + • DevOps • $78,600 PA - $154,560 PA

Job Summary

Job Description

This Research Internship at Microsoft's Azure Hardware Systems & Infrastructure (AHSI) focuses on AI systems and architecture within the Strategic Planning and Architecture (SPARC) team. Interns will contribute to the development and improvement of an in-house performance modeling tool for large-scale machine learning systems, conduct performance analysis, and build frameworks for large-scale parallel simulations. Responsibilities include bottleneck analysis, feature enhancement, testing framework development, integration with power/TCO models, database development, data analytics, dashboard creation, and collaboration with a larger team. The internship is a 12-week program involving mentoring, collaboration, and presentation of findings. Successful candidates will have a strong background in performance analysis for AI accelerators and experience in relevant software development practices.
Must have:
  • PhD in CS or related field
  • 1+ year experience with AI accelerator performance analysis
  • Performance modeling tool development
  • Bottleneck analysis & feature enhancement
  • Large-scale parallel simulation framework
Good to have:
  • Collaboration skills
  • Creative problem-solving
  • Cloud-based compute infrastructure experience

Job Details

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

Join our Strategic Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems & Infrastructure (AHSI) organization and be a part of the organization behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. Microsoft delivers more than 200 online services to more than one billion individuals worldwide and AHSI is the team behind our expanding cloud infrastructure. We deliver the core infrastructure and foundational technologies for Microsoft's cloud businesses including Microsoft Azure, Bing, MSN, Office 365, OneDrive, Skype, Teams and Xbox Live. The SPARC organization manages Azure’s hardware roadmap from architecture concept through production for all of Microsoft’s current and future on-line services. 

Qualifications

Required Qualifications

  • Accepted or currently enrolled in a PhD program in Computer Science or a related STEM field.
  • At least 1 year of experience with performance analysis for AI accelerators.

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Ability to collaborate effectively with other researchers and product development teams.
  • Proficient interpersonal skills, cross-group, and cross-culture collaboration.
  • Ability to think unconventionally to derive creative and innovative solutions.

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.


#AHSI #MSFTNSBE25

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Additional Responsibilities

  • Responsible for developing and contributing to an in-house performance modeling tool for large scale machine learning systems.
  • Responsible for evaluation of ideas for performance improvement along with bottleneck analysis and feature enhancement.
  • Responsible for building framework for running large scale parallel performance simulations using cloud-based compute infrastructure.
  • Developing a testing framework and testbenches for enabling operator level unit tests and end-to-end application tests for the performance model.
  • Integrate performance model with power & TCO model to project application level Perf/W and Perf/$ metrics across workloads.
  • Develop cloud-based performance simulation database for storing large scale data from design-space exploration experiments.
  • Develop data-analytics framework along with debug tools and automation for easier retrieval of performance data based on user queries.
  • Develop and maintain performance dashboards and visualization tools for improving the analysis framework.
  • Formalize and improve general software development practices including codebase maintenance, code review, feature development and software design reviews.
  • Integrating CI/CD pipeline into Azure devops software development process.
  • General troubleshooting and debug processes including common performance bottleneck limiters and developing performance comparison tools.
  • Collaborate with larger team to define product requirements, feature improvements and implementation.

Similar Jobs

Playrix - Senior Release Support Engineer

Playrix

Serbia (Remote)
7 Months ago
The Walt Disney Company - Senior Release Engineer

The Walt Disney Company

New York, New York, United States (On-Site)
1 Month ago
Aristocrat Gaming - Senior Automation Test Engineer

Aristocrat Gaming

Warsaw, Masovian Voivodeship, Poland (Remote)
1 Month ago
Rackspace Technology - Senior GCP Cloud Engineer

Rackspace Technology

United States (Remote)
1 Month ago
Good Job Games - Software Engineer

Good Job Games

İstanbul, Türkiye (On-Site)
6 Months ago
N-iX - Senior DevOps Engineer

N-iX

Ukraine (Remote)
1 Month ago
DEVOTEAM - Data Driven | MLOps Engineer

DEVOTEAM

Lisbon, Lisbon, Portugal (Remote)
7 Months ago
Glean - Solutions Engineer - East

Glean

(Remote)
6 Months ago
Google - Software Engineer III, Performance, Google Cloud

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Ajmera Infotech - Kubernetes Experts

Ajmera Infotech

Bengaluru, Karnataka, India (On-Site)
10 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Canva - Backend Engineer - Internationalization

Canva

Beijing, Beijing, China (Remote)
2 Months ago
Rush Street Interactive - Senior Full-Stack Automation Engineer

Rush Street Interactive

Estonia (Hybrid)
2 Months ago
Zazz - Machine Learning Engineer

Zazz

(Remote)
3 Months ago
Google - Customer Engineer, Machine Learning, Google Cloud

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
Rockstar Games - Associate Principal Technical Artist: Performance Capture

Rockstar Games

London, England, United Kingdom (On-Site)
1 Month ago
Playtika - PHP Developer

Playtika

Poland (Hybrid)
6 Months ago
Sporty Group - Technical Director

Sporty Group

(Remote)
5 Months ago
NVIDIA - Technical Program Manager, Developer Infrastructure

NVIDIA

Redmond, Washington, United States (On-Site)
2 Months ago
PwC - IN_Senior Associate _Java Developer _Data & Analytics _Advisory _PAN India

PwC

Kolkata, West Bengal, India (On-Site)
7 Months ago
Ajmera Infotech - Site Reliability Engineer (SRE) - Kubernetes

Ajmera Infotech

Bengaluru, Karnataka, India (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Mountain View, California, United States

Crunchyroll - Staff Product Manager - Product Partnerships (User Acquisition & Retention)

Crunchyroll

San Francisco, California, United States (On-Site)
2 Months ago
Framestore - FREELANCE: VFX PRODUCERS - CHICAGO

Framestore

Chicago, Illinois, United States (On-Site)
12 Months ago
Tencent - Production Director

Tencent

Palo Alto, California, United States (On-Site)
6 Months ago
Trek - Assembler - Part Time Seasonal

Trek

Madison, Wisconsin, United States (On-Site)
3 Months ago
Evolution - Recruiter - High Volume

Evolution

Philadelphia, Pennsylvania, United States (On-Site)
1 Month ago
GameChanger  - Senior Strategy Data Analyst

GameChanger

New York, New York, United States (Hybrid)
2 Months ago
Meta - Software Engineering Manager, Product Infrastructure

Meta

Seattle, Washington, United States (Remote)
6 Months ago
2K - PC Compatibility Technician

2K

Las Vegas, Nevada, United States (On-Site)
2 Months ago
ByteDance - Senior Backend Software Engineer - Global E-Commerce Supply Chain Billing & Settlement

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Google - Data Center Facilities Technician, Fire Life Safety

Google

Pryor, Oklahoma, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

ION - Software Architect - Java Multi-Tenant SAAS Cloud Native

ION

Pune, Maharashtra, India (On-Site)
7 Months ago
 Vizrt - Director of Platform

Vizrt

Lisbon, Lisbon, Portugal (Remote)
1 Month ago
Anavation - Senior Cloud Developer

Anavation

Clarksburg, West Virginia, United States (Remote)
1 Month ago
Crunchyroll - DevOps Engineer - Cloud Reliability

Crunchyroll

San Francisco, California, United States (Hybrid)
2 Months ago
Omnissa - Member of Technical Staff (Automation)

Omnissa

Bengaluru, Karnataka, India (Hybrid)
7 Months ago
Omnissa - Staff Engineer (C++,MacOS Internals)

Omnissa

Bengaluru, Karnataka, India (Hybrid)
7 Months ago
Google - Senior Site Reliability Manager, Site Reliability Engineering

Google

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Ajmera Infotech - DevOps Engineer

Ajmera Infotech

San Jose, California, United States (On-Site)
8 Months ago
PENN Interactive - Engineering Manager, ML Platform

PENN Interactive

Philadelphia, Pennsylvania, United States (Hybrid)
3 Months ago
Rackspace Technology - AWS Migration Engineer

Rackspace Technology

India (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Vancouver, British Columbia, Canada (On-Site)

Mountain View, California, United States (Hybrid)

Shenzhen, Guangdong Province, China (On-Site)

Noida, Uttar Pradesh, India (On-Site)

Redmond, Washington, United States (On-Site)

Paris, Île-de-France, France (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug