Member of Technical Staff, AI Pretraining Platform

1 Month ago • All levels • Research & Development

Job Summary

Job Description

The Member of Technical Staff, AI Pretraining Platform at Microsoft AI will contribute to building a world-leading pretraining platform for developing cutting-edge AI models. Responsibilities include designing and developing Python and CUDA/HIP C++ code for distributed training of multimodal LLMs, building and maintaining infrastructure to handle petabytes of data, collaborating with pretraining and post-training teams to optimize data pipelines, and partnering with product teams and researchers to identify model gaps. The role requires expertise in HPC and parallel programming, and experience with pretraining large AI models. This position is crucial for pushing the boundaries of AI model capabilities and for powering the consumer Copilot experience.
Must have:
  • Python and CUDA/HIP C++ coding
  • HPC and parallel programming experience
  • Experience with AI model pre-training
  • Building and maintaining large-scale infrastructure
  • Collaborating with cross-functional teams

Job Details

Overview

Help build the world’s most advanced training platform at Microsoft AI 

We are on a mission to create the leading pretraining platform to develop the world’s most capable AI frontier models. This platform will span one of the world’s foremost GPU clusters, pushing the boundaries of scale, performance, and reliability. 

The AI Pre-training Platform team at Microsoft AI is responsible for all aspects of infrastructure including scalability, benchmarking, kernel development, performance optimizations, communications, and fault tolerance to support our model pre-training operations. We are an interdisciplinary team of engineers and scientists, learning from each other, and collaborating to create the best models, methods and products. We work closely with the teams that transform pre-trained models into the models that power the consumer Copilot experience. 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. Ideal candidates:

  • Are passionate about the infrastructure enabling large-scale AI model training 
  • Will thrive in a highly collaborative, fast-paced environment 
  • Have a high degree of craftsmanship and pay close attention to details 
  • Demonstrate a proactive attitude and enthusiasm for exploring new methods and technologies 
  • Effectively manage multiple responsibilities and adjust to shifting priorities

Qualifications

  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling or data engineering work 
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work 
  • OR equivalent experience. 
  • Experience with HPC (high-performance computing) and/or parallel programming.
  • Experience with AI model pretraining.
  • Experience working with GPU clusters.

#Copilot #MicrosoftAI

Responsibilities

  • Design and develop Python and CUDA/HIP C++ code that enables distributed training of multimodal LLMs ingesting text, audio, image, or video data.
  • Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models. 
  • Partner with the pretraining and post-training teams to improve our data recipe through rigorous, careful experimentation.
  • Collaborate with the product team and other engineers and researchers across Microsoft AI to identify gaps in the current generation of models. 
  • Embody our culture and values.
