Member of Technical Staff, AI Pretraining Platform

1 Day ago • All levels • Research & Development

Job Summary

Job Description

The Member of Technical Staff, AI Pretraining Platform at Microsoft AI will contribute to building a world-leading pre-training platform for developing cutting-edge AI models. Responsibilities involve designing and developing Python and CUDA/HIP C++ code for distributed training of multimodal LLMs, building and maintaining infrastructure to handle petabytes of data, collaborating with pre-training and post-training teams to optimize data pipelines, and partnering with product teams and researchers to identify model gaps. The role requires expertise in HPC, parallel programming, and experience with pre-training large AI models. This position is crucial for pushing the boundaries of AI model capabilities and powers the consumer Copilot experience.
Must have:
  • Python and CUDA/HIP C++ coding
  • HPC and parallel programming experience
  • Experience with AI model pre-training
  • Building and maintaining large-scale infrastructure
  • Collaborating with cross-functional teams

Job Details

Overview

Help build the world’s most advanced training platform at Microsoft AI 

We are on a mission to create the leading pretraining platform to develop the world’s most capable AI frontier models. This platform will span one of the world’s foremost GPU clusters, pushing the boundaries of scale, performance, and reliability. 

The AI Pre-training Platform team at Microsoft AI is responsible for all aspects of infrastructure including scalability, benchmarking, kernel development, performance optimizations, communications, and fault tolerance to support our model pre-training operations. We are an interdisciplinary team of engineers and scientists, learning from each other, and collaborating to create the best models, methods and products. We work closely with the teams that transform pre-trained models into the models that power the consumer Copilot experience. 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 

  • Are passionate about the infrastructure enabling large-scale AI model training 
  • Will thrive in a highly collaborative, fast-paced environment 
  • Have a high degree of craftsmanship and pay close attention to details 
  • Demonstrate a proactive attitude and enthusiasm for exploring new methods and technologies 
  • Effectively manage multiple responsibilities and can adjust to shifting priorities.  

Qualifications

  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling or data engineering work 
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work 
  • OR equivalent experience. 
  • Experience with HPC (High performance computing) and/ or parallel programming.
  • Experience in the area of pretraining
  • Experience working with GPU clusters

 

 

 

#Copilot #MicrosoftAI

Responsibilities

  • Design and develop Python and CUDA/HIP C++ code that enable distributed training of multimodal LLMs ingesting text, audio, images, or video data. 
  • Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models. 
  • Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation. 
  • Collaborate with the product team and other engineers and researchers across Microsoft AI to identify gaps in the current generation of models. 
  • Embody our and . 

Similar Jobs

Argus Labs - Site Reliability Engineer (LATAM)

Argus Labs

(Remote)
3 Weeks ago
PwC - Data Architect – Technology Consulting

PwC

Prague, Prague, Czechia (On-Site)
6 Months ago
Playrix - Senior Release Support Engineer

Playrix

Ukraine (Remote)
5 Months ago
Ubisoft - Technical Director, Animation

Ubisoft

Annecy, Auvergne-Rhône-Alpes, France (On-Site)
2 Days ago
Canva - Staff Machine Learning Engineer - User Voice

Canva

Sydney, New South Wales, Australia (Remote)
1 Week ago
Krafton  - General Affairs/Welfare Operations Manager

Krafton

Seoul, South Korea (On-Site)
2 Days ago
ByteDance - Machine Learning Research Scientist, AI for Science

ByteDance

Seattle, Washington, United States (On-Site)
4 Months ago
Google - Senior Software Engineer, Google Research

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
18 Hours ago
Google - Software Engineer III, Pixel Connectivity

Google

New Taipei, New Taipei City, Taiwan (On-Site)
16 Hours ago
Google - Software Engineering Intern, 2025

Google

Tokyo, Japan (On-Site)
16 Hours ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Sporty Group - Information Security Engineer

Sporty Group

(Remote)
9 Months ago
Google - Senior Software Engineering Manager, Wear OS Platform

Google

Mountain View, California, United States (On-Site)
18 Hours ago
Playrix - Technical Director (Game Project)

Playrix

Armenia (Remote)
5 Months ago
NVIDIA - Senior ASIC Design Engineer

NVIDIA

California, United States (Hybrid)
4 Weeks ago
Playrix - Director of Engineering

Playrix

Almaty, Almaty Region, Kazakhstan (Remote)
5 Months ago
Google - Software Engineer III

Google

Fremont, California, United States (On-Site)
17 Hours ago
ByteDance - Senior Backend Software Engineer - Global E-Commerce Supply Chain Billing & Settlement

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
The Walt Disney Company - Technical Assistant

The Walt Disney Company

London, England, United Kingdom (Hybrid)
2 Days ago
NVIDIA - Senior Software QA Automation Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
Framestore - Senior Asset Generalist

Framestore

New York, New York, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Zürich, Zurich, Switzerland

PwC - Manager / Senior Manager for EPM & Analytics with SAP

PwC

Zürich, Zurich, Switzerland (On-Site)
6 Months ago
Tesla - Service Advisor

Tesla

Zürich, Zurich, Switzerland (On-Site)
2 Months ago
PwC - Corporate Tax Manager Zentralschweiz

PwC

Lucerne, Lucerne, Switzerland (On-Site)
6 Months ago
PwC - Auditor - Treasury and Commodity Trading

PwC

Geneva, Geneva, Switzerland (On-Site)
6 Months ago
Google - Research Scientist, Paradigms of Intelligence

Google

Zürich, Zurich, Switzerland (On-Site)
17 Hours ago
Sonar Source - Major Account Manager - DACH

Sonar Source

Geneva, Geneva, Switzerland (On-Site)
4 Months ago
PwC - Director – Operations and Supply Chain Management Consulting 80-100%

PwC

Zürich, Zurich, Switzerland (On-Site)
6 Months ago
PwC - Senior Manager Actuarial Services

PwC

Zürich, Zurich, Switzerland (On-Site)
6 Months ago
Fluence - Quality Assurance Manager

Fluence

Zürich, Zurich, Switzerland (Hybrid)
6 Months ago
PwC - Berater:in CRM - SAP Customer Experience

PwC

Zürich, Zurich, Switzerland (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

ByteDance - Applied Scientist Intern (Computational Modeling & Optimization)

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
NVIDIA - Senior DFT Verification Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
RoofStack - Software Architect

RoofStack

İstanbul, İstanbul, Türkiye (On-Site)
3 Weeks ago
NVIDIA - Software Manager, DOCA Verification

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
1 Month ago
Google - Staff Research Scientist

Google

Goleta, California, United States (On-Site)
18 Hours ago
Ubisoft - Senior R&D Engineer

Ubisoft

Pune, Maharashtra, India (On-Site)
2 Days ago
Tesla - Senior Mechanical Design Engineer - Seating

Tesla

Berlin, Berlin, Germany (On-Site)
2 Months ago
NVIDIA - Senior Chip Architect

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
2 Months ago
Google - Software Engineer

Google

São Paulo, State Of São Paulo, Brazil (On-Site)
16 Hours ago
NVIDIA - DFX Methodology Engineer

NVIDIA

Canada (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Vancouver, British Columbia, Canada (On-Site)

Beijing, Beijing, China (On-Site)

Bengaluru, Karnataka, India (On-Site)

Hyderabad, Telangana, India (Hybrid)

Dublin, County Dublin, Ireland (On-Site)

New York, New York, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug