Member of Technical Staff, AI Pretraining Platform

2 Months ago • All levels • Research Development

Job Summary

Job Description

The Member of Technical Staff, AI Pretraining Platform at Microsoft AI will contribute to building a world-leading pre-training platform for developing cutting-edge AI models. Responsibilities involve designing and developing Python and CUDA/HIP C++ code for distributed training of multimodal LLMs, building and maintaining infrastructure to handle petabytes of data, collaborating with pre-training and post-training teams to optimize data pipelines, and partnering with product teams and researchers to identify model gaps. The role requires expertise in HPC, parallel programming, and experience with pre-training large AI models. This position is crucial for pushing the boundaries of AI model capabilities and powers the consumer Copilot experience.
Must have:
  • Python and CUDA/HIP C++ coding
  • HPC and parallel programming experience
  • Experience with AI model pre-training
  • Building and maintaining large-scale infrastructure
  • Collaborating with cross-functional teams

Job Details

Overview

Help build the world’s most advanced training platform at Microsoft AI 

We are on a mission to create the leading pretraining platform to develop the world’s most capable AI frontier models. This platform will span one of the world’s foremost GPU clusters, pushing the boundaries of scale, performance, and reliability. 

The AI Pre-training Platform team at Microsoft AI is responsible for all aspects of infrastructure including scalability, benchmarking, kernel development, performance optimizations, communications, and fault tolerance to support our model pre-training operations. We are an interdisciplinary team of engineers and scientists, learning from each other, and collaborating to create the best models, methods and products. We work closely with the teams that transform pre-trained models into the models that power the consumer Copilot experience. 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 

  • Are passionate about the infrastructure enabling large-scale AI model training 
  • Will thrive in a highly collaborative, fast-paced environment 
  • Have a high degree of craftsmanship and pay close attention to details 
  • Demonstrate a proactive attitude and enthusiasm for exploring new methods and technologies 
  • Effectively manage multiple responsibilities and can adjust to shifting priorities.  

Qualifications

  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling or data engineering work 
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work 
  • OR equivalent experience. 
  • Experience with HPC (High performance computing) and/ or parallel programming.
  • Experience in the area of pretraining
  • Experience working with GPU clusters

 

 

 

#Copilot #MicrosoftAI

Responsibilities

  • Design and develop Python and CUDA/HIP C++ code that enable distributed training of multimodal LLMs ingesting text, audio, images, or video data. 
  • Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models. 
  • Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation. 
  • Collaborate with the product team and other engineers and researchers across Microsoft AI to identify gaps in the current generation of models. 
  • Embody our and . 

Similar Jobs

Alten Technology - Senior Embedded Software Engineer

Alten Technology

Westminster, Colorado, United States (Hybrid)
3 Weeks ago
NVIDIA - Senior Software Configuration Management Engineer

NVIDIA

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Ion - Principal Technical Consultant - Endur

Ion

Berlin, Berlin, Germany (On-Site)
8 Months ago
Rockstar Games - Senior Physics Programmer

Rockstar Games

Carlsbad, California, United States (On-Site)
2 Weeks ago
Playstation - Senior Software Development Engineer in Test

Playstation

Dublin, County Dublin, Ireland (On-Site)
1 Month ago
bytedance - AR Optics Architect - Pico- San Jose

bytedance

San Jose, California, United States (On-Site)
6 Months ago
bytedance - ML Systems Software Engineer Graduate (AML - Machine Learning Systems)

bytedance

San Jose, California, United States (On-Site)
2 Months ago
NVIDIA - Senior ASIC Design Engineer

NVIDIA

California, United States (Hybrid)
3 Months ago
bytedance - Research Scientist Graduate (High-Performance Computing (Inference Optimization) - Vision AI Platform)

bytedance

Seattle, Washington, United States (On-Site)
2 Months ago
NVIDIA - Senior Math Libraries Engineers - Python APIs

NVIDIA

Louisiana, United States (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Keen Software House - Senior Render Programmer

Keen Software House

Prague, Prague, Czechia (Remote)
4 Months ago
eBay - MTS1, Software Engineer- Payments

eBay

Shanghai, China (On-Site)
2 Weeks ago
Keywords Studios - Lead Game Developer

Keywords Studios

Mexico City, Mexico City, Mexico (Hybrid)
2 Months ago
Aristocrat - Sr Engineer II - Fullstack (Typescript + Java)

Aristocrat

Noida, Uttar Pradesh, India (Hybrid)
6 Days ago
Amanotes - Unity Developer (LiveOps Team)

Amanotes

Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-Site)
5 Months ago
Cadence - Principal Solutions Engineer - AE

Cadence

Noida, Uttar Pradesh, India (On-Site)
8 Months ago
flying wild hog - AI Programmer

flying wild hog

(Remote)
1 Month ago
Capgemini - Software Engineer - B

Capgemini

Noida, Uttar Pradesh, India (On-Site)
1 Month ago
Rivian - Sr. Diagnostic Engineer, Low Voltage and Energy Distribution

Rivian

Palo Alto, California, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Zürich, Zurich, Switzerland

PwC - Manager/Senior Manager for Finance Transformation with SAP

PwC

Zürich, Zurich, Switzerland (On-Site)
8 Months ago
Tesla - Governance Risk and Compliance Systems Analyst

Tesla

Geneva, Geneva, Switzerland (On-Site)
4 Months ago
PwC - Director ADV Risk & Reg

PwC

Zürich, Zurich, Switzerland (On-Site)
8 Months ago
e2 open - Game Programmer

e2 open

Tägerwilen, Thurgau, Switzerland (Remote)
1 Week ago
Salesforce - RVP Sales, MuleSoft

Salesforce

Zürich, Zurich, Switzerland (Hybrid)
2 Weeks ago
Interactive Brokers - Institutional Client Services Associate

Interactive Brokers

Zug, Zug, Switzerland (Hybrid)
1 Month ago
Philips - Field Clinical Scientist

Philips

Horgen, Zurich, Switzerland (On-Site)
1 Year ago
Tesla - Area Parts Supervisor

Tesla

Cham, Zug, Switzerland (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Research Development Jobs

bytedance - Research Scientist Intern (Doubao (Seed) - Foundation Model, Speech Understanding) - 2024 Summer (PhD)

bytedance

San Jose, California, United States (On-Site)
7 Months ago
Tencent - NLP/LLM Research Intern

Tencent

London, England, United Kingdom (On-Site)
3 Months ago
NVIDIA - Senior Signal and Power Integrity Engineer - Hardware

NVIDIA

Canada (On-Site)
2 Months ago
NVIDIA - Senior Signal and Power Integrity Engineer

NVIDIA

Toronto, Ontario, Canada (On-Site)
5 Months ago
NVIDIA - Senior System Power Management Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
The Walt Disney Company - Disney Research Intern

The Walt Disney Company

Zürich, Zurich, Switzerland (On-Site)
7 Months ago
Google - Lead CPU Design Verification Engineer, Silicon

Google

Mountain View, California, United States (On-Site)
2 Months ago
Hawkeye Innovations - Computer Vision Engineer - Level 2

Hawkeye Innovations

Budapest, Hungary (Hybrid)
2 Months ago
N-ix - Senior C++ Engineer (High Performance Computing)

N-ix

United Kingdom (Remote)
3 Months ago
DNEG - Video Streaming Engineer - Imaging, Playback and Review Tools

DNEG

London, England, United Kingdom (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

United States (On-Site)

United States (On-Site)

United States (On-Site)

Chennai, Tamil Nadu, India (On-Site)

Vancouver, British Columbia, Canada (On-Site)

Mountain View, California, United States (Hybrid)

Noida, Uttar Pradesh, India (On-Site)

Redmond, Washington, United States (On-Site)

Paris, Île-de-France, France (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug