Member of Technical Staff, AI Pretraining Platform

2 Months ago • All levels • Research Development

Job Summary

Job Description

Microsoft AI is seeking a Member of Technical Staff to contribute to their cutting-edge AI pre-training platform. This role involves designing and developing Python and CUDA/HIP C++ code for distributed training of multimodal LLMs, building and maintaining infrastructure for petabyte-scale data processing, partnering with other teams to improve data recipes, and collaborating on identifying gaps in current models. Responsibilities include optimizing for scalability, performance, and reliability on a large-scale GPU cluster. The ideal candidate will be passionate about large-scale AI infrastructure, thrive in a fast-paced collaborative environment, and demonstrate a high degree of craftsmanship.
Must have:
  • Python & CUDA/HIP C++ development
  • Experience with HPC and parallel programming
  • Large-scale AI model training experience
  • GPU cluster experience

Job Details


Job Description

Help build the world’s most advanced training platform at Microsoft AI 

We are on a mission to create the leading pretraining platform to develop the world’s most capable AI frontier models. This platform will span one of the world’s foremost GPU clusters, pushing the boundaries of scale, performance, and reliability.

The AI Pre-training Platform team at Microsoft AI is responsible for all aspects of infrastructure including scalability, benchmarking, kernel development, performance optimizations, communications, and fault tolerance to support our model pre-training operations. We are an interdisciplinary team of engineers and scientists, learning from each other, and collaborating to create the best models, methods and products. We work closely with the teams that transform pre-trained models into the models that power the consumer Copilot experience. 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 
  • Are passionate about the infrastructure enabling large-scale AI model training 
  • Will thrive in a highly collaborative, fast-paced environment 
  • Have a high degree of craftsmanship and pay close attention to details 
  • Demonstrate a proactive attitude and enthusiasm for exploring new methods and technologies 
  • Effectively manage multiple responsibilities and adjust to shifting priorities
 
Responsibilities 
  • Design and develop Python and CUDA/HIP C++ code that enables distributed training of multimodal LLMs ingesting text, audio, images, or video data.
  • Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models. 
  • Partner with the pretraining and post-training teams to improve our data recipe through rigorous, careful experimentation.
  • Collaborate with the product team and other engineers and researchers across Microsoft AI to identify gaps in the current generation of models. 
  • Embody our and
 

Required/Minimum Qualifications  
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling or data engineering work 
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work 
  • Experience with HPC (high-performance computing) and/or parallel programming
  • Experience in the area of pretraining
  • Experience working with GPU clusters

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the .
 
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
 
#Copilot #MicrosoftAI





