Member of Technical Staff, AI Pre-Training

2 Hours ago • All levels • Artificial Intelligence

Job Summary

Job Description

Contribute to the development of one of the world's best foundational AI models at Microsoft AI. The Pre-Training team focuses on challenging deep learning problems at scale. Responsibilities include developing algorithms, model architectures, and data mixtures for large-scale training; conducting experiments and overseeing flagship training runs; collaborating with infrastructure, data, and post-training teams; and using a data-driven approach grounded in meticulous ablations. Successful candidates will have expertise in deep learning, strong analytical skills, experience with large-scale distributed systems, and a collaborative work style.
Must have:
  • Expertise in deep learning
  • Strong analytical skills
  • Experience with large-scale distributed systems
  • Data-driven approach
  • Collaborative work style

Job Details

Help deliver one of the best foundational models in the world at Microsoft AI. 

At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 

 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 

  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience and/or in-depth understandings about large-scale distributed systems 
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment 
  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our and . 

Similar Jobs

Warner Bros Games - Director - Machine Learning Engineering

Warner Bros Games

(Hybrid)
1 Month ago
Tesla - Electrical Engineer - Motor Design and Multi-Physics Optimization

Tesla

Athens, Greece (On-Site)
2 Months ago
Kokotree - Artificial Intelligence Developers

Kokotree

Wilmington, North Carolina, United States (On-Site)
5 Months ago
RoofStack - Software Architect

RoofStack

İstanbul, İstanbul, Türkiye (On-Site)
4 Weeks ago
The Walt Disney Company - Senior Principal Software Engineer

The Walt Disney Company

San Francisco, California, United States (On-Site)
1 Day ago
Talentica Software - Data Scientist

Talentica Software

India (Remote)
6 Months ago
Genies - Research Scientist Intern - LLM/Vision/Speech

Genies

San Mateo, California, United States (Hybrid)
1 Month ago
Ubisoft - Senior ML Programmer

Ubisoft

Montreal, Quebec, Canada (On-Site)
1 Month ago
NVIDIA - LLM Application Intern, AV Infrastructure - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
PlayStation Global - Staff Machine Learning Engineer, Enterprise Enablement

PlayStation Global

Carlsbad, California, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Software Engineer III, Full Stack, Google Ads

Google

Los Angeles, California, United States (On-Site)
5 Months ago
Sporty Group - Head of Technology

Sporty Group

(Remote)
4 Months ago
Meta - Postdoctoral Researcher, Embodied AI (PhD)

Meta

Seattle, Washington, United States (On-Site)
4 Months ago
CharacterAI - Research Engineer, ML Systems

CharacterAI

New York, New York, United States (On-Site)
2 Weeks ago
ByteDance - Software Engineer, Inference

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Virtuos - Game Programming Internship

Virtuos

Malaysia (On-Site)
1 Day ago
Fluence - Senior Power Electronics Controls Engineer (m/f/d)

Fluence

Erlangen, Bavaria, Germany (On-Site)
5 Months ago
N-iX - Senior Unreal Engine/C++ Engineer

N-iX

United Kingdom (Remote)
2 Months ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (MS)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Zürich, Zurich, Switzerland

Tesla - Automotive Mechatronics Technician Apprenticeship

Tesla

Zürich, Zurich, Switzerland (On-Site)
2 Months ago
PwC - Manager SAP Data Migration Consulting

PwC

Zürich, Zurich, Switzerland (On-Site)
5 Months ago
Tesla - Sales Advisor

Tesla

Cham, Zug, Switzerland (On-Site)
1 Month ago
Tesla - Senior Business Planning Coordinator

Tesla

Zug, Zug, Switzerland (On-Site)
2 Months ago
Tesla - Automotive Mechatronics Technician

Tesla

Landquart, Grisons, Switzerland (On-Site)
2 Months ago
PwC - Senior Associate / (Senior) Manager – Deals – Separation and Integration

PwC

Zürich, Zurich, Switzerland (On-Site)
6 Months ago
Tesla - Workshop Supervisor

Tesla

Zürich, Zurich, Switzerland (On-Site)
2 Months ago
The Walt Disney Company - Disney Research Intern

The Walt Disney Company

Zürich, Zurich, Switzerland (On-Site)
5 Months ago
Tesla - Automotive Mechatronics Technician

Tesla

Zürich, Zurich, Switzerland (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Inworld AI - Forward Deployed Engineer (AI Gameplay Engineer)

Inworld AI

Mountain View, California, United States (On-Site)
2 Weeks ago
NVIDIA - Web Software Development Intern - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
Wargaming - Gen AI Business Development Manager

Wargaming

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Alphasense - Lead AI Platform Engineer

Alphasense

New York, New York, United States (On-Site)
4 Months ago
Meta - Software Engineer, Machine Learning

Meta

Sunnyvale, California, United States (On-Site)
5 Months ago
Tencent - Artificial General Intelligence Research Internship

Tencent

Washington, United States (On-Site)
1 Month ago
NVIDIA - Senior Solutions Architect, Global Partner Team

NVIDIA

Canada (On-Site)
2 Months ago
SparkCognition - Data Scientist

SparkCognition

Bengaluru, Karnataka, India (On-Site)
6 Months ago
NVIDIA - Customer Program Manager

NVIDIA

Taipei City, Taiwan (On-Site)
2 Months ago
Microsoft - Member of Technical Staff, AI - Multimodal

Microsoft

(On-Site)
23 Hours ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Mountain View, California, United States (Hybrid)

London, England, United Kingdom (On-Site)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

Redmond, Washington, United States (On-Site)

Redmond, Washington, United States (Hybrid)

Redmond, Washington, United States (Hybrid)

London, England, United Kingdom (On-Site)

London, England, United Kingdom (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug