Member of Technical Staff, AI Pre-Training

1 Month ago • All levels • Artificial Intelligence

Job Summary

Job Description

Contribute to the development of one of the world's best foundational AI models at Microsoft AI. The Pre-Training team focuses on challenging deep learning problems at scale. Responsibilities include developing algorithms, model architectures, and data mixtures for large-scale training; conducting experiments and overseeing flagship training runs; collaborating with infrastructure, data, and post-training teams; and using a data-driven approach grounded in meticulous ablations. Successful candidates will have expertise in deep learning, strong analytical skills, experience with large-scale distributed systems, and a collaborative work style.
Must have:
  • Expertise in deep learning
  • Strong analytical skills
  • Experience with large-scale distributed systems
  • Data-driven approach
  • Collaborative work style

Job Details

Help deliver one of the best foundational models in the world at Microsoft AI. 

At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 

 

We are looking for outstanding individuals who are excited about contributing to the next generation of systems that will transform the field. In particular, we are looking for candidates who:

  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience with, and/or an in-depth understanding of, large-scale distributed systems
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment 
  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our culture and values.

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
