Member of Technical Staff, AI Pre-Training

1 Month ago • All levels • Artificial Intelligence

Job Summary

Job Description

Contribute to the development of one of the world's best foundational AI models at Microsoft AI. The Pre-Training team focuses on challenging deep learning problems at scale. Responsibilities include developing algorithms, model architectures, and data mixtures for large-scale training; conducting experiments and overseeing flagship training runs; collaborating with infrastructure, data, and post-training teams; and using a data-driven approach grounded in meticulous ablations. Successful candidates will have expertise in deep learning, strong analytical skills, experience with large-scale distributed systems, and a collaborative work style.
Must have:
  • Expertise in deep learning
  • Strong analytical skills
  • Experience with large-scale distributed systems
  • Data-driven approach
  • Collaborative work style

Job Details

Help deliver one of the best foundational models in the world at Microsoft AI. 

At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 

 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 

  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience and/or in-depth understandings about large-scale distributed systems 
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment 
  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our and . 

Similar Jobs

Riot Games - Senior Manager, Software Engineering - League Studio, Build, Test, Ship

Riot Games

Los Angeles, California, United States (On-Site)
4 Weeks ago
Vimeo - Software Engineer II (Fullstack)

Vimeo

Bengaluru, Karnataka, India (On-Site)
4 Weeks ago
LTI Mindtree - Java AWS

LTI Mindtree

Mexico (On-Site)
2 Days ago
Veeam Software - Backend Developer

Veeam Software

Seattle, Washington, United States (Remote)
1 Week ago
Hitachi - Senior AI Data Scientist

Hitachi

Chennai, Tamil Nadu, India (On-Site)
7 Months ago
Meta - AI Research Scientist, Language - Generative AI

Meta

Burlingame, California, United States (On-Site)
6 Months ago
Social Discovery Group - Senior NLP Engineer

Social Discovery Group

Serbia (Remote)
6 Months ago
NVIDIA - Senior AI-HPC Cluster Engineer

NVIDIA

Westford, Massachusetts, United States (Hybrid)
2 Months ago
Tencent - Game AI Product Management Intern

Tencent

Auckland, Auckland, New Zealand (On-Site)
2 Months ago
bytedance - Student Researcher Intern (Edge Research Project for General Intelligence)

bytedance

San Jose, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

bytedance - DevOps Engineer, Applied Machine Learning Engine - 2025 Start

bytedance

Singapore (On-Site)
6 Months ago
Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

Redmond, Washington, United States (On-Site)
6 Months ago
Scale AI - Head of Frontier Data Operations

Scale AI

San Francisco, California, United States (On-Site)
4 Weeks ago
Microsoft - Principal Researcher-Systems & Networking

Microsoft

Vancouver, British Columbia, Canada (On-Site)
1 Month ago
Adobe - Software Engineer - Infrastructure

Adobe

Bucharest, Bucharest, Romania (On-Site)
3 Weeks ago
Nagarro - Associate Staff Engineer

Nagarro

Philippines (Remote)
7 Months ago
Morningstar - Senior Software Development Engineer, ML Operations

Morningstar

Mumbai, Maharashtra, India (Hybrid)
22 Hours ago
Head Digital Works - Business Intelligence Analyst

Head Digital Works

Hyderabad, Telangana, India (On-Site)
7 Months ago
Kaedim - Customer Success Engineer

Kaedim

London, England, United Kingdom (On-Site)
1 Year ago
MIQ Digital - Senior Data Scientist

MIQ Digital

Bengaluru, Karnataka, India (Hybrid)
6 Days ago

Get notifed when new similar jobs are uploaded

Jobs in Zürich, Zurich, Switzerland

PwC - Auditor - Treasury and Commodity Trading

PwC

Geneva, Geneva, Switzerland (On-Site)
8 Months ago
Google - Software Engineer II, Full Stack, Core

Google

Zürich, Zurich, Switzerland (On-Site)
1 Month ago
Philips - Field Clinical Scientist

Philips

Horgen, Zurich, Switzerland (On-Site)
1 Year ago
Niantic - Senior Software Engineer, Security

Niantic

Zürich, Zurich, Switzerland (Hybrid)
2 Months ago
PwC - Aktuar/-in – Manager/Senior Manager Nichtleben – Actuarial Services

PwC

Zürich, Zurich, Switzerland (On-Site)
8 Months ago
Tesla - Automotive Mechatronics Technician Apprenticeship

Tesla

Zürich, Zurich, Switzerland (On-Site)
3 Months ago
PwC - Director in Life Sciences Quality Management

PwC

Zürich, Zurich, Switzerland (On-Site)
8 Months ago
Salesforce - Account Executive - Partner Cloud (German and French speaking)

Salesforce

Zürich, Zurich, Switzerland (On-Site)
23 Hours ago
Interactive Brokers - Platform Operations Engineer - Linux

Interactive Brokers

Zug, Zug, Switzerland (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

Fremont, California, United States (Remote)
6 Months ago
bytedance - Cloud Native Engineer, ARK Large Model Platform (Singapore)

bytedance

Singapore (On-Site)
6 Months ago
Blitz app - Lead AI Engineer (Generative & 3D Modeling Expertise)

Blitz app

Tesistán, Jalisco, Mexico (On-Site)
4 Months ago
zoox - Senior/Staff Software Engineer - Simulator

zoox

Seattle, Washington, United States (Hybrid)
7 Months ago
Airlab Inc  - Artificial Intelligence Researcher

Airlab Inc

Montreal, Quebec, Canada (On-Site)
10 Months ago
Google - Senior Research Engineer, AI/ML

Google

London, England, United Kingdom (On-Site)
1 Month ago
bytedance - Student Researcher Intern (Edge Research Project for General Intelligence)

bytedance

San Jose, California, United States (On-Site)
1 Month ago
bytedance - Software Development Engineer - Large Language Models, AML

bytedance

San Jose, California, United States (On-Site)
3 Months ago
NVIDIA - Senior Solutions Architect, Retail

NVIDIA

Arkansas, United States (Remote)
1 Month ago
Genies - Machine Learning Infrastructure Engineer, 3D Model Inference & Deployment

Genies

Los Angeles, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Vancouver, British Columbia, Canada (On-Site)

Mountain View, California, United States (Hybrid)

Shenzhen, Guangdong Province, China (On-Site)

Noida, Uttar Pradesh, India (On-Site)

Redmond, Washington, United States (On-Site)

Paris, Île-de-France, France (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug