Member of Technical Staff, AI Pre-Training

2 Months ago • All levels

Job Summary

Job Description

Contribute to the development of one of the world's best foundational AI models at Microsoft AI. The Pre-Training team focuses on challenging deep learning problems at scale. Responsibilities include developing algorithms, model architectures, and data mixtures for large-scale training; conducting experiments and overseeing flagship training runs; collaborating with infrastructure, data, and post-training teams; and using a data-driven approach grounded in meticulous ablations. Successful candidates will have expertise in deep learning, strong analytical skills, experience with large-scale distributed systems, and a collaborative work style.
Must have:
  • Expertise in deep learning
  • Strong analytical skills
  • Experience with large-scale distributed systems
  • Data-driven approach
  • Collaborative work style

Job Details

Help deliver one of the best foundational models in the world at Microsoft AI. 

At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 

 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 

  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience and/or in-depth understandings about large-scale distributed systems 
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment 
  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our and . 

Similar Jobs

Riot Games - Senior Manager, Software Engineering - League Studio, Build, Test, Ship

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
Vimeo - Software Engineer II (Fullstack)

Vimeo

Bengaluru, Karnataka, India (On-Site)
2 Months ago
LTI Mindtree - Java AWS

LTI Mindtree

Mexico (On-Site)
1 Month ago
Veeam Software - Backend Developer

Veeam Software

Seattle, Washington, United States (Remote)
1 Month ago
Hitachi - Senior AI Data Scientist

Hitachi

Chennai, Tamil Nadu, India (On-Site)
8 Months ago
Meta - AI Research Scientist, Language - Generative AI

Meta

Burlingame, California, United States (On-Site)
8 Months ago
Social Discovery Group - Senior NLP Engineer

Social Discovery Group

Serbia (Remote)
8 Months ago
NVIDIA - Senior AI-HPC Cluster Engineer

NVIDIA

Westford, Massachusetts, United States (Hybrid)
3 Months ago
Tencent - Game AI Product Management Intern

Tencent

Auckland, Auckland, New Zealand (On-Site)
3 Months ago
bytedance - Student Researcher Intern (Edge Research Project for General Intelligence)

bytedance

San Jose, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

bytedance - DevOps Engineer, Applied Machine Learning Engine - 2025 Start

bytedance

Singapore (On-Site)
8 Months ago
Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

Redmond, Washington, United States (On-Site)
8 Months ago
Scale AI - Head of Frontier Data Operations

Scale AI

San Francisco, California, United States (On-Site)
2 Months ago
Microsoft - Principal Researcher-Systems & Networking

Microsoft

Vancouver, British Columbia, Canada (On-Site)
2 Months ago
Adobe - Software Engineer - Infrastructure

Adobe

Bucharest, Bucharest, Romania (On-Site)
2 Months ago
Nagarro - Associate Staff Engineer

Nagarro

Philippines (Remote)
8 Months ago
Morningstar - Senior Software Development Engineer, ML Operations

Morningstar

Mumbai, Maharashtra, India (Hybrid)
1 Month ago
Head Digital Works - Business Intelligence Analyst

Head Digital Works

Hyderabad, Telangana, India (On-Site)
8 Months ago
Kaedim - Customer Success Engineer

Kaedim

London, England, United Kingdom (On-Site)
1 Year ago
MIQ Digital - Senior Data Scientist

MIQ Digital

Bengaluru, Karnataka, India (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Zürich, Zurich, Switzerland

PwC - Auditor - Treasury and Commodity Trading

PwC

Geneva, Geneva, Switzerland (On-Site)
9 Months ago
Google - Software Engineer II, Full Stack, Core

Google

Zürich, Zurich, Switzerland (On-Site)
2 Months ago
Philips - Field Clinical Scientist

Philips

Horgen, Zurich, Switzerland (On-Site)
1 Year ago
Niantic - Senior Software Engineer, Security

Niantic

Zürich, Zurich, Switzerland (Hybrid)
3 Months ago
PwC - Aktuar/-in – Manager/Senior Manager Nichtleben – Actuarial Services

PwC

Zürich, Zurich, Switzerland (On-Site)
9 Months ago
Tesla - Automotive Mechatronics Technician Apprenticeship

Tesla

Zürich, Zurich, Switzerland (On-Site)
4 Months ago
PwC - Director in Life Sciences Quality Management

PwC

Zürich, Zurich, Switzerland (On-Site)
9 Months ago
Salesforce - Account Executive - Partner Cloud (German and French speaking)

Salesforce

Zürich, Zurich, Switzerland (On-Site)
1 Month ago
Interactive Brokers - Platform Operations Engineer - Linux

Interactive Brokers

Zug, Zug, Switzerland (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

Fremont, California, United States (Remote)
8 Months ago
bytedance - Cloud Native Engineer, ARK Large Model Platform (Singapore)

bytedance

Singapore (On-Site)
8 Months ago
Blitz app - Lead AI Engineer (Generative & 3D Modeling Expertise)

Blitz app

Tesistán, Jalisco, Mexico (On-Site)
6 Months ago
zoox - Senior/Staff Software Engineer - Simulator

zoox

Seattle, Washington, United States (Hybrid)
8 Months ago
Airlab Inc  - Artificial Intelligence Researcher

Airlab Inc

Montreal, Quebec, Canada (On-Site)
11 Months ago
Google - Senior Research Engineer, AI/ML

Google

London, England, United Kingdom (On-Site)
2 Months ago
bytedance - Student Researcher Intern (Edge Research Project for General Intelligence)

bytedance

San Jose, California, United States (On-Site)
2 Months ago
bytedance - Software Development Engineer - Large Language Models, AML

bytedance

San Jose, California, United States (On-Site)
5 Months ago
NVIDIA - Senior Solutions Architect, Retail

NVIDIA

Arkansas, United States (Remote)
2 Months ago
Genies - Machine Learning Infrastructure Engineer, 3D Model Inference & Deployment

Genies

Los Angeles, California, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

United States (On-Site)

Mountain View, California, United States (Hybrid)

Vancouver, British Columbia, Canada (On-Site)

California, United States (On-Site)

Hyderabad, Telangana, India (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

London, England, United Kingdom (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug