Member of Technical Staff, AI Pretraining

3 Months ago • All levels • Research Development

Job Summary

Job Description

Contribute to developing one of the world's best foundational AI models at Microsoft AI. The Pre-Training team focuses on challenging deep learning problems at scale. Responsibilities include developing algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a data-driven approach. This involves algorithmic implementation, experimentation, overseeing flagship training runs on a large-scale distributed stack, and close collaboration with infrastructure, data, post-training, and multimodality teams. The ideal candidate will have proven expertise in pre-training, demonstrated by a strong publication record and technical leadership in high-impact projects. Strong analytical skills, attention to detail, and experience with large-scale distributed systems are essential.
Must have:
  • Expertise in AI pre-training
  • Strong analytical & problem-solving skills
  • Experience with large-scale distributed systems
  • Proficiency in C/C++/C#/Java/JavaScript/Python
  • Data-driven approach to algorithm development
Good to have:
  • Experience with conversational AI
  • Excellent communication and collaboration skills
  • Passion for learning new technologies

Job Details

Job Description

Help deliver one of the best foundational models in the world at Microsoft AI. 
At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 
 
We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. Ideal candidates: 
  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience with and/or an in-depth understanding of large-scale distributed systems 
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment 
 
Responsibilities 
  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our culture and values. 
 
Required/Minimum Qualifications 
  • Bachelor's Degree in Computer Science or a related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python 
  • Proven expertise in the area of pretraining

Additional or Preferred Qualifications 
  • Demonstrated experience in large-scale AI. 
  • Passion for conversational AI and its deployment. 
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.   
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.  
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.   

