Member of Technical Staff, AI Pretraining

1 Month ago • All levels • Artificial Intelligence

Job Summary

Job Description

Contribute to developing one of the world's best foundational AI models at Microsoft AI. The Pre-Training team focuses on challenging deep learning problems at scale. Responsibilities include developing algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a data-driven approach. This involves algorithmic implementation, experimentation, overseeing flagship training runs on a large-scale distributed stack, and close collaboration with infrastructure, data, post-training, and multimodality teams. The ideal candidate will have proven expertise in pre-training, demonstrated by a strong publication record and technical leadership in high-impact projects. Strong analytical skills, attention to detail, and experience with large-scale distributed systems are essential.
Must have:
  • Expertise in AI pre-training
  • Strong analytical & problem-solving skills
  • Experience with large-scale distributed systems
  • Proficiency in C/C++/C#/Java/JavaScript/Python
  • Data-driven approach to algorithm development
Good to have:
  • Experience with conversational AI
  • Excellent communication and collaboration skills
  • Passion for learning new technologies

Job Details

Job Description

Help deliver one of the best foundational models in the world at Microsoft AI. 
At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 
 
We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 
  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience with and/or an in-depth understanding of large-scale distributed systems 
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment 
 
Responsibilities 
  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our culture and values. 
 
Required/Minimum Qualifications 
  • Bachelor's Degree in Computer Science or a related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python 
  • Proven expertise in the area of pretraining

Additional or Preferred Qualifications 
  • Demonstrated experience in large-scale AI. 
  • Passionate about conversational AI and its deployment. 
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.   
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.  
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.   


About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
