Member of Technical Staff, AI Pretraining

4 Months ago • All levels • Research Development

Job Summary

Job Description

Contribute to developing one of the world's best foundational AI models at Microsoft AI. The Pre-Training team focuses on challenging deep learning problems at scale. Responsibilities include developing algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a data-driven approach. This involves algorithmic implementation, experimentation, overseeing flagship training runs on a large-scale distributed stack, and close collaboration with infrastructure, data, post-training, and multimodality teams. The ideal candidate will have proven expertise in pre-training, demonstrated by a strong publication record and technical leadership in high-impact projects. Strong analytical skills, attention to detail, and experience with large-scale distributed systems are essential.
Must have:
  • Expertise in AI pre-training
  • Strong analytical & problem-solving skills
  • Experience with large-scale distributed systems
  • Proficiency in C/C++/C#/Java/JavaScript/Python
  • Data-driven approach to algorithm development
Good to have:
  • Experience with conversational AI
  • Excellent communication and collaboration skills
  • Passion for learning new technologies

Job Details

Job Description

Help deliver one of the best foundational models in the world at Microsoft AI. 
At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 
 
We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 
  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience and/or in-depth understandings about large-scale distributed systems 
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment 
 
Responsibilities 
  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our and . 
 
Required/Minimum Qualifications 
  • · Bachelor's Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C , C#, Java, JavaScript, or Python 
  • Proven expertise in the area of pretraining

Additional or Preferred Qualifications 
  • Demonstrated experience in large-scale AI. 
  • Passionate about conversational AI and its deployment. 
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.   
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.  
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.   


Similar Jobs

London stock Exchange - Principal Engineer, Front-End Technologies

London stock Exchange

Charlotte, North Carolina, United States (On-Site)
3 Weeks ago
fortis games - Security Engineering Manager

fortis games

Spain (Remote)
1 Month ago
WebFX - Jr. Inside Sales Strategist

WebFX

Harrisburg, Pennsylvania, United States (On-Site)
9 Months ago
HCL Tech - Specialist

HCL Tech

Hyderabad, Telangana, India (On-Site)
2 Months ago
The Globel Talent Co - Virtual Assistant/Contracts Administrator (German-speaking)

The Globel Talent Co

Romania (Remote)
1 Week ago
bytedance - Research Scientist Graduate (LLM Model Evaluation - Seed)

bytedance

San Jose, California, United States (On-Site)
1 Month ago
A-Team - Head of AI Revenue & Partnerships

A-Team

New York, United States (Remote)
1 Week ago
bytedance - Machine Learning Researcher (Reasoning Agent) Intern - 2025 Start

bytedance

Singapore (On-Site)
9 Months ago
Adyen - Founding Research Engineer, AI

Adyen

San Francisco, California, United States (On-Site)
4 Weeks ago
DraftKings - Senior Machine Learning Engineer

DraftKings

Boston, Massachusetts, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

luxsoft - Azure/AzureML Engineer

luxsoft

Pune, Maharashtra, India (On-Site)
3 Weeks ago
PwC - Assistant Manager - Internal Audit

PwC

Makati City, Metro Manila, Philippines (On-Site)
10 Months ago
Barracuda - Manager, Cybersecurity Engineering

Barracuda

United States (Remote)
1 Week ago
attentive - Account Director, Core Verticals

attentive

United States (Remote)
2 Months ago
endava - GCP Data Architect

endava

Berlin, Berlin, Germany (Hybrid)
2 Months ago
Alpha Sense - Account Executive, Financial Services

Alpha Sense

Ireland (Remote)
4 Weeks ago
Social Discovery Group - VP of Finance

Social Discovery Group

Israel (Remote)
9 Months ago
BKOM Studios - Quality Assurance Analyst

BKOM Studios

Québec City, Quebec, Canada (Remote)
2 Months ago
TransUnion - Sales Team Lead

TransUnion

Boca Raton, Florida, United States (Hybrid)
1 Week ago
Abridge - Implementation Manager, Strategic Accounts

Abridge

United States (Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in London, England, United Kingdom

Mcdonalds - Learning Designer – HR Curriculum (6 month FTC)

Mcdonalds

London, England, United Kingdom (Hybrid)
1 Month ago
Square - Cost Manager

Square

London, England, United Kingdom (On-Site)
2 Weeks ago
Aspire - Strategy - UK Expansion

Aspire

United Kingdom (Hybrid)
2 Months ago
Scopely - Senior 2D VFX Artist

Scopely

United Kingdom (Remote)
9 Months ago
Blue bolt - Assistant Technical Director

Blue bolt

London, England, United Kingdom (Hybrid)
4 Weeks ago
Imanage - Senior Software Engineer (Java)

Imanage

London, England, United Kingdom (Hybrid)
1 Month ago
Monzo - Lead Credit Analyst, Personal Borrowing

Monzo

London, England, United Kingdom (Remote)
2 Months ago
Moonbug Entertainment - Creative Lead, Brand Marketing

Moonbug Entertainment

London, England, United Kingdom (On-Site)
2 Weeks ago
Ninja theory - Programming

Ninja theory

Cambridge, England, United Kingdom (On-Site)
1 Month ago
Unity - Senior Technical Trainer

Unity

Brighton And Hove, England, United Kingdom (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Research Development Jobs

Luma - Research Engineer - Foundation Models

Luma

Palo Alto, California, United States (Hybrid)
5 Months ago
Thumbtack - Applied Scientist, Customer Growth

Thumbtack

United States (Remote)
1 Month ago
Aera Technology - Client Partner | Enterprise Platform Sales | AI /ML Decision Intelligence | Texas

Aera Technology

Texas, United States (Hybrid)
9 Months ago
Apple - Software Engineer, IS&T AiDP Applied Machine Learning

Apple

Sunnyvale, California, United States (On-Site)
3 Months ago
Boomi  - Solution Manager - AI

Boomi

Vancouver, British Columbia, Canada (Hybrid)
2 Weeks ago
ISS Stoxx - Senior Machine Learning Engineer

ISS Stoxx

New York, United States (On-Site)
1 Year ago
Globalization Partners - Principal AI Engineer

Globalization Partners

(Remote)
7 Months ago
Glean - Software Engineer, Machine Learning

Glean

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Sonar Source - ML Ops Engineer

Sonar Source

Bochum, North Rhine-Westphalia, Germany (On-Site)
4 Months ago
Autodesk - Machine Learning Engineer

Autodesk

Montreal, Quebec, Canada (Hybrid)
1 Year ago

Get notifed when new similar jobs are uploaded

About The Company

Hyderabad, Telangana, India (On-Site)

London, England, United Kingdom (On-Site)

Redmond, Washington, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Redmond, Washington, United States (On-Site)

Mountain View, California, United States (Hybrid)

Zürich, Zurich, Switzerland (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug