Member of Technical Staff, AI Pretraining

1 Week ago • All levels • Artificial Intelligence

Job Summary

Job Description

Contribute to developing one of the world's best foundational AI models at Microsoft AI. The Pre-Training team focuses on challenging deep learning problems at scale. Responsibilities include developing algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a data-driven approach. This involves algorithmic implementation, experimentation, overseeing flagship training runs on a large-scale distributed stack, and close collaboration with infrastructure, data, post-training, and multimodality teams. The ideal candidate will have proven expertise in pre-training, demonstrated by a strong publication record and technical leadership in high-impact projects. Strong analytical skills, attention to detail, and experience with large-scale distributed systems are essential.
Must have:
  • Expertise in AI pre-training
  • Strong analytical & problem-solving skills
  • Experience with large-scale distributed systems
  • Proficiency in C/C++/C#/Java/JavaScript/Python
  • Data-driven approach to algorithm development
Good to have:
  • Experience with conversational AI
  • Excellent communication and collaboration skills
  • Passion for learning new technologies

Job Details

Job Description

Help deliver one of the best foundational models in the world at Microsoft AI. 
At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 
 
We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 
  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience and/or in-depth understandings about large-scale distributed systems 
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment 
 
Responsibilities 
  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our and . 
 
Required/Minimum Qualifications 
  • · Bachelor's Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C , C#, Java, JavaScript, or Python 
  • Proven expertise in the area of pretraining

Additional or Preferred Qualifications 
  • Demonstrated experience in large-scale AI. 
  • Passionate about conversational AI and its deployment. 
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.   
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.  
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.   


Similar Jobs

Velotio Technologies - Data Architect

Velotio Technologies

Maharashtra, India (Remote)
6 Days ago
Knuddels - Java Backend Developer

Knuddels

Baden-Württemberg, Germany (Hybrid)
5 Days ago
Velotio Technologies - Lead Engineer (Java)

Velotio Technologies

Maharashtra, India (Remote)
2 Weeks ago
ARHS - Java Achitect /Technical Lead

ARHS

Brussels, Brussels, Belgium (On-Site)
5 Months ago
Tamatem Games - Software Engineer

Tamatem Games

Amman, Amman Governorate, Jordan (On-Site)
6 Days ago
Interface AI - Sr. Implementation Engineer

Interface AI

United States (Remote)
4 Months ago
Meta - AI Research Scientist - Generative AI Red Teaming (London or Paris)

Meta

Paris, Île-de-France, France (On-Site)
4 Months ago
NVIDIA - Senior Solutions Architect, Generative AI - Inference

NVIDIA

California, United States (Remote)
2 Months ago
Flutter Entertainment - Lead Data Scientist

Flutter Entertainment

Hyderabad, Telangana, India (Hybrid)
4 Months ago
Henkel - Data Scientist-Intern

Henkel

Pune, Maharashtra, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Nielsen Holdings - Software Engineer - Bigdata ( Java or Scala or  Python, Spark, SQL, AWS )

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Skillz - Backend Engineer - Java / GoLang

Skillz

Bengaluru, Karnataka, India (Hybrid)
2 Weeks ago
ION - Lead Software Engineer, Italy

ION

Pisa, Tuscany, Italy (On-Site)
5 Months ago
ByteDance - Full Stack Software Engineer - Data, Security

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Tencent - Cross Border Payment Software Engineer

Tencent

(On-Site)
1 Month ago
Assystems - Backend Developer – ETL Integration

Assystems

Gurugram, Haryana, India (On-Site)
5 Months ago
Netflix - Senior Software Engineer, Partner Engineering - APAC

Netflix

Hsinchu, Hsinchu City, Taiwan (On-Site)
5 Months ago
ByteDance - Relational Database Intern

ByteDance

San Jose, California, United States (On-Site)
1 Week ago
Next Level Business Services - Java developer with Angular

Next Level Business Services

Toronto, Ontario, Canada (On-Site)
5 Months ago
Samsung Semiconductor - Staff Software Engineer – Platform

Samsung Semiconductor

San Jose, California, United States (Hybrid)
1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in London, England, United Kingdom

Rockstar Games - Senior Production Coordinator, Creator Platform

Rockstar Games

Leeds, England, United Kingdom (On-Site)
6 Months ago
The Walt Disney Company - Senior Manager, New Build Project Integration

The Walt Disney Company

London, England, United Kingdom (Hybrid)
1 Week ago
Haptic - Technical Art Director

Haptic

United Kingdom (Hybrid)
2 Months ago
The Walt Disney Company - Senior Software Engineer

The Walt Disney Company

London, England, United Kingdom (On-Site)
1 Week ago
Tesla - Sales Leader

Tesla

Cambridge, England, United Kingdom (On-Site)
1 Month ago
Creative Assembly - Senior VFX Artist

Creative Assembly

England, United Kingdom (Hybrid)
1 Month ago
Lighthouse Games - Senior SDET - C++

Lighthouse Games

Royal Leamington Spa, England, United Kingdom (Hybrid)
6 Days ago
Rockstar Games - Senior Full Stack Engineer (C#/React)

Rockstar Games

Edinburgh, Scotland, United Kingdom (On-Site)
6 Months ago
Kwalee - Community Specialist

Kwalee

Royal Leamington Spa, England, United Kingdom (On-Site)
2 Weeks ago
Hawk Eye Innovations - Commercial Manager - Growth Sports

Hawk Eye Innovations

London, England, United Kingdom (On-Site)
21 Hours ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

NVIDIA - Customer Program Manager

NVIDIA

Taipei City, Taiwan (On-Site)
2 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model AI Platform) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Genies - Backend Engineer Intern (LLM)

Genies

San Mateo, California, United States (Hybrid)
6 Days ago
Magic Media - Senior Automation Engineer

Magic Media

State Of Rio De Janeiro, Brazil (Remote)
1 Week ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
2 Months ago
ByteDance - Research Scientist in Foundation Model, Speech Understanding - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Pika - Research Engineer (Applied Research)

Pika

Palo Alto, California, United States (On-Site)
6 Days ago
Google - Senior Software Engineer, Core Machine Learning, Google Cloud

Google

Sunnyvale, California, United States (On-Site)
4 Months ago
Inworld AI - AI Trainer (Contractor) - Writing & Gaming

Inworld AI

Vancouver, British Columbia, Canada (Remote)
2 Weeks ago
ByteDance - AI Security Researcher - Security Flow

ByteDance

San Jose, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Redmond, Washington, United States (Hybrid)

Redmond, Washington, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

Redmond, Washington, United States (On-Site)

London, England, United Kingdom (On-Site)

London, England, United Kingdom (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug