Member of Technical Staff, AI Pretraining

2 Months ago • All levels • Artificial Intelligence

Job Summary

Job Description

Contribute to developing one of the world's best foundational AI models at Microsoft AI. The Pre-Training team focuses on challenging deep learning problems at scale. Responsibilities include developing algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a data-driven approach. This involves algorithmic implementation, experimentation, overseeing flagship training runs on a large-scale distributed stack, and close collaboration with infrastructure, data, post-training, and multimodality teams. The ideal candidate will have proven expertise in pre-training, demonstrated by a strong publication record and technical leadership in high-impact projects. Strong analytical skills, attention to detail, and experience with large-scale distributed systems are essential.
Must have:
  • Expertise in AI pre-training
  • Strong analytical & problem-solving skills
  • Experience with large-scale distributed systems
  • Proficiency in C/C++/C#/Java/JavaScript/Python
  • Data-driven approach to algorithm development
Good to have:
  • Experience with conversational AI
  • Excellent communication and collaboration skills
  • Passion for learning new technologies

Job Details

Job Description

Help deliver one of the best foundational models in the world at Microsoft AI. 
At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 
 
We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 
  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience with and/or an in-depth understanding of large-scale distributed systems 
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment 
 
Responsibilities 
  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our culture and values. 
 
Required/Minimum Qualifications 
  • Bachelor's Degree in Computer Science or a related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python 
  • Proven expertise in the area of pretraining

Additional or Preferred Qualifications 
  • Demonstrated experience in large-scale AI. 
  • Passion for conversational AI and its deployment. 
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.   
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.  
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.   

