Member of Technical Staff, AI Pre-Training

3 Weeks ago • All levels • Artificial Intelligence

Job Summary

Job Description

Contribute to the development of one of the world's best foundational AI models at Microsoft AI. The Pre-Training team focuses on challenging deep learning problems at scale. Responsibilities include developing algorithms, model architectures, and data mixtures for large-scale training; conducting experiments and overseeing flagship training runs; collaborating with infrastructure, data, and post-training teams; and using a data-driven approach grounded in meticulous ablations. Successful candidates will have expertise in deep learning, strong analytical skills, experience with large-scale distributed systems, and a collaborative work style.
Must have:
  • Expertise in deep learning
  • Strong analytical skills
  • Experience with large-scale distributed systems
  • Data-driven approach
  • Collaborative work style

Job Details

Help deliver one of the best foundation models in the world at Microsoft AI.

At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 

  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience with, and/or an in-depth understanding of, large-scale distributed systems
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment 
  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our culture and values.

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
