Member of Technical Staff, AI - Pre-Training

1 Hour ago • 4-10 Years • Artificial Intelligence

About the job

Job Description

This role involves developing algorithms, model architectures, and scaling laws for large-scale AI model training. Responsibilities include algorithmic implementation, conducting experiments, and overseeing training runs on a distributed system. Close collaboration with cross-functional teams is required. The ideal candidate will have expertise in deep learning, large-scale distributed systems, and a strong publication record. The team aims to deliver one of the world's best foundational AI models, impacting various Microsoft AI initiatives. The position requires proficiency in languages like C, C++, C#, Java, JavaScript, or Python and a passion for conversational AI and its deployment. The role demands strong analytical, communication, and collaborative skills.
Must have:
  • Expertise in deep learning and large-scale systems
  • Proficiency in C/C++/Java/Python etc.
  • Strong publication record and technical leadership
  • Experience with large-scale AI model training
  • Excellent communication and collaboration skills
Good to have:
  • Passion for conversational AI
  • Experience with cloud computing platforms
  • Familiarity with multimodal AI models
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Overview

Help deliver one of the best foundational models in the world at Microsoft AI. 

At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 

 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 

  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience and/or in-depth understandings about large-scale distributed systems 
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment

 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

 

Qualifications

Required Qualifications 

  • Bachelor's Degree in Computer Science, Machine Learning, Mathematics, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python 
    • OR equivalent experience. 

Preferred Qualifications 

  • Bachelor's Degree in Computer Science or related technical field AND 10+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python 
    • OR Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python 
    • OR equivalent experience. 
  • Demonstrated experience in large-scale AI. 
  • Passionate about conversational AI and its deployment. 
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.   
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.  
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.  

 

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.

 

Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $137,600 - $267,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $180,400 - $294,000 per year.


Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

Microsoft will accept applications and processes offers for these roles on an ongoing basis.

 

Responsibilities

  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our and . 
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description
$117.2K - $294.0K/yr (Outscal est.)
$205.6K/yr avg.
Redmond, Washington, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Seoul, South Korea (On-Site)

New York, New York, United States (On-Site)

Texas, United States (Hybrid)

Dublin, County Dublin, Ireland (On-Site)

Hyderabad, Telangana, India (On-Site)

Sydney, New South Wales, Australia (Hybrid)

Bengaluru, Karnataka, India (On-Site)

Hyderabad, Telangana, India (On-Site)

London, England, United Kingdom (On-Site)

Beijing, Beijing, China (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Passive Logic - Autonomous Systems Software Engineer

Passive Logic, United States (On-Site)

Amazon Games - Senior ML Scientist, Amazon Games AI Research

Amazon Games, United States (On-Site)

Dream Game Studios - Senior Security Engineer - Red Team

Dream Game Studios, India (On-Site)

Hitachi - Quality Analyst

Hitachi, India (On-Site)

Sumo Logic - Staff Software Engineer

Sumo Logic, India (On-Site)

Playrix - Senior QA Engineer (VSO Engine)

Playrix, Cyprus (Remote)

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Netflix - Senior Test Engineer (L5), iOS Games SDK

Netflix, United States (Remote)

Google - Engineering Analyst, Messages Spam and Abuse

Google, United States (On-Site)

Alten Technology - Netsuite OpenAir Administrator

Alten Technology, United States (On-Site)

PENN Interactive - Director, Motion Design

PENN Interactive, United States (Hybrid)

Axiom Zen - Game Designer, CryptoKitties

Axiom Zen, United States (Remote)

Next Level Business Services - SAP OER Project Manager

Next Level Business Services, United States (On-Site)

Valve corporation - Design
Visual & User Experience

Valve corporation, United States (On-Site)

Glean - Enterprise Account Executive - Texas

Glean, United States (Remote)

Google - Student Researcher, PhD, Winter/Summer 2025

Google, United States (On-Site)

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Get notifed when new similar jobs are uploaded