Member of Technical Staff, AI - Reinforcement Systems

4 Months ago • All levels • Research Development

Job Summary

Job Description

This role involves building advanced reinforcement learning systems at Microsoft AI, focusing on creating trustworthy, autonomous agents. Responsibilities include collaborating with research teams to improve reinforcement learning algorithms for LLMs, developing core systems for scaling reinforcement learning to diverse environments, and ensuring high-quality software engineering practices. The ideal candidate excels in programming (parallel/concurrent), software engineering, API design, and large-scale system development. Experience with large-scale systems and a demonstrated interest in reinforcement learning are essential. The role demands collaboration, attention to detail, and the ability to manage multiple responsibilities in a fast-paced setting.
Must have:
  • Expert in parallel/concurrent programming
  • Strong software engineering & API design skills
  • Large-scale system experience
  • Reinforcement learning knowledge
  • Collaboration & problem-solving skills
Good to have:
  • Machine learning research background
  • Large-scale distributed AI systems experience

Job Details

Job Description

Help build the world’s most advanced reinforcement learning systems at Microsoft AI. 
  
We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you’ll help advance state-of-the-art model capabilities by contributing to core systems, infrastructure, and research. 
  
We are looking for distributed systems experts with a scientific mindset. The ideal candidate will be able to build complex systems from the ground up, discover and diagnose causes of suboptimal performance, and contribute to solving scientific and research challenges. Specifically, they should: 
  • Excel in programming (especially parallel/concurrent), software engineering, and API design 
  • Have experience in large-scale systems, preferably having built some components from scratch. 
  • Thrive in a highly collaborative, fast-paced environment 
  • Have a high degree of craftsmanship and pay close attention to details 
  • Effectively manage multiple responsibilities and can adjust to shifting priorities 
  • Be motivated by training capable and safe AI agents and shipping them into the hands of millions of users 
 
A background in machine learning is preferred but not required. In this case, candidates must demonstrate they have an ability to quickly learn the subject, and backgrounds in mathematics, competitive programming, and related domains are a plus.   
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. 
 
Responsibilities 
  • Collaborate with research teams to advance state-of-the-art algorithms for reinforcement learning in LLMs 
  • Develop the core systems for adapting reinforcement learning to unprecedented scales and heterogeneous environments.  
  • Embody our of collaboration, innovation, and excellence. 
 
Qualifications 
 
Required Qualifications: 
  • Bachelor's Degree in Computer Science, Software Engineering, Computer Engineering, Machine Learning, Mathematics, or related STEM fields and experience in coding in languages including, but not limited to, C, C , C#, Rust, Java, or Python 
  • Experience with large-scale software systems and infrastructure. 
  • Demonstrated interest in reinforcement learning, language modelling, generative modelling, or related domains. 
  • Ability to work collaboratively in a fast-paced, innovative environment. 

Preferred Qualifications: 
  • Background in machine learning research. 
  • Experience with large scale distributed AI systems. 



Similar Jobs

GHX - Director, ACE Commercial Associate Program

GHX

Louisville, Colorado, United States (On-Site)
2 Weeks ago
Bosch Group - IN_RBIC_Senior Engineer/Assistant Manager HSE

Bosch Group

Kurali, Maharashtra, India (On-Site)
1 Month ago
Epic Games - Product Manager

Epic Games

Cary, North Carolina, United States (On-Site)
5 Months ago
Blazesoft - Gaming Paralegal

Blazesoft

Toronto, Ontario, Canada (On-Site)
1 Month ago
Oliver Plus - Integrated Designer

Oliver Plus

India (Remote)
2 Months ago
Reddit - Senior Software Engineer, AI Enablement

Reddit

Canada (Remote)
2 Months ago
Haleon - Lead Machine Learning Engineer

Haleon

Bengaluru, Karnataka, India (On-Site)
2 Months ago
rivos - Deep Learning Libraries Engineer

rivos

United Kingdom (Hybrid)
1 Year ago
Apple - Senior AI Application Engineer

Apple

Cupertino, California, United States (On-Site)
2 Months ago
DevRev - Architect - Applied AI Engineer

DevRev

(Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Coda - Software Engineering Manager

Coda

Bangkok, Thailand (Hybrid)
5 Months ago
Blazesoft - Gaming Paralegal

Blazesoft

Toronto, Ontario, Canada (On-Site)
1 Month ago
Illumina - Senior Manufacturing Manager

Illumina

Cambridge, England, United Kingdom (On-Site)
3 Weeks ago
luxsoft - Technical Lead / Senior Data Engineer

luxsoft

Ukrainka, Kyiv Oblast, Ukraine (Remote)
1 Month ago
USE Insider - Enterprise Account Executive - France

USE Insider

Paris, Île-de-France, France (Hybrid)
6 Months ago
Lulalend - Software Engineering Team Lead

Lulalend

Cape Town, Western Cape, South Africa (Hybrid)
2 Months ago
Euromonitor - Research Associate

Euromonitor

Mexico City, Mexico (Hybrid)
1 Week ago
CyberArk - Enterprise Customer Success Manager

CyberArk

United States (On-Site)
2 Months ago
Zinnia - Director – Developer Platforms (DevX)

Zinnia

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
Salesforce - Named Account Executive - Insurance

Salesforce

Munich, Bavaria, Germany (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in London, England, United Kingdom

Ubisoft - Vendor Project Manager

Ubisoft

Newcastle Upon Tyne, England, United Kingdom (Hybrid)
2 Months ago
asbo interactibe - Unreal Engine Programmer - Bike Physics

asbo interactibe

Rowlands Gill, England, United Kingdom (On-Site)
2 Months ago
Synthesia - Sales Development Representative - Italian Speaker

Synthesia

London, England, United Kingdom (On-Site)
2 Weeks ago
Tesla - Sales Advisor

Tesla

London, England, United Kingdom (On-Site)
5 Months ago
Whatnot - Performance Creative Producer

Whatnot

London, England, United Kingdom (On-Site)
2 Months ago
Zoe - Technical Conversion Rate Optimisation (CRO) Specialist

Zoe

United Kingdom (Remote)
3 Weeks ago
Cloud Imperium Games - VFX Artist

Cloud Imperium Games

Manchester, England, United Kingdom (On-Site)
5 Months ago
The Walt Disney Company - Technical Lighter (All Levels)

The Walt Disney Company

London, England, United Kingdom (Hybrid)
2 Months ago
Square - Class 1 C & D Driver

Square

Wolverhampton, England, United Kingdom (On-Site)
1 Week ago
miniclip - Senior UI Artist

miniclip

Derby, England, United Kingdom (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Research Development Jobs

Ubisoft - Lead R&D Scientist

Ubisoft

Shanghai, Shanghai, China (On-Site)
3 Months ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Hyderabad, Telangana, India (Hybrid)
5 Months ago
Playtika - R&D Team Leader

Playtika

Poland (Hybrid)
6 Months ago
Rippling - Machine Learning Engineer Intern - Winter 2026

Rippling

San Francisco, California, United States (On-Site)
2 Months ago
Alpha Sense - Senior Software Engineer (AI Applications)

Alpha Sense

Pune, Maharashtra, India (On-Site)
1 Month ago
Tavus - AI Researcher (Voice)

Tavus

San Francisco, California, United States (Remote)
1 Week ago
Qualcomm - GPU Research Engineer

Qualcomm

Santa Clara, California, United States (On-Site)
3 Months ago
PermitFlow - Machine Learning, Software Engineer

PermitFlow

New York, United States (Hybrid)
1 Week ago
C3 IoT - Senior Software Engineer - Machine Learning

C3 IoT

Redwood City, California, United States (On-Site)
3 Weeks ago
Capgemini - Gen AI Professional

Capgemini

Bengaluru, Karnataka, India (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Hyderabad, Telangana, India (On-Site)

London, England, United Kingdom (On-Site)

Redmond, Washington, United States (On-Site)

Redmond, Washington, United States (On-Site)

Mountain View, California, United States (Hybrid)

Zürich, Zurich, Switzerland (On-Site)

Mountain View, California, United States (Hybrid)

Mountain View, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug