Member of Technical Staff, AI Reinforcement Systems

2 Months ago • All levels • Research Development

Job Summary

Job Description

Microsoft AI is seeking a Member of Technical Staff to build advanced reinforcement learning systems. Responsibilities include collaborating with research teams to improve reinforcement learning algorithms for LLMs, developing the core systems that adapt reinforcement learning to large scales and diverse environments, and contributing to infrastructure and research. The ideal candidate excels in programming (especially parallel and concurrent), software engineering, and API design, has experience with large-scale systems, thrives in collaborative environments, and pays close attention to detail. A background in machine learning is preferred but not required; strong mathematical or competitive programming skills are valuable substitutes. The role involves managing multiple responsibilities and adapting to changing priorities, with the ultimate aim of delivering safe and capable AI agents to millions of users.
Must have:
  • Experience with large-scale software systems
  • Proficiency in programming, especially parallel and concurrent
  • Expertise in software engineering and API design
  • Excellent collaboration and communication skills
Good to have:
  • Machine learning research background
  • Experience with large-scale distributed AI systems
  • Strong mathematical or competitive programming skills

Job Details

Overview

Help build the world’s most advanced reinforcement learning systems at Microsoft AI.

We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you’ll help advance state-of-the-art model capabilities by contributing to core systems, infrastructure, and research.

We are looking for distributed systems experts with a scientific mindset. The ideal candidate will be able to build complex systems from the ground up, discover and diagnose causes of suboptimal performance, and contribute to solving scientific and research challenges. Specifically, they should:

  • Excel in programming (especially parallel/concurrent), software engineering, and API design
  • Have experience in large-scale systems, preferably having built some components from scratch
  • Thrive in a highly collaborative, fast-paced environment
  • Have a high degree of craftsmanship and pay close attention to detail
  • Effectively manage multiple responsibilities and adjust to shifting priorities
  • Be motivated by training capable and safe AI agents and shipping them into the hands of millions of users

A background in machine learning is preferred but not required. Candidates without one must demonstrate an ability to learn the subject quickly; backgrounds in mathematics, competitive programming, and related domains are a plus.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Qualifications

Required Qualifications:

  • Bachelor's Degree in Computer Science, Software Engineering, Computer Engineering, Machine Learning, Mathematics, or a related STEM field, and experience coding in languages including, but not limited to, C, C++, C#, Rust, Java, or Python; OR equivalent experience.
  • Experience with large-scale software systems and infrastructure.
  • Experience in reinforcement learning, language modelling, generative modelling, or related domains.

Preferred Qualifications:

  • Background in machine learning research.
  • Experience with large-scale distributed AI systems.
  • Ability to work collaboratively in a fast-paced, innovative environment.

Responsibilities

  • Collaborate with research teams to advance state-of-the-art algorithms for reinforcement learning in LLMs.
  • Develop the core systems for adapting reinforcement learning to unprecedented scales and heterogeneous environments.
  • Embody our culture of collaboration, innovation, and excellence.
