Member of Technical Staff, AI - Reinforcement Learning (RL) Platform

1 Month ago • All levels • Research & Development

Job Summary

Job Description

This role involves building and enhancing the world's most advanced reinforcement learning (RL) platform at Microsoft AI. Responsibilities include designing and developing the core infrastructure of the RL platform, focusing on systematizing and extending RL algorithms for LLMs across various environments. The position requires collaborating with cross-functional teams to deliver new agentic AI product capabilities, developing new algorithms, and onboarding team members to state-of-the-art techniques. The ideal candidate possesses strong coding, software engineering, and API design skills, a background in machine learning and scientific computing, and thrives in a collaborative environment. They must excel at managing multiple responsibilities and adapting to changing priorities.
Must have:
  • Design and develop RL platform infrastructure
  • Extend RL algorithms for LLMs
  • Collaborate with cross-functional teams
  • Develop new algorithms
  • Strong coding, software engineering, and API design skills
  • Machine learning and scientific computing background
Perks:
  • Industry-leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Help build the world’s most advanced reinforcement learning platform at Microsoft AI. 

We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you’ll help advance state-of-the-art algorithms for model alignment and develop tools to extend model capabilities to numerous product domains within Microsoft. 

We are looking for candidates who are both scientists and software engineers. The ideal candidate will be able to build robust systems that help our team solve the next generation of AI problems. They would: 

  • Excel in coding, software engineering, and API design 
  • Have a background in machine learning and scientific computing 
  • Thrive in a highly collaborative, fast-paced environment 
  • Have a high degree of craftsmanship and pay close attention to details 
  • Effectively manage multiple responsibilities and adjust to shifting priorities

Qualifications

Required/Minimum Qualifications  

  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling or data engineering work 
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work 
  • OR equivalent experience.

Responsibilities

  • Design and develop the core infrastructure of the RL Platform, focusing on systematizing and extending RL algorithms for LLMs to a variety of present and future environments. 
  • Assist in development of new algorithms and help onboard other team members to state-of-the-art techniques. 
  • Collaborate with cross-functional teams to ship new agentic AI product capabilities. 
  • Embody our culture of collaboration, innovation, and excellence.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

  • Industry-leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
