Member of Technical Staff, High Performance Computing Engineer

5 Hours ago • 6-6 Years • Artificial Intelligence • $137,600 PA - $294,000 PA

Job Summary

Job Description

Microsoft AI is seeking experienced High Performance Computing Engineers to contribute to the evolution of Copilot. Responsibilities include building secure and performant AI Platform services, collaborating with other engineers and researchers, shipping high-quality code, overcoming roadblocks to deliver work quickly, and thriving in a fast-paced environment. The role involves working on large-scale supercomputers and developing APIs to enhance Copilot's functionalities. The ideal candidate will possess strong problem-solving skills, excellent communication abilities, and a collaborative work ethic. This position requires working in the office 3 days a week and is located in Mountain View, CA.
Must have:
  • Build secure & performant AI Platform services
  • Collaborate with Platform, infrastructure, application engineers & AI Researchers
  • Ship high-quality, well-tested, secure, and maintainable code
  • 6+ years experience with high-scale training clusters (e.g., Nvidia InfiniBand, SLURM, Kubernetes, Ray)
  • 6+ years experience building scalable services on public cloud (Azure, AWS, GCP)
  • Proficiency in Python, C#, C++, Rust, or Java
Good to have:
  • Experience with LLM training clusters
  • Experience with AI platforms, frameworks, and APIs
  • Experience using Machine Learning frameworks
  • Ability to identify and resolve complex technical issues

Job Details

Job Description

Overview
At Microsoft AI, we are pushing the boundaries of technology.
We are creating unique, beautiful and powerful products that will change lives. A small, friendly, fast-moving team, we support each other to do the best work of our lives, always looking to break new ground, fast. We are proud of what we build, how we build it and that our products will define the AI era. We run lean, obsess about users, and always make our decisions based on the evidence. We ship regularly, so your work will have real and immediate impact.
We are seeking experienced High Performance Computing Engineers to join our team and contribute to the evolution of our personal AI, Copilot. This role offers the unique opportunity to work on some of the largest scale supercomputers in the world, a rare chance to operate at such a significant scale. The right candidate will bring a wealth of positive energy, empathy, and kindness, coupled with a track record of effectiveness. You'll be proactive, relishing the challenge of crafting top-tier consumer experiences and products swiftly and efficiently. Our team is at the forefront of developing APIs that enhance our ability to fine-tune and deploy Copilot's core functionalities, in partnership with our Product Management, Design, and AI Research teams.
 
Our newly formed organization, Microsoft AI, is dedicated to advancing Copilot and other consumer AI products and research. The team is responsible for Copilot, Bing, Edge, and generative AI research. Come be a part of the team shaping the future of personal computing.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. 
 
By applying to this Mountain View, CA position, you are required to be local to the San Francisco area and in office 3 days a week. 
 
Responsibilities: 
  • Build secure and performant AI Platform services that power Copilot.
  • Work collaboratively with other platform, infrastructure, and application engineers, as well as AI researchers, to build next-generation AI products and services.
  • Ship high-quality, well-tested, secure, and maintainable code.
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively.
  • Enjoy working in a fast-paced, design-driven, product development cycle.
  • Embody our and 

Required/Minimum Qualifications:
  • Bachelor’s degree in computer science or a related technical discipline AND 6 years of technical engineering experience building web services, coding in languages including, but not limited to, Python, C#, C++, Rust, or Java
  • OR equivalent experience.
  • 6 years of experience working with high-scale training clusters (e.g., working with frameworks/tools such as NVIDIA InfiniBand clusters, SLURM, Kubernetes, Ray, etc.)
  • 6 years of experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP.

Preferred Qualifications:
  • Experience with LLM training clusters. 
  • Experience working with AI platforms, frameworks, and APIs.
  • Experience using machine learning frameworks, including experience using, deploying, and scaling large language models, either personally or professionally.
  • Ability to identify, analyze, and resolve complex technical issues, ensuring optimal performance, scalability, and user experience.
  • Dedication to writing clean, maintainable, and well-documented code with a focus on application quality, performance, and security.
  • Demonstrated interpersonal skills and ability to work closely with cross-functional teams, including product managers, designers, and other engineers.
  • Ability to clearly communicate complex technical concepts to both technical and non-technical stakeholders.
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in web development and AI.
  • Ability to work in a fast-paced environment, manage multiple priorities, and adapt to changing requirements and deadlines.
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.
 
Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $137,600 - $267,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $180,400 - $294,000 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
Microsoft will accept applications and process offers for these roles on an ongoing basis.

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
