AI/ML Model Runtime Engineer

59 Minutes ago • 5-10 Years • Research Development • $127,100 PA - $226,000 PA

Job Summary

Job Description

Broadcom is seeking a Systems Engineer for its VMware Cloud Foundation's AI and Advanced Services team. The role focuses on building a private cloud AI platform and involves designing and implementing scalable solutions. The candidate will be part of the Model Runtime team, working with a Kubernetes-based control plane for ML inferencing services. Responsibilities include collaborating on Kubernetes-based platform services for AI, troubleshooting inference services, augmenting the platform for diverse inferencing engines, developing enhancements for ML models, participating in code reviews, and maintaining automated tests. Experience with Open Source projects, particularly Kubernetes and CNCF projects, is highly valued.
Must have:
  • 5+ years of production quality Python code
  • 5+ years of experience with Docker and Kubernetes
  • Strong analytical and diagnostic skills
  • Excellent communication and collaboration skills
  • Experience with agile development methodologies
  • Experience with version control systems (Git)
  • BS in Computer Science or related technical fields
  • Candidate should not require work visa/sponsorship
Good to have:
  • Experience contributing to Open Source projects
  • Experience participating in upstream Kubernetes and CNCF projects
  • Experience with CUDA or other numeric computing libraries
Perks:
  • Competitive salary and benefits package
  • Opportunities for career growth and professional development
  • Collaborative and dynamic work environment
  • Access to cutting-edge technologies and tools

Job Details

Please Note:

1. If you are a first time user, please create your candidate login account before you apply for a job. (Click Sign In > Create Account)

2. If you already have a Candidate Account, please Sign-In before you apply.

Job Description:

Broadcom is looking for a Systems Engineer to join VMware Cloud Foundation’s (VCF) AI and Advanced Services team. This position is key to building a best in class private cloud AI platform. You will have a high impact by playing a critical role designing and implementing scalable solutions along with a team of talented and enthusiastic engineers.

This role will be a member of the Private AI Services’ Model Runtime team, which is a Kubernetes based control plane that operates ML inferencing services. The successful candidate must have experience contributing to Open Source projects, and experience participating in upstream Kubernetes and related CNCF projects as a contributor is a major plus.

The AI & Advanced Services team is responsible for building AI platform capabilities into the VMware Cloud Foundation product to enable our enterprise customers to have all of the AI platform features they need to build, deploy, test, manage, and scale their AI infrastructure and workloads.

 

Responsibilities

  • Collaborate with cross-functional teams to design and deliver expanded capabilities of Kubernetes-based platform services for AI

  • Troubleshoot and resolve complex issues related to Private AI inference services and their performance

  • Augment the AI services platform to support diverse inferencing and embedding engines (e.g,. vLLM or HuggingFace Transformers library)

  • Develop enhancements to the AI services platform to support a large set of ML models

  • Participate in code reviews and ensure that the code is aligned with VMware's coding standards and best practices

  • Develop and maintain automated tests to ensure the quality and reliability of the Private AI feature set

 

Requirements

  • Ability to succeed on an take-home homework assignment and in-person technical interview including coding and debugging

  • 5+ years experience in production quality Python code

  • 5+ years of hands on experience with Container technologies (Docker and Kubernetes)

  • Experience with CUDA or other numeric computing libraries and their Python bindings is a plus

  • Strong analytical and diagnostic skills with ability to independently solve complex problems

  • Excellent communication and collaboration skills, with the ability to work with cross-functional teams

  • Experience with agile development methodologies and version control systems, such as Git

  • BS in Computer Science or related technical fields and 12+ years of related experience in the software industry or MS in Computer Science or related technical fields and 10+ years of related experience in the software industry

  • Candidate should not require work visa / sponsorship

 

What We Offer:

  • Competitive salary and benefits package

  • Opportunities for career growth and professional development

  • Collaborative and dynamic work environment

  • Access to cutting-edge technologies and tools

Broadcom is proud to be an equal opportunity employer.  We will consider qualified applicants without regard to race, color, creed, religion, sex, sexual orientation, national origin, citizenship, disability status, medical condition, pregnancy, protected veteran status or any other characteristic protected by federal, state, or local law.  We will also consider qualified applicants with arrest and conviction records consistent with local law.

If you are located outside USA, please be sure to fill out a home address as this will be used for future correspondence.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in United States

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Research Development Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

A global infrastructure technology leader built on more than 60 years of innovation, collaboration and engineering excellence.

 

United States (On-Site)

United States (On-Site)

Prague, Prague, Czechia (On-Site)

San Jose, California, United States (On-Site)

Austin, Texas, United States (On-Site)

Dubai, Dubai, United Arab Emirates (On-Site)

Palo Alto, California, United States (On-Site)

San Jose, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by broadcom

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug