Cloud Machine Learning LLM Serving Engineer

3 Days ago • 2 Years +

Job Summary

Job Description

The Qualcomm Cloud Computing team is developing hardware and software for Machine Learning solutions. This role involves improving and optimizing Deep Learning models on Qualcomm AI 100, building deep learning framework extensions, implementing Kernels for AI workloads, and collaborating with internal teams for optimization. Responsibilities also include building software tools, working on various technologies like vLLM, Triton, and optimizing workloads for scale-up and scale-out systems, and applying knowledge of software engineering best practices. The role requires strong communication, planning, and execution skills.
Must have:
  • Improve and optimize Deep Learning models on Qualcomm AI 100.
  • Build deep learning framework extensions for Qualcomm AI 100.
  • Implement Kernels for AI workloads.
  • Excellent C/C++/Python programming and software design skills.
  • Bachelor's degree in Engineering, Machine learning/ AI, Computer Science.
Good to have:
  • Knowledge of tiling and scheduling a Machine learning operator is a plus.
  • Experience in using C++ 14 (advanced features)
  • Experience of profiling software and optimization techniques
  • Experience of ML compiler, Auto-code generation (using MLIR) is a plus.
  • Hands-on experience with CUDA, CUDNN is a plus.

Job Details

Job Description

Job Posting Date

2025-06-03


Company:

Qualcomm India Private Limited

Job Area:

Engineering Group, Engineering Group > Software Engineering

General Summary:

JD for Cloud Machine Learning LLM Serving engineer

Job Overview:

The Qualcomm Cloud Computing team is developing hardware and software for Machine Learning solutions spanning the data center, edge, infrastructure, automotive market. We are seeking ambitious, bright, and innovative engineers with experience in machine learning framework development. Job activities span the whole product life cycle from early design to commercial deployment. The environment is fast-paced and requires cross-functional interaction daily so good communication, planning and execution skills are a must.


Key Responsibilities

  • Improve and optimize key Deep Learning models on Qualcomm AI 100.
  • Build deep learning framework extensions for Qualcomm AI 100 in upstream open-source repositories.
  • Implement Kernels for AI workloads
  • Collaborate and interact with internal teams to analyze and optimize training and inference for deep learning.
  • Build software tools and ecosystem around AI SW Stack.
  • Work on vLLM, Triton, ExecuTorch, Inductor, TorchDynamo to build abstraction layers for inference accelerator.
  • Optimize workloads for both scale-up (multi-SoC) and scale-out (multi-card) systems.
  • Optimize the entire deep learning pipeline including graph compiler integration.
  • Apply knowledge of software engineering best practices.

Desirable Skills and Aptitudes

  • Deep Learning experience or knowledge – LLMs, Natural Language Processing, Vision, Audio, Recommendation systems.
  • Knowledge of the structure and function of different components of Pytorch, TensorFlow software stacks.
  • Excellent C/C++/Python programming and software design skills, including debugging, performance analysis, and test design.
  • Ability to work independently, define requirements and scope, and lead your own development effort.
  • Well versed with open-source development practices.
  • Strong developer with a research mindset – strives to innovate.
  • Avid problem solver – should be able to find solutions to key engineering and domain problems.

  • Knowledge of tiling and scheduling a Machine learning operator is a plus.
  • Experience in using C++ 14 (advanced features)
  • Experience of profiling software and optimization techniques
  • Hands on experience writing SIMD and/or multi-threaded high-performance code is a plus.
  • Experience of ML compiler, Auto-code generation (using MLIR) is a plus.
  • Experiences to run workloads on large scale heterogeneous clusters is a plus.
  • Hands-on experience with CUDA, CUDNN is a plus.

Qualifications:

  • Bachelor's / Masters/ PHD degree in Engineering, Machine learning/ AI, Information Systems, Computer Science, or related field.
  • 2+ years Software Engineering or related work experience.
  • 2+ years’ experience with Programming Language such as C++, Python.

Minimum Qualifications:

• Bachelor's degree in Engineering, Information Systems, Computer Science, or related field.

Applicants: Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).

Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.

To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.

If you would like more information about this role, please contact Qualcomm Careers.

Similar Jobs

Canva - Senior Computer Vision Engineer - Photo AI

Canva

Vienna, Vienna, Austria (Remote)
2 Months ago
PwC - Senior AI Developer - Roma [DIG]

PwC

Rome, Lazio, Italy (On-Site)
8 Months ago
Zscaler - Staff Data Science Engineer

Zscaler

San Jose, California, United States (On-Site)
4 Weeks ago
ByteDance - Research Engineer Graduate (Machine Learning Sys-US) - 2024 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Riot Games - Staff Software Engineer, Machine Learning - AI Foundations

Riot Games

Los Angeles, California, United States (On-Site)
1 Year ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Illuminia - ML/Software Engineer 1 - MLOps

Illuminia

Singapore, Singapore (On-Site)
2 Weeks ago
NVIDIA - Senior AI-HPC Cluster Engineer

NVIDIA

Westford, Massachusetts, United States (Hybrid)
2 Months ago
Attentive - Staff Software Engineer, Personalization Engine

Attentive

(Remote)
1 Month ago
Google - Senior Research Engineer, AI/ML

Google

London, England, United Kingdom (On-Site)
1 Month ago
Canva - Senior Machine Learning Engineer - Photo AI

Canva

Vienna, Vienna, Austria (Remote)
4 Months ago
Scale AI - Machine Learning Engineer, Enterprise GenAI

Scale AI

San Francisco, California, United States (On-Site)
1 Month ago
Microsoft - Applied Scientist: Microsoft AI – PhD – Redmond

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
Pentair - Engineer- Data Science

Pentair

Noida, Uttar Pradesh, India (On-Site)
1 Month ago
ByteDance - Research Engineer Graduate (Machine Learning Sys-US) - 2024 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
NVIDIA - Senior Math Libraries Engineers - Python APIs

NVIDIA

Louisiana, United States (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

bosh group india - IT Support Specialist

bosh group india

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Tide - Head of Procurement - Technology, SaaS & Services

Tide

Hyderabad, Telangana, India (Hybrid)
1 Month ago
Index Exchange - Backend Engineer, Data Products

Index Exchange

Bengaluru, Karnataka, India (On-Site)
8 Months ago
Highspot - Sr. Software Engineer, Search and AI

Highspot

Hyderabad, Telangana, India (Hybrid)
1 Month ago
Capgemini - Manual Tester

Capgemini

Pune, Maharashtra, India (On-Site)
1 Month ago
Workato - AI Solutions Architect

Workato

Bengaluru, Karnataka, India (On-Site)
4 Weeks ago
Google - Product Manager, Google Distributed Cloud, Compliance and Security

Google

Bengaluru, Karnataka, India (On-Site)
1 Month ago
PwC - IN_Senior Associate_Tableau Developer_Data & Analytics_Advisory_PAN India

PwC

Gurugram, Haryana, India (On-Site)
7 Months ago
PwC - Senior Associate_Databricks_Data & Analytics_Advisory_PAN  India

PwC

Kolkata, West Bengal, India (On-Site)
8 Months ago
Zones LLC - Network Engineer L2

Zones LLC

Bengaluru, Karnataka, India (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Our employees make Qualcomm’s success possible. We hire the brightest minds and foster a supportive, inclusive culture where your ideas have the power to contribute to world-changing innovations and breakthrough technologies. To make that possible, we leverage the breadth and depth of our diverse expertise from around the world to answer the unasked, conquer the complex, and solve some of the biggest challenges only we can – together.

Bengaluru, Karnataka, India (On-Site)

Noida, Uttar Pradesh, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Chennai, Tamil Nadu, India (On-Site)

View All Jobs

Get notified when new jobs are added by Qualcomm

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug