Lead Engineer, Senior-Machine Learning Tools

Qualcomm

Job Summary

As a Lead Engineer, Senior-Machine Learning Tools at Qualcomm, you will join the Generative AI team to develop and commercialize the Qualcomm AI Runtime (QAIRT) SDK on Qualcomm SoCs. This role involves integrating cutting-edge GenAI models on Qualcomm chipsets, leveraging heterogeneous computing capabilities for on-device inference. You will be responsible for pushing performance limits of large models, deploying C/C++ software stacks, and staying updated on GenAI advancements, particularly LLMs/Transformers and edge-based deployment.

Must Have

  • Spearhead development and commercialization of Qualcomm AI Runtime (QAIRT) SDK.
  • Push performance limits from large models as an AI inferencing expert.
  • Deploy large C/C++ software stacks using best practices.
  • Stay on the cutting edge of GenAI advancements.
  • Understand LLMs/Transformers and nuances of edge-based GenAI deployment.
  • Master’s/Bachelor’s degree in computer science or equivalent.
  • 6+ years of relevant work experience in software development.
  • Strong understanding of Generative AI models – LLM, LVM, LMMs and building blocks (self-attention, cross attention, kv caching etc.).
  • Floating-point, Fixed-point representations and Quantization concepts.
  • Experience with optimizing algorithms for AI hardware accelerators (like CPU/GPU/NPU).
  • Strong in C/C++ programming, Design Patterns and OS concepts.
  • Good scripting skills in Python.
  • Excellent analytical and debugging skills.
  • Good communication skills (verbal, presentation, written).
  • Ability to collaborate across a globally diverse team and multiple interests.

Good to Have

  • Strong understanding of SIMD processor architecture and system design.
  • Proficiency in object-oriented software development.
  • Familiarity with Linux and Windows environment.
  • Strong background in kernel development for SIMD architectures.
  • Familiarity with frameworks like llama.cpp, MLX, and MLC.
  • Good knowledge of PyTorch, TFLite, and ONNX Runtime.
  • Experience with parallel computing systems and languages like OpenCL and CUDA.

Perks & Benefits

  • World-class health benefit options providing world-class coverage to employees and their eligible dependents.
  • Programs designed to help employees build and prepare for a financially secure future.
  • Self and family resources to build emotional/mental strength and resilience, as well as define purpose.
  • Wellbeing programs and resources to help employees Live+Well and Work+Well.
  • Continuous learning and development programs.
  • Tuition reimbursement.
  • Mentorships.

Job Description

Job Posting Date

2026-01-04

General Summary:

As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Software Engineer, you will design, develop, create, modify, and validate embedded and cloud edge software, applications, and/or specialized utility programs that launch cutting-edge, world class products that meet and exceed customer needs. Qualcomm Software Engineers collaborate with systems, hardware, architecture, test engineers, and other teams to design system-level software solutions and obtain information on performance requirements and interfaces.

Minimum Qualifications:

• Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience.

OR

• Master's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.

OR

• PhD in Engineering, Information Systems, Computer Science, or related field and 1+ year of Software Engineering or related work experience.

• 2+ years of academic or work experience with Programming Language such as C, C++, Java, Python, etc.

Job Description

Join the exciting Generative AI team at Qualcomm focused on integrating cutting edge GenAI models on Qualcomm chipsets. The team uses Qualcomm chips’ extensive heterogeneous computing capabilities to allow inference of GenAI models on-device without a need for connection to the cloud. Our inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at blazing speeds while still sipping the smallest amount of power. Utilize this power efficient hardware and Software stack to run Large Language Models (LLMs) and Large Vision Models (LVM) at near GPU speeds!

Responsibilities:

In this role, you will spearhead the development and commercialization of the Qualcomm AI Runtime (QAIRT) SDK on Qualcomm SoCs. As an AI inferencing expert, you'll push the limits of performance from large models. Your mastery in deploying large C/C++ software stacks using best practices will be essential. You'll stay on the cutting edge of GenAI advancements, understanding LLMs/Transformers and the nuances of edge-based GenAI deployment. Most importantly, your passion for the role of edge in AI's evolution will be your driving force.

Requirements:

Master’s/Bachelor’s degree in computer science or equivalent.

6+ years of relevant work experience in software development.

Strong understanding of Generative AI models – LLM, LVM, LMMs and building blocks (self-attention, cross attention, kv caching etc.)

Floating-point, Fixed-point representations and Quantization concepts.

Experience with optimizing algorithms for AI hardware accelerators (like CPU/GPU/NPU).

Strong in C/C++ programming, Design Patterns and OS concepts.

Good scripting skills in Python.

Excellent analytical and debugging skills.

Good communication skills (verbal, presentation, written).

Ability to collaborate across a globally diverse team and multiple interests.

Preferred Qualifications

· Strong understanding of SIMD processor architecture and system design.

· Proficiency in object-oriented software development and familiarity

· Familiarity with Linux and Windows environment

· Strong background in kernel development for SIMD architectures.

· Familiarity with frameworks like llama.cpp, MLX, and MLC is a plus.

· Good knowledge of PyTorch, TFLite, and ONNX Runtime is preferred.

· Experience with parallel computing systems and languages like OpenCL and CUDA is a plus.

Applicants: Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).

To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.

If you would like more information about this role, please contact Qualcomm Careers.

Job Application Privacy Notice

Use of AI in the Application Process

Equal Employment Opportunity

"EEO is the Law" Poster Supplement

Pay Transparency Non-Discrimination Provision

Employee Polygraph Protection Act

Family Medical Leave Act

Rights of Pregnant Employees

Discrimination and Harassment

California Family Rights Act

Qualcomm Right to Inspect

14 Skills Required For This Role

Communication Problem Solving Design Patterns Cpp Game Texts Cuda Opencl Linux Pytorch Python Algorithms Java System Design Machine Learning

Similar Jobs