Engineer, Staff-Machine Learning-Embedded,C++
Qualcomm
Job Summary
As a Staff Machine Learning Engineer at Qualcomm, you will design, develop, and validate embedded and cloud edge software, applications, and utility programs. This role involves joining the Generative AI team to integrate cutting-edge GenAI models on Qualcomm chipsets, leveraging heterogeneous computing for on-device inference. You will spearhead the development and commercialization of the Qualcomm AI Runtime (QAIRT) SDK, optimizing performance for large models. Expertise in deploying C/C++ software stacks and understanding GenAI advancements like LLMs/Transformers for edge deployment is crucial.
Must Have
- 6+ years of software development experience
- Strong C/C++ programming, Design Patterns, OS concepts
- Good Python scripting skills
- Understanding of Generative AI models (LLM, LVM, LMMs)
- Knowledge of self-attention, cross-attention, KV caching
- Familiarity with Floating-point, Fixed-point, Quantization concepts
- Experience optimizing algorithms for AI hardware accelerators (CPU/GPU/NPU)
- Excellent analytical and debugging skills
- Good communication skills
Good to Have
- Understanding of SIMD processor architecture and system design
- Proficiency in object-oriented software development
- Familiarity with Linux and Windows environments
- Background in kernel development for SIMD architectures
- Familiarity with frameworks like llama.cpp, MLX, MLC
- Knowledge of PyTorch, TFLite, ONNX Runtime
- Experience with parallel computing systems (OpenCL, CUDA)
Perks & Benefits
- World-class health benefits
- Financial security programs
- Self and family resources for emotional/mental strength
- Wellbeing programs (Live+Well, Work+Well)
- Continuous learning and development programs
- Tuition reimbursement
- Mentorships
Job Description
Job Posting Date
2026-01-04
- * *
General Summary:
As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Software Engineer, you will design, develop, create, modify, and validate embedded and cloud edge software, applications, and/or specialized utility programs that launch cutting-edge, world class products that meet and exceed customer needs. Software Engineers collaborate with systems, hardware, architecture, test engineers, and other teams to design system-level software solutions and obtain information on performance requirements and interfaces.
Minimum Qualifications:
• Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 4+ years of Software Engineering or related work experience.
OR
Master's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience.
OR
PhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.
• 2+ years of work experience with Programming Language such as C, C++, Java, Python, etc.
Join the exciting Generative AI team at focused on integrating cutting edge GenAI models on chipsets. The team uses chips’ extensive heterogeneous computing capabilities to allow inference of GenAI models on-device without a need for connection to the cloud. Our inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at blazing speeds while still sipping the smallest amount of power. Utilize this power efficient hardware and Software stack to run Large Language Models (LLMs) and Large Vision Models (LVM) at near GPU speeds!
Responsibilities:
In this role, you will spearhead the development and commercialization of the AI Runtime (QAIRT) SDK on SoCs. As an AI inferencing expert, you'll push the limits of performance from large models. Your mastery in deploying large C/C++ software stacks using best practices will be essential. You'll stay on the cutting edge of GenAI advancements, understanding LLMs/Transformers and the nuances of edge-based GenAI deployment. Most importantly, your passion for the role of edge in AI's evolution will be your driving force.
Requirements:
Master’s/Bachelor’s degree in computer science or equivalent.
6+ years of relevant work experience in software development.
Strong understanding of Generative AI models – LLM, LVM, LMMs and building blocks (self-attention, cross attention, kv caching etc.)
Floating-point, Fixed-point representations and Quantization concepts.
Experience with optimizing algorithms for AI hardware accelerators (like CPU/GPU/NPU).
Strong in C/C++ programming, Design Patterns and OS concepts.
Good scripting skills in Python.
Excellent analytical and debugging skills.
Good communication skills (verbal, presentation, written).
Ability to collaborate across a globally diverse team and multiple interests.
Preferred Qualifications
· Strong understanding of SIMD processor architecture and system design.
· Proficiency in object-oriented software development and familiarity
· Familiarity with Linux and Windows environment
· Strong background in kernel development for SIMD architectures.
· Familiarity with frameworks like llama.cpp, MLX, and MLC is a plus.
· Good knowledge of PyTorch, TFLite, and ONNX Runtime is preferred.
· Experience with parallel computing systems and languages like OpenCL and CUDA is a plus.
Applicants: is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com
or call 's toll-free number found here
. Upon request, will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).
expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.
To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at . Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, employees or any other company location. is not responsible for any fees related to unsolicited resumes/applications.
If you would like more information about this role, please contact Careers
.