Cloud Machine Learning LLM Serving Senior engineer

14 Hours ago • 3 Years +
Research Development

Job Description

The Qualcomm Cloud Computing team is seeking a Senior Engineer to develop hardware and software for Machine Learning solutions across data center, edge, infrastructure, and automotive markets. This role involves machine learning framework development, spanning the entire product lifecycle from design to deployment, in a fast-paced, cross-functional environment.
Good To Have:
  • Deep Learning experience (LLMs, NLP, Vision, Audio, Recommendation systems).
  • Knowledge of the PyTorch and TensorFlow software stacks.
  • Excellent C/C++/Python programming and software design skills.
  • Debugging, performance analysis, and test design skills.
  • Ability to work independently and lead development efforts.
  • Proficiency in open-source development practices.
  • Research mindset and innovation drive.
  • Strong problem-solving abilities.
  • Knowledge of tiling and scheduling ML operators.
  • Experience with advanced C++14 features.
  • Software profiling and optimization techniques experience.
  • Hands-on experience with SIMD and/or multi-threaded high-performance code.
  • Experience with ML compilers and automatic code generation (MLIR).
  • Experience running workloads on large scale heterogeneous clusters.
  • Hands-on experience with CUDA and cuDNN.
Must Have:
  • Optimize Deep Learning models on Qualcomm AI 100.
  • Develop deep learning framework extensions for Qualcomm AI 100 in open-source.
  • Implement kernels for AI workloads.
  • Analyze and optimize deep learning training and inference.
  • Build software tools and ecosystem for AI SW Stack.
  • Develop abstraction layers for inference accelerators using vLLM, Triton, ExecuTorch, Inductor, TorchDynamo.
  • Optimize workloads for multi-SoC and multi-card systems.
  • Optimize deep learning pipeline including graph compiler integration.
  • Apply software engineering best practices.
  • 3+ years in Software Engineering or related field.
  • 3+ years programming in C++, Python.
Perks:
  • World-class health benefits for employees and dependents.
  • Programs for financial security and future planning.
  • Resources for emotional/mental strength, resilience, and purpose.
  • Wellbeing programs for living and working well.
  • Continuous learning and development programs.
  • Tuition reimbursement.
  • Mentorships.


Job Area: Engineering Group, Engineering Group > Software Engineering

General Summary:

Job Overview:
The Qualcomm Cloud Computing team is developing hardware and software for Machine Learning solutions spanning the data center, edge, infrastructure, and automotive markets. We are seeking ambitious, bright, and innovative engineers with experience in machine learning framework development. Job activities span the whole product life cycle from early design to commercial deployment. The environment is fast-paced and requires daily cross-functional interaction, so good communication, planning, and execution skills are a must.

Key Responsibilities:
  • Improve and optimize key Deep Learning models on Qualcomm AI 100.
  • Build deep learning framework extensions for Qualcomm AI 100 in upstream open-source repositories.
  • Implement kernels for AI workloads.
  • Collaborate with internal teams to analyze and optimize training and inference for deep learning.
  • Build software tools and ecosystem around the AI SW Stack.
  • Work on vLLM, Triton, ExecuTorch, Inductor, and TorchDynamo to build abstraction layers for inference accelerators.
  • Optimize workloads for both scale-up (multi-SoC) and scale-out (multi-card) systems.
  • Optimize the entire deep learning pipeline, including graph compiler integration.
  • Apply knowledge of software engineering best practices.

Desirable Skills and Aptitudes:
  • Deep Learning experience or knowledge: LLMs, Natural Language Processing, Vision, Audio, Recommendation systems.
  • Knowledge of the structure and function of the different components of the PyTorch and TensorFlow software stacks.
  • Excellent C/C++/Python programming and software design skills, including debugging, performance analysis, and test design.
  • Ability to work independently, define requirements and scope, and lead your own development effort.
  • Well versed in open-source development practices.
  • Strong developer with a research mindset who strives to innovate.
  • Avid problem solver who can find solutions to key engineering and domain problems.
  • Knowledge of tiling and scheduling a machine learning operator is a plus.
  • Experience using C++14 (advanced features).
  • Experience with software profiling and optimization techniques.
  • Hands-on experience writing SIMD and/or multi-threaded high-performance code is a plus.
  • Experience with ML compilers and automatic code generation (using MLIR) is a plus.
  • Experience running workloads on large-scale heterogeneous clusters is a plus.
  • Hands-on experience with CUDA and cuDNN is a plus.

Qualifications:
  • Bachelor's, Master's, or PhD degree in Engineering, Machine Learning/AI, Information Systems, Computer Science, or related field.
  • 3+ years of Software Engineering or related work experience.
  • 3+ years of experience with a programming language such as C++ or Python.

Minimum Qualifications:
  • Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.
    OR
    Master's degree in Engineering, Information Systems, Computer Science, or related field and 1+ year of Software Engineering or related work experience.
    OR
    PhD in Engineering, Information Systems, Computer Science, or related field.
  • 2+ years of academic or work experience with a programming language such as C, C++, Java, Python, etc.

Applicants: Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities so that they are able to participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries.)

Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.

To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.

If you would like more information about this role, please contact Qualcomm Careers.
