AI Framework Software Engineer

Intel

Job Summary

The Intel Neural Compressor team is seeking an AI Framework Software Engineer to develop Intel Neural Compressor products and tools for Intel AI platforms (CPU, GPU, AI Accelerator). Responsibilities include researching and implementing quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models, as well as exploring efficient model deployment and inference/finetuning acceleration.

Must Have

  • Develop Intel Neural Compressor product and related tools
  • Support Intel AI platform, including CPU, GPU and AI Accelerator
  • Research and implement quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models
  • Explore cutting-edge directions in efficient model deployment and inference/finetuning acceleration
  • Solid understanding of deep learning, deep learning framework and large language model (LLM) fundamentals
  • Familiarity with model compression techniques such as quantization and pruning
  • Proficiency in Python or other programming languages commonly used for deep learning development

Good to Have

  • Passion for technological innovation and practical engineering, with a drive for continuous exploration and improvement
  • Experience in model fine-tuning
  • Experience in inference optimization
  • Experience in related tool development

Job Description

Job Description:

Intel Neural Compressor team is looking for a highly motivated talent to join us

Responsibilities includes:

  • Develop Intel Neural Compressor product and related tools to support Intel AI platform, including CPU, GPU and AI Accelerator
  • Research and implement quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models
  • Explore cutting-edge directions in efficient model deployment and inference/finetuning acceleration.

Qualifications:

  • Major in computer science or related subject
  • Solid understanding of deep learning, deep learning framework and large language model (LLM) fundamentals
  • Familiarity with model compression techniques such as quantization and pruning
  • Proficiency in Python or other programming languages commonly used for deep learning development
  • Passion to work as a professional software engineer and team player
  • Strong self-motivation and problem-solving skill
  • Good English oral and written skill

Preferred Qualifications

  • Passion for technological innovation and practical engineering, with a drive for continuous exploration and improvement
  • Experience in model fine-tuning, inference optimization or related tool development is a plus

Job Type:

College Grad

Shift:

Shift 1 (China)

Primary Location:

PRC, Shanghai

Additional Locations:

Business group:

The Sales and Marketing Group (SMG) leverages the product portfolio to drive Intel's revenue growth and market expansion, blending strategic initiatives with dynamic sales efforts to capture and retain customers. SMG is responsible for empowering the sales force with tools and insights needed to close deals and build lasting customer relationships. Sales analytics and market research ensure strategies are both targeted and impactful. In SMG, disciplined execution, creativity, and ambition are celebrated, providing ample opportunities for career advancement and skill development.

Posting Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust

N/A

Work Model for this Role

This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.

7 Skills Required For This Role

Revenue Growth Team Player Game Texts Market Research Model Deployment Deep Learning Python

Similar Jobs