Data Parallel Accelerator Performance Intern

1 month ago • Up to 1 year • Research & Development

About the job

Summary

Our mission is to create computing platforms (HW/SW co-design) that will transform the industry with the most advanced technologies. As a Data Parallel Accelerator (DPA) performance intern, you will be given a project to improve SoC-level performance per watt through memory management innovations. You will work with internal SW (e.g., OS, kernel, framework) and silicon (e.g., RTL, power, performance) team members.

Requirements
- Knowledge in one or more of the following areas: computer architecture, performance modeling, and analytical modeling.
- Knowledge and experience with common LLM (Large Language Model) workloads.
- Proficiency in C or C++, and scripting languages such as Python.
- Experience with high-level simulators for performance or power estimation is a plus.
- Knowledge in server-class GPU/ML architecture is a plus.

Responsibilities
- Implement an analytical model of memory usage for LLM inference and training.
- Run performance simulations to extract workload characteristics such as memory footprint and bandwidth requirements.
- Evaluate ideas for performance improvement.

Minimum Education & Experience
Current EE or CS master's or Ph.D. students with a computer architecture background.

About The Company

United States (Hybrid)

Hsinchu City, Taiwan (Hybrid)

Karnataka, India (Hybrid)

