Senior/Principal DL/LLM Performance Architect
rivos
Job Summary
Join a cutting-edge, well-funded hardware startup in Silicon Valley as a Deep Learning and Large Language Model Performance Architect. The company aims to reimagine silicon and create Risc-V based Accelerated computing platforms that will transform the industry. You will collaborate with talented engineers to develop designs that push the boundaries of performance, energy efficiency, and scalability in a fun, creative, and flexible work environment. Responsibilities include analyzing the performance of key workloads, tuning software, proposing improvements, developing analytical models for target systems, identifying performance bottlenecks, and making recommendations to implementation teams. You will also work with deep learning software engineers and hardware architects, adapt to the evolving AI industry, and contribute across the codebase. Tasks involve pre-silicon and post-silicon performance validation.
Must Have
- MS or PhD in CS, EE, Math, or equivalent
- 5+ years of experience
- In-depth knowledge of DL or LLM models
- Strong background in computer architecture or AI software stack/compilers
- Strong C/C++ programming skills
- Strong hardware modeling skills
- Strong problem-solving and analytical thinking
Good to Have
- Performance modeling and analysis background
- GPU programming experience (CUDA)
- LLVM/MLIR development experience
- Good communication and organizational skills