Member Of Technical Staff (Winter Intern) at Wafer (S25)
Posted 35 minutes ago • All levels • $72,000 - $120,000 per annum
Software Development & Engineering
Job Description
Join our team to build the future of inference, GPU optimization and AI infrastructure. You'll work directly with the team to define our technical direction and build the core systems that power our GPU optimization platform. This role involves building scalable infrastructure for AI model training and inference, and leading technical decisions and architecture choices.
Perks:
Will sponsor
What You'll Do
Build scalable infrastructure for AI model training and inference
Lead technical decisions and architecture choices
What We Look For
Core Technical Expertise
GPU Fundamentals: Deep understanding of GPU architectures, CUDA programming, and parallel computing patterns.
Deep Learning Frameworks: Proficiency in PyTorch, TensorFlow, or JAX, particularly for GPU-accelerated workloads.
LLM/AI Knowledge: Strong grounding in large language models (training, fine-tuning, prompting, evaluation).
Systems Engineering: Proficiency in C++, Python, and possibly Rust/Go for building tooling around CUDA.
Ideal Background
Publications or open-source contributions in inference, GPU computing, or ML/AI for code are a plus.
Hands-on experience with large-scale experiments, benchmarking, and performance tuning.