Senior Research Engineer, JAX

All levels • $190,050 PA - $217,200 PA
Software Development & Engineering

Job Description

We are seeking a highly skilled Senior Research Engineer to collaborate closely with both Research and Engineering teams. The role involves diagnosing and resolving bottlenecks across large-scale distributed training, data processing, and inference systems, while also driving optimizations for existing high-performance pipelines. This cross-functional position requires strong technical rigor, intellectual curiosity, and excellent communication. You will maintain and evolve our JAX training framework, optimize production JAX inference systems, refactor model architectures, and design scalable workloads for TPU and GPU architectures, acting as a bridge between research and production engineering.
Good To Have:
  • GPU/CUDA expertise.
  • Domain knowledge in Speech-to-Text (ASR architectures, audio processing, streaming inference).
  • C++ or Rust experience for kernel-level work.
Must Have:
  • Expert-level proficiency with JAX and its ecosystem (Flax, Optax, XLA compilation pipeline).
  • Strong experience optimizing inference systems for production, ideally with LLMs or speech models.
  • Hands-on experience with TPU programming and optimization.
  • Familiarity with modern inference optimization techniques: continuous batching, KV-cache management, sharding strategies, quantization.
  • Strong Python skills.
  • Deep understanding of distributed training at scale and ML infrastructure best practices.
Perks:
  • Remote work

About the Role

We are seeking a highly skilled Senior Research Engineer to collaborate closely with both Research and Engineering teams. The role involves diagnosing and resolving bottlenecks across large-scale distributed training, data processing, and inference systems, while also driving optimizations for existing high-performance pipelines.

The ideal candidate has a deep understanding of modern deep learning systems, combined with strong engineering expertise in layer-level optimization, large-scale distributed training, streaming and low-latency inference, asynchronous inference, inference compilers, and advanced parallelization techniques.

This is a cross-functional role requiring strong technical rigor, attention to detail, intellectual curiosity, and excellent communication skills. The position is embedded within the Research team and is responsible for developing and refining the technical foundation that enables cutting-edge research and translates its outcomes into production, serving as the bridge between research and production engineering.

What You’ll Do

  • Maintain and evolve our JAX training framework, ensuring scalability and efficiency for large-scale distributed training runs.
  • Optimize production JAX inference systems for speech-to-text models using advanced techniques like continuous batching, model sharding, paged attention, and quantization.
  • Refactor and modernize model architectures and infrastructure, translating research prototypes into production-ready systems.
  • Investigate and resolve performance bottlenecks across the stack—from low-level kernels (XLA, Pallas) to high-level system design.
  • Design and deploy scalable, distributed workloads optimized for TPU and GPU architectures.
  • Bridge Research and Engineering teams, ensuring seamless knowledge transfer and alignment on technical priorities.

What You’ll Need

  • Expert-level proficiency with JAX and its ecosystem (Flax, Optax, XLA compilation pipeline).
  • Strong experience optimizing inference systems for production, ideally with LLMs or speech models.
  • Hands-on experience with TPU programming and optimization; GPU/CUDA expertise is also valuable.
  • Passion for refactoring and improving existing systems—you thrive on making code faster, cleaner, and more maintainable.
  • Familiarity with modern inference optimization techniques: continuous batching, KV-cache management, sharding strategies, quantization.
  • Domain knowledge in Speech-to-Text (ASR architectures, audio processing, streaming inference) is a plus.
  • Strong Python skills; C++ or Rust experience for kernel-level work is a plus.
  • Deep understanding of distributed training at scale and ML infrastructure best practices.
  • Excellent communication skills and a collaborative mindset—you can clearly explain complex tradeoffs and prioritize high-impact work.

Set alerts for new jobs by Assembly AI
Set alerts for new Software Development & Engineering jobs in Switzerland
Set alerts for new jobs in Switzerland