AI Researcher

Gimlet Labs

Job Summary

Gimlet Labs is building the foundation for the next generation of AI applications, focusing on redefining AI inference for breakthrough performance and efficiency. As an AI Researcher, you will evaluate and implement techniques to optimize AI models, explore new architectures, and experiment with novel inference efficiency techniques like KV caching and FlashAttention. The role involves designing and prototyping frameworks leveraging fine-tuning and knowledge distillation to push model performance boundaries.

Must Have

  • Monitoring and evaluating cutting-edge AI research
  • Researching ways to improve model accuracy, performance and efficiency
  • Prototyping frameworks with the latest fine-tuning and distillation techniques
  • Master’s or PhD degree in computer science, engineering, applied mathematics or comparable area of study
  • Experience with AI/ML or applied data science

Good to Have

  • Experience with PyTorch, TensorFlow, vLLM, ONNX and other AI frameworks
  • Software development experience with Python and C++
  • Understanding of the latest AI research and techniques
  • Strong foundation in statistical analysis

Job Description

Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefining AI inference from the ground up, combining cutting-edge research with an integrated hardware-software stack that delivers breakthrough performance, efficiency, and model quality. Gimlet pairs its inference stack with a seamless developer experience, allowing users to deploy, manage, and monitor AI workloads from frameworks like PyTorch and LangChain at production scale in seconds.

Gimlet is spun out of a Stanford research project under Professors Zain Asgar and Sachin Katti. The founding team has deep experience across AI, distributed systems, and hardware with previous successful exits.

Gimlet Labs is seeking an AI Researcher. As an AI Researcher, you will be evaluating and implementing techniques to drive performance and quality optimizations across the latest AI models. The research team is responsible for exploring new model architectures and experimenting with novel inference efficiency techniques such as KV caching and FlashAttention. The team will design and prototype frameworks leveraging fine-tuning and knowledge distillation to push the boundaries of model performance.

Responsibilities:

  • Monitoring and evaluating cutting-edge AI research
  • Researching ways to improve model accuracy, performance and efficiency
  • Prototyping frameworks with the latest fine-tuning and distillation techniques

Qualifications:

  • Master’s or PhD degree in computer science, engineering, applied mathematics or comparable area of study
  • Experience with AI/ML or applied data science.

Preferred Qualifications:

  • Experience with PyTorch, TensorFlow, vLLM, ONNX and other AI frameworks
  • Software development experience with Python and C++
  • Understanding of the latest AI research and techniques
  • Strong foundation in statistical analysis

7 Skills Required For This Role

Cpp Game Texts Prototyping Data Science Pytorch Python Tensorflow

Similar Jobs