Software Engineer, Speed

18 Hours ago • 2 Years + • Research & Development

Job Summary

Job Description

Google Research seeks a Software Engineer specializing in speed optimization for Large Language Models (LLMs). Responsibilities include developing fast inference algorithms, improving decoding and prefill efficiency, analyzing execution using profiling tools, developing compute-efficient models (pre-training, tail/torso patching), introducing architectural changes for efficiency/quality, and collaborating with the Gemini team to meet product needs. The role requires experience in software development, data structures, algorithms, NLP, and ML infrastructure. The ideal candidate will possess a strong understanding of LLM architectures and optimization techniques.
Must have:
  • 2+ years software development experience
  • 2+ years data structures/algorithms experience
  • 1+ year NLP experience
  • 1+ year ML infrastructure experience
  • Develop optimized inference algorithms
  • Improve decoding/prefill efficiency
Good to have:
  • Master's/PhD in CS
  • Experience with accessible technologies

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • 2 years of experience with data structures or algorithms.
  • 1 year of experience with Natural Language Processing (NLP) concepts or techniques.
  • 1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).

Preferred qualifications:

  • Master's degree or PhD in Computer Science or related technical fields.
  • Experience developing accessible technologies.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

Google Research addresses challenges that define the technology of today and tomorrow. From conducting fundamental research to influencing product development, our research teams have the opportunity to impact technology used by billions of people every day.

Our teams aspire to make discoveries that impact everyone, and core to our approach is sharing our research and tools to fuel progress in the field -- we publish regularly in academic journals, release projects as open source, and apply research to Google products.

Responsibilities

  • Develop optimized fast inference algorithms in Google's Large Language Model (LLM) codebases.
  • Improve decoding efficiency/latency and prefillI efficiency/latency.
  • Analyze and profile execution via xprof and other tools.
  • Develop pre-training, tail-patching and torso-patching compute-efficient and token-efficient models.
  • Introduce modeling architectural changes to improve efficiency or quality. Work closely with the Gemini Development and Modeling (GDM), the Gemini team and Product Areas to address the product needs in those areas.

Similar Jobs

Google - Software Engineer III, Machine Learning, Google Ads

Google

Los Angeles, California, United States (On-Site)
5 Months ago
Nintendo - Software Engineer (NTD)

Nintendo

Redmond, Washington, United States (On-Site)
10 Months ago
Canva - China App Store Marketing Partnerships Specialist

Canva

Beijing, Beijing, China (Remote)
1 Month ago
Google - Senior Software Engineer, Chrome Autofill

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Week ago
Mozilla - Staff Machine Learning Engineer, Gen AI

Mozilla

Denmark (Remote)
6 Months ago
Google - Signal and Power Integrity Engineer, Machine Learning

Google

Sunnyvale, California, United States (On-Site)
1 Week ago
Riot Games - Researcher III - RDS Central User Research Team

Riot Games

Los Angeles, California, United States (On-Site)
1 Month ago
Ubisoft - Research and Development Scientist

Ubisoft

Montreal, Quebec, Canada (Hybrid)
1 Week ago
Google - Software Engineering Manager, Chrome Sync Server

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Student Researcher (Doubao (Seed) - Foundation Model) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Google - Cloud AI Engineer, Global Services Delivery

Google

Mexico City, Mexico City, Mexico (On-Site)
17 Hours ago
Google - Software Engineer III, Mobile (iOS)

Google

Mountain View, California, United States (On-Site)
1 Week ago
Google - Senior Software Engineer, Core, Education and Activation

Google

Mexico City, Mexico City, Mexico (On-Site)
1 Week ago
Google - Software Engineer III, AI/ML GenAI, Payments

Google

Mountain View, California, United States (On-Site)
1 Week ago
NVIDIA - Signal and Power Integrity Engineer (RDSS Intern)

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago
G5 Games - C++ Gameplay Programmer

G5 Games

Limassol, Limassol, Cyprus (Remote)
5 Months ago
Google - Senior Software Engineer, Ads, ML Infrastructure

Google

Pittsburgh, Pennsylvania, United States (On-Site)
19 Hours ago
NVIDIA - Senior Software Video Engineer

NVIDIA

Ra'anana, Center District, Israel (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Tel Aviv-Yafo, Tel Aviv District, Israel

NVIDIA - Senior Firmware Verification Engineer, PCIe

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
NVIDIA - Senior Physical Design Backend Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
2 Months ago
NVIDIA - Senior Software Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
NVIDIA - STA Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
NVIDIA - Principal Software Architect, GPU Networking Research

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
Plarium - Data Engineer

Plarium

Herzliya, Tel Aviv District, Israel (Hybrid)
9 Months ago
Playtika - User Acquisition Lead

Playtika

Israel (On-Site)
5 Months ago
Plarium - Marketing Business Analyst

Plarium

Herzliya, Tel Aviv District, Israel (On-Site)
1 Month ago
Microsoft - Senior Data Scientist - Microsoft Threat Protection

Microsoft

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Week ago
NVIDIA - Hardware Board Design Manager, IC Product

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Match Group - Senior ML Software Engineering Team Leader

Match Group

Seoul, South Korea (Hybrid)
6 Months ago
Riot Games - Principal Software Engineer, Foundations Developer Experience & Workflows

Riot Games

Los Angeles, California, United States (On-Site)
6 Months ago
NVIDIA - Senior Research Engineer for Reinforcement Learning

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
Krafton  - PUBG IP Franchise China Business PM (6+ years)

Krafton

Seoul, South Korea (On-Site)
2 Months ago
NVIDIA - Senior Platform Software Engineer, PCIe

NVIDIA

Santa Clara, California, United States (On-Site)
2 Weeks ago
Rivos - Logic Equivalence Check (LEC) Engineer

Rivos

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
NVIDIA - Senior Synthesis Flow Development Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Rivos - SOC Physical Design Verification Engineer - Full Time

Rivos

Hsinchu, Hsinchu City, Taiwan (Hybrid)
6 Months ago
Fluence - Lead Engineer - Advanced Battery Modules

Fluence

Houston, Texas, United States (Hybrid)
6 Months ago
ByteDance - SOC System Architect

ByteDance

San Jose, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Dublin, County Dublin, Ireland (On-Site)

New York, New York, United States (On-Site)

Waterloo, Ontario, Canada (On-Site)

Taipei City, Taiwan (On-Site)

San Francisco, California, United States (On-Site)

Saint-Ghislain, Wallonia, Belgium (On-Site)

Bengaluru, Karnataka, India (On-Site)

Austin, Texas, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug