Software Engineer, Speed

1 Month ago • 2 Years + • Research & Development

Job Summary

Job Description

Google Research seeks a Software Engineer specializing in speed optimization for Large Language Models (LLMs). Responsibilities include developing fast inference algorithms, improving decoding and prefill efficiency, analyzing execution using profiling tools, developing compute-efficient models (pre-training, tail/torso patching), introducing architectural changes for efficiency/quality, and collaborating with the Gemini team to meet product needs. The role requires experience in software development, data structures, algorithms, NLP, and ML infrastructure. The ideal candidate will possess a strong understanding of LLM architectures and optimization techniques.
Must have:
  • 2+ years software development experience
  • 2+ years data structures/algorithms experience
  • 1+ year NLP experience
  • 1+ year ML infrastructure experience
  • Develop optimized inference algorithms
  • Improve decoding/prefill efficiency
Good to have:
  • Master's/PhD in CS
  • Experience with accessible technologies

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • 2 years of experience with data structures or algorithms.
  • 1 year of experience with Natural Language Processing (NLP) concepts or techniques.
  • 1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).

Preferred qualifications:

  • Master's degree or PhD in Computer Science or related technical fields.
  • Experience developing accessible technologies.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

Google Research addresses challenges that define the technology of today and tomorrow. From conducting fundamental research to influencing product development, our research teams have the opportunity to impact technology used by billions of people every day.

Our teams aspire to make discoveries that impact everyone, and core to our approach is sharing our research and tools to fuel progress in the field -- we publish regularly in academic journals, release projects as open source, and apply research to Google products.

Responsibilities

  • Develop optimized fast inference algorithms in Google's Large Language Model (LLM) codebases.
  • Improve decoding efficiency/latency and prefillI efficiency/latency.
  • Analyze and profile execution via xprof and other tools.
  • Develop pre-training, tail-patching and torso-patching compute-efficient and token-efficient models.
  • Introduce modeling architectural changes to improve efficiency or quality. Work closely with the Gemini Development and Modeling (GDM), the Gemini team and Product Areas to address the product needs in those areas.

Similar Jobs

CrowdStrike - Sr. Backend Engineer

CrowdStrike

Canada (Remote)
3 Weeks ago
IMC - Quantitative Developer

IMC

Chicago, Illinois, United States (On-Site)
1 Month ago
Grab - Data Scientist

Grab

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Fluence - Asset Performance Engineer

Fluence

Melbourne, Victoria, Australia (Hybrid)
3 Months ago
ByteDance - Video Experience Software Engineer Intern

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Google - Student Researcher, PhD, Winter/Summer 2025

Google

Mountain View, California, United States (On-Site)
7 Months ago
Sony Interactive Entertainment - Open Position: System Software/Embedded Systems

Sony Interactive Entertainment

Tokyo, Japan (On-Site)
2 Months ago
Tesla - Lead/Manager (Power) Electronic/Electrical Design Engineer

Tesla

Brandenburg, Germany (On-Site)
3 Months ago
ByteDance - Machine Learning Engineer - Machine Learning Infrastructure

ByteDance

San Jose, California, United States (On-Site)
7 Months ago
NVIDIA - Senior Technical Program Manager – Silicon Solutions

NVIDIA

Santa Clara, California, United States (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Site Reliability Engineer, Google Cloud Storage

Google

Sydney, New South Wales, Australia (On-Site)
1 Month ago
Epic Games - Senior Software Engineer

Epic Games

Germany (On-Site)
1 Month ago
Google - Software Engineer III, Full Stack, Google Ads

Google

Mountain View, California, United States (On-Site)
7 Months ago
STAGE - Analytics Engineer

STAGE

Noida, Uttar Pradesh, India (On-Site)
2 Months ago
Prophecy Simple Data Labs - Senior Backend Engineer

Prophecy Simple Data Labs

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
ByteDance - Software Engineer Graduate (Applied Machine Learning - Engine) - 2025 Start (BS/MS)

ByteDance

San Jose, California, United States (On-Site)
7 Months ago
High Moon Studios - Senior Gameplay Engineer

High Moon Studios

Carlsbad, California, United States (Hybrid)
1 Month ago
Epic Games - Automation Engineer

Epic Games

Cary, North Carolina, United States (On-Site)
2 Months ago
QuinStreet - Sr. CSS Developer

QuinStreet

Monterrey, Nuevo Leon, Mexico (Remote)
1 Month ago
Google - Software Engineer, Photos, Early Career

Google

Sydney, New South Wales, Australia (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Tel Aviv-Yafo, Tel Aviv District, Israel

Scopely - Senior Product Manager - Yahtzee!

Scopely

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Weeks ago
Playtika - Loyalty Manager

Playtika

Israel (On-Site)
5 Months ago
NVIDIA - Senior STA Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Moon Active - Unity Team Lead

Moon Active

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
11 Months ago
NVIDIA - Senior HPC AI Cluster Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Play Perfect - UX/UI Artist

Play Perfect

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Weeks ago
Tesla - Senior Operations Coordinator, Sales and Service

Tesla

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Entrata - Senior Accountant

Entrata

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
1 Month ago
Google - Verification Lead, Google Cloud

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
NVIDIA - STA Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Trackman - Team Lead - Radar & High-Speed Electronics

Trackman

(On-Site)
2 Months ago
Google - Imaging System Architect

Google

Mountain View, California, United States (On-Site)
1 Month ago
NVIDIA - Senior Product Architect

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
NVIDIA - System Software Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
1 Month ago
Tencent - Senior Regional Game Operation Manager

Tencent

London, England, United Kingdom (On-Site)
3 Months ago
Rivos - Analog Mixed Signal Design

Rivos

Hsinchu, Hsinchu City, Taiwan (Hybrid)
7 Months ago
Riot Games - Staff Software Engineer, Unreal Tools - MMO

Riot Games

Los Angeles, California, United States (On-Site)
7 Months ago
Meta - Software Engineer, Machine Learning

Meta

Redmond, Washington, United States (On-Site)
7 Months ago
Virtuos - Game Programming Internship

Virtuos

Malaysia (On-Site)
1 Month ago
Assystems - Structural Design Engineer

Assystems

Mumbai, Maharashtra, India (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

About The Company

London, England, United Kingdom (On-Site)

Mountain View, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Taipei City, Taiwan (On-Site)

Zürich, Zurich, Switzerland (On-Site)

Kirkland, Washington, United States (On-Site)

New Taipei, New Taipei City, Taiwan (On-Site)

Seattle, Washington, United States (On-Site)

Haryana, India (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug