Staff SW Systems Engineer (9291)

1 Month ago • All levels

About the job

Job Description

Seeking a Senior/Staff/Principal Engineer with deep expertise in GPU/TPU acceleration for Edge AI. Must have extensive hands-on experience in local Large Language Model (LLM) inference and embedded GPU/TPU architectures. You'll play a crucial role in shaping future Edge AI solutions by developing and optimizing AI inference models for deployment on edge devices.
Must have:
  • GPU/TPU Acceleration
  • Edge AI Inference
  • Large Language Models
  • Embedded Architectures
Good to have:
  • Micro-architecture Development
  • Performance Profiling
  • Edge Computing Platforms
  • AI Frameworks
Senior/Staff/Principal Engineer – Edge AI LLM 9291

We are seeking a talented Senior/Staff/Principal Engineer with specialized expertise in GPU/TPU acceleration to join our team. The ideal candidate will have extensive hands-on experience in local Large Language Model (LLM) inference on embedded GPU/TPU architectures. As a Principal Engineer specializing in Edge AI, you will play a crucial role in shaping our future Edge AI solutions, leveraging the power of GPU/TPU acceleration and enterprise-grade, large-scale edge compute.
 
The successful candidate will combine technical excellence with effective leadership, creating a positive impact on both projects and team dynamics.

Key Responsibilities:

High-Level Design and Architecture:

    • Influence the Edge AI strategy by providing expert advice on design and architecture.
    • Make critical decisions regarding technical directions, scalability, and system performance.
    • Develop and optimize AI inference models for deployment on edge devices with embedded GPU/TPU accelerators, focusing on local Large Language Model (LLM) inference.
    • Implement and fine-tune low-latency model inference pipelines to meet real-time performance requirements.
    • Collaborate with cross-functional teams to integrate AI inference solutions into edge computing platforms and applications.
    • Collaborate with the GPU Hardware Design Team to design and optimize GPUs that power next-generation devices.
    • Conduct performance profiling and optimization to maximize the efficiency of GPU/TPU acceleration for local LLM inference.
    • Work on micro-architecture development, ensuring efficient execution of graphics, compute, and AI workloads within energy and area constraints.
    • Stay current with advancements in GPU/TPU technologies and edge AI frameworks, incorporating them into solution designs as appropriate.
    • Provide technical expertise and support to project teams, ensuring successful implementation and deployment of edge AI solutions.

Team Leadership:

    • Lead and inspire a team of engineers, providing guidance, setting goals, and ensuring collaboration.
    • Oversee project planning, execution, and delivery, ensuring alignment with business objectives.
    • Manage all phases of technical projects, from conception to completion.
    • Develop project specifications, track progress, and control costs.
    • Foster a positive work environment, encouraging professional growth and knowledge sharing.
About The Company

Get notified when new jobs are added by Extreme Network
