Edge AI Staff Engineer

2 Months ago • 5 Years +

Job Summary

Job Description

The Edge AI Staff Engineer will be responsible for shaping the future Edge AI solution, utilizing GPU/TPU acceleration and enterprise-grade, large-scale edge compute. Key responsibilities include designing and architecting edge AI solutions, optimizing AI inference models for edge devices with GPU/TPU accelerators, collaborating with cross-functional teams, and leading a team of engineers. This role involves technical leadership, project management, and ensuring successful implementation of edge AI solutions, as well as staying current with the latest advancements in GPU/TPU technologies and edge AI frameworks. The engineer will also conduct performance profiling and optimization to maximize the efficiency of GPU/TPU acceleration for local LLM inference.
Must have:
  • Experience in AI model development and deployment.
  • Strong programming skills in Python and C++.
  • Proficiency in LLM frameworks and deep learning libraries.
  • Extensive experience with GPU/TPU acceleration for AI inference.
  • Hands-on experience with one or more GPU frameworks.
Good to have:
  • Experience with edge device hardware and software integration.
  • Familiarity with edge computing architectures and IoT platforms.
  • Experience with edge AI applications in robotics, autonomous vehicles, or industrial automation.

Job Details

Over 50,000 customers globally trust our end-to-end, cloud-driven networking solutions. They rely on our top-rated services and support to accelerate their digital transformation efforts and deliver unprecedented progress. With double-digit growth year over year, no provider is better positioned to deliver scalable outcomes than Extreme.

Inclusion is one of our core values and in our DNA. We are committed to fostering an inclusive workplace that embraces our differences and creates an atmosphere where all our employees thrive because of their differences, not in spite of them.

Become part of something big with Extreme! As a global networking leader, learn why there’s no better time to join the Extreme team.

Job Description:

We are seeking a talented Edge AI Staff Engineer with specialized expertise in GPU/TPU acceleration to join our team. The ideal candidate will have extensive hands-on experience in local Large Language Model (LLM) inference on embedded GPU/TPU architectures. As a Staff Engineer specializing in Edge AI, you will play a crucial role in shaping our future Edge AI solution, leveraging the power of GPU/TPU acceleration and enterprise-grade, large-scale edge compute.
 
The successful candidate will combine technical excellence with effective leadership, creating a positive impact on both projects and team dynamics.

Key Responsibilities

    • High-Level Design and Architecture:
    • Influence the Edge AI strategy by providing expert advice on design and architecture.
    • Make critical decisions regarding technical directions, scalability, and system performance.
    • Develop and optimize AI inference models for deployment on edge devices with embedded GPU/TPU accelerators, focusing on local Large Language Model (LLM) inference.
    • Implement and fine-tune low-latency model inference pipelines to meet real-time performance requirements.
    • Collaborate with cross-functional teams to integrate AI inference solutions into edge computing platforms and applications.
    • Collaborate with the GPU Hardware Design Team to design and optimize GPUs that power next-generation devices.
    • Conduct performance profiling and optimization to maximize the efficiency of GPU/TPU acceleration for local LLM inference.
    • Work on micro-architecture development, ensuring efficient execution of graphics, compute, and AI workloads within energy and area constraints.
    • Stay current with advancements in GPU/TPU technologies and edge AI frameworks, incorporating them into solution designs as appropriate.
    • Provide technical expertise and support to project teams, ensuring successful implementation and deployment of edge AI solutions.

    • Team Leadership:
    • Lead and inspire a team of engineers, providing guidance, setting goals, and ensuring collaboration.
    • Oversee project planning, execution, and delivery, ensuring alignment with business objectives.
    • Manage all phases of technical projects, from conception to completion.
    • Develop project specifications, track progress, and control costs.
    • Foster a positive work environment, encouraging professional growth and knowledge sharing.

Qualifications:

    • Bachelor’s degree in Computer Science, Engineering, or a related field; Master’s degree preferred.
    • 5+ years of hands-on experience in AI model development and deployment, with a focus on edge computing and local LLM inference.
    • Strong programming skills in languages such as Python and C++.
    • Proficiency in LLM frameworks (e.g., vLLM, Text Generation Inference, OpenLLM, Ray Serve, and Hugging Face Transformers) and deep learning libraries.
    • Extensive experience with GPU/TPU acceleration for AI inference, including optimization techniques (tensor, pipeline, data, and sharded data parallelism) and performance tuning.
    • Hands-on experience with one or more GPU frameworks: CUDA, Vulkan, OpenCL.
    • Deep knowledge of GPU memory layout and familiarity with NVIDIA Jetson, ARM Mali, or relevant SoC configurations.
    • Knowledge of parallel computation, memory scheduling, and structural optimization.
    • Excellent problem-solving and analytical skills, with a passion for innovation and continuous learning.

Additional Skills (Preferred):

    • Experience with edge device hardware and software integration.
    • Familiarity with edge computing architectures and IoT platforms.
    • Experience with edge AI applications in domains such as robotics, autonomous vehicles, or industrial automation.
Extreme Networks, Inc. (EXTR) creates effortless networking experiences that enable all of us to advance. We push the boundaries of technology leveraging the powers of machine learning, artificial intelligence, analytics, and automation. Over 50,000 customers globally trust our end-to-end, cloud-driven networking solutions and rely on our top-rated services and support to accelerate their digital transformation efforts and deliver progress like never before. For more information, visit Extreme's website or follow us on Twitter, LinkedIn, and Facebook.

We encourage people from underrepresented groups to apply. Come Advance with us! In keeping with our values, no employee or applicant will face discrimination/harassment based on: race, color, ancestry, national origin, religion, age, gender, marital domestic partner status, sexual orientation, gender identity, disability status, or veteran status. Above and beyond discrimination/harassment based on “protected categories,” Extreme Networks also strives to prevent other, subtler forms of inappropriate behavior (e.g., stereotyping) from ever gaining a foothold in our organization. Whether blatant or hidden, barriers to success have no place at Extreme Networks.
