Senior/Staff Software Engineer (CUDA Expert)

1 Month ago • All levels

Job Summary

Job Description

As a Software Engineer in the AI Core BU at Nu, you will focus on building and scaling foundational cloud, data, and AI infrastructure that supports machine learning workloads. You will be expected to demonstrate deep experience with GPU programming (CUDA, Triton, or OpenCL) and with optimizing deep learning workloads. You will collaborate with other engineers, share performance learnings, and mentor others. You will be encouraged to stay up to date with the latest in AI performance research and GPU architecture advancements.
Must have:
  • Deep experience with GPU programming (CUDA, Triton, or OpenCL)
  • Strong understanding of large language model architectures
  • Familiarity with memory management and kernel fusion
  • Experience with PyTorch internals or custom kernel development
  • Proficiency in Python and C++
  • Familiarity with inference acceleration frameworks
Perks:
  • High-Impact, Cross-Functional Work
  • Cutting-Edge GPU & LLM Optimization
  • Greenfield & Production-Scale Systems
  • Ownership & Growth
  • Engineering-Driven Culture
  • Remote work with quarterly trips to Sao Paulo
  • Top Tier Medical Insurance
  • Top Tier Dental and Vision Insurance
  • 20 days of time off, 14 company holidays
  • Life Insurance and AD&D
  • Extended maternity and paternity leaves
  • Nucleo - Our learning platform of courses
  • NuLanguage - Our language learning program
  • NuCare - Our mental health and wellness assistance program
  • 401K
  • Savings Plans - Health Savings Account and Flexible Spending Account

Job Details

About Nu

Nu is the world’s largest digital banking platform outside of Asia, serving over 105 million customers across Brazil, Mexico, and Colombia. The company has been leading an industry transformation by leveraging data and proprietary technology to develop innovative products and services. Guided by its mission to fight complexity and empower people, Nu caters to customers’ complete financial journey, promoting financial access and advancement with responsible lending and transparency. The company is powered by an efficient and scalable business model that combines low cost to serve with growing returns. Nu’s impact has been recognized in multiple awards, including Time 100 Companies, Fast Company’s Most Innovative Companies, and Forbes World’s Best Banks. Learn more: https://international.nubank.com.br/careers/

 

About the role

At Nubank, one of our engineering principles is "Leverage Through Platforms". We believe that platforms are a very efficient way of solving complex concerns that are needed for different products and teams.
The AI Infrastructure Squad within the AI Core BU builds and scales the foundational cloud, data, and AI infrastructure that powers machine learning workloads across the organization. We focus on performance, reliability, and scalability in AI systems - working on everything from training infrastructure to low-latency inference.


As a Software Engineer in the AI Core BU, you are expected to demonstrate:

  • Deep experience with GPU programming (CUDA, Triton, or OpenCL), with a focus on performance optimization for deep learning workloads.
  • Strong understanding of large language model architectures (e.g., Transformer variants) and experience profiling and tuning their performance.
  • Familiarity with memory management, kernel fusion, quantization, tensor parallelism, and GPU-accelerated inference.
  • Experience with PyTorch internals or custom kernel development for AI workloads.
  • Hands-on knowledge of low-level optimizations in training and inference pipelines, such as FlashAttention, fused ops, and mixed-precision computation.
  • Proficiency in Python and C++.
  • Familiarity with inference acceleration frameworks like TensorRT, DeepSpeed, vLLM, or ONNX Runtime.
Project Experience:
  • Demonstrated experience profiling and debugging GPU performance bottlenecks in LLM training or inference pipelines.
  • Experience optimizing large-scale ML workloads for throughput, latency, or cost, especially in production or research environments.
  • Experience contributing to or implementing custom GPU kernels for high-impact components (e.g., attention, normalization, or activation layers).
  • Proven ability to work across research and engineering teams to bridge model design and system performance.
  • Experience designing infrastructure that scales across hundreds or thousands of GPUs in cloud or on-prem clusters.

 

We’re looking for individuals who are passionate about pushing the boundaries of LLM inference and training performance. In this role, you’ll work in a fast-paced environment, helping to design and scale cutting-edge AI infrastructure. You’ll think like an owner, balancing engineering rigor with practical constraints to deliver impactful systems that support our most ambitious AI workloads.

You’ll collaborate closely with other engineers, share performance learnings across the team, and mentor others as we continuously evolve our platform. We value curiosity and a self-driven mindset — you’ll be encouraged to stay up to date with the latest in AI performance research, GPU architecture advancements, and open-source tooling.

 

What we have to offer

  • High-Impact, Cross-Functional Work – Collaborate with researchers, ML engineers, and infrastructure teams to design systems that support training and inference for the company’s most critical AI models.
  • Cutting-Edge GPU & LLM Optimization – Tackle core performance challenges in LLM serving and training. Dive deep into GPU internals, custom kernels, and distributed execution.
  • Greenfield & Production-Scale Systems – Build both new foundational components (e.g., custom ops, inference runtimes) and improve large-scale infrastructure already powering production AI workloads.
  • Ownership & Growth – Influence architecture, mentor others, and lead technical initiatives with autonomy and visibility.
  • Engineering-Driven Culture – Work in a team that values deep technical work, collaboration, and pragmatic innovation at the edge of AI systems performance.


Our Benefits

  • Remote work, with quarterly trips to Sao Paulo to build relationships with coworkers. 
  • Top Tier Medical Insurance
  • Top Tier Dental and Vision Insurance
  • 20 days of time off, 14 company holidays, and a great culture that emphasizes work-life balance.
  • Life Insurance and AD&D
  • Extended maternity and paternity leaves 
  • Nucleo - Our learning platform of courses
  • NuLanguage - Our language learning program
  • NuCare - Our mental health and wellness assistance program
  • 401K
  • Savings Plans - Health Savings Account and Flexible Spending Account



About The Company

Nubank was born in 2013 with the mission to fight against the complexity of the financial market to help our customers regain control of their financial lives. We have spent 11 years dedicated to bringing very simple ideas to places no one has ever taken them. For us, past success does not guarantee the future, which is why every day is “Day 1.” Being part of Nubank is embarking on a long-term journey where we know each challenge sparks creativity and innovation, where obstacles become opportunities to go a little further. Recently, we reached the milestone of 100 million customers globally, a significant achievement in our journey, but we know it wasn’t just the customers who chose us. We have over 8,000 Nubankers who choose to work with us daily.
