Senior MLOps Engineer - Security and Networking Research

3 Months ago • 5 Years + • DevOps

Job Summary

Job Description

NVIDIA seeks a Senior MLOps Engineer to build and maintain infrastructure for deploying security and networking AI models. Responsibilities include developing scalable infrastructure, designing data pipelines, optimizing ML models for performance and scalability, collaborating with data scientists and DevOps teams, implementing CI/CD pipelines, managing A/B testing, and building monitoring systems. The role requires expertise in ML frameworks (TensorFlow, PyTorch), cloud platforms, data processing tools (Spark, Hadoop), and CI/CD tools. The ideal candidate will have a strong security and networking background, possess proficiency in Python/Java/Scala, and be a detail-oriented problem solver.
Must have:
  • 5+ years ML model deployment experience
  • Proficiency in Python/Java/Scala
  • ML framework expertise (TensorFlow, PyTorch)
  • Cloud platform experience
  • Data processing tools (Spark, Hadoop)
  • CI/CD experience
  • Security and networking knowledge
Good to have:
  • Generative model serving experience
  • Vector database knowledge
  • Network protocol and Linux internals knowledge

Job Details

We're looking for a Senior MLOps Engineer to join a group that specializes in Security and Networking, in relation to ML/AI development. As a Senior MLOps Engineer, you’ll build and maintain the infrastructure, tools and processes necessary to support the machine learning and AI lifecycle in a production environment. You collaborate closely with data scientists, software engineers and devOps teams to ensure smooth deployment, modeling and optimization of AI models. This role involves creative problem solving alongside engineering teams, and is pivotal for the continued success of AI networking security. 

What you’ll be doing:

  • Developing, improving and optimizing scalable infrastructure for handling and deploying security and networking AI models in production, ensuring high availability, scalability, performance. 

  • Designing and implementing data pipelines to efficiently process and transform large volumes of data for training and inference purposes.

  • Optimizing and fine-tuning ML models for performance, scalability, and resource utilization, considering factors such as latency, efficiency, and cost.

  • Collaborating closely with data scientists and software engineers to operationalize and deploy ML models, including versioning, packaging and integration with existing systems. Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews.

  • Collaborating with DevOps teams to integrate pipelines and workflows into the CI/CD process, ensuring flawless deployments and rollbacks.

  • Implementing and managing A/B testing frameworks.

  • Building and maintaining monitoring and alerting systems to proactively identify and resolve issues relating to quality, performance and infrastructure.

  • Implementing access controls, authentication mechanisms, and encryption standards for ML models and data.

  • Documenting guidelines, and standard operating procedures for MLOps processes and sharing knowledge with the wider team.

  • Develop proof-of-concepts for new features

What we need to see:

  • BS/MSc in CS/CE or related field (or equivalent experience)

  • Strong background in machine learning with a track record of deploying and maintaining models in production - at least 5 years of experience.

  • Proficiency in programming languages such as Python, Java, or Scala, along with experience in using ML frameworks and libraries (e.g. TensorFlow, PyTorch).

  • Proficiency in microservices architecture, container orchestration, and cloud platforms for deploying and scaling ML applications.

  • Knowledge of inference optimization techniques.

  • Experience with tools for data processing and storage (e.g. Apache Spark, Hadoop, SQL databases, NoSQL databases). 

  • Understanding of build infrastructure and CI/CD tools and practices (e.g. Jenkins)

  • Detail-oriented and care deeply about robust, well tested, high-performance code in production environments.

  • You are proactive, take full ownership of your deliverables, have a can-do approach, and excellent communication and collaboration skills, able to work effectively in multifunctional teams. 

Ways to stand out from the crowd:

  • Knowledge of network protocols and Linux internals

  • Security and networking background, with knowledge of security protocols, network architectures, firewalls, intrusion detection systems, and other relevant security and networking concepts

  • Familiarity with generative models and their serving

  • Experience with vector databases, similarity search and reranking algorithms

  • Knowledge of network security principles and practices

NVIDIA has some of the most forward-thinking and hardworking people on the planet working for us and, due to unprecedented growth, our special engineering teams are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want to hear from you.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Bungie - Senior Infrastructure Engineer

Bungie

United States (Hybrid)
1 Month ago
SuperPlay - Middle Server Developer

SuperPlay

Poland (On-Site)
3 Months ago
Infoblox - Commercial Account Executive II - Japan

Infoblox

Tokyo, Japan (On-Site)
6 Months ago
NVIDIA - Senior Production Engineer - Storage

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
Xsolla - Director of Development (Dev Director)

Xsolla

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
3 Months ago
Assystems - DevOps Engineer

Assystems

Gurugram, Haryana, India (On-Site)
5 Months ago
GoTo Group - Lead Software Engineer - Engineering Platforms

GoTo Group

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Info Stretch - Senior Engineer

Info Stretch

Mumbai, Maharashtra, India (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

STAGE - Creative Content Manager - Series

STAGE

Noida, Uttar Pradesh, India (On-Site)
7 Months ago
Sonar Source - Major Account Manager - DACH

Sonar Source

London, England, United Kingdom (On-Site)
4 Months ago
SparkCognition - DevOps Engineer

SparkCognition

Bengaluru, Karnataka, India (On-Site)
7 Months ago
PwC - Senior Cloud & Digital Consultant - Financial Sector

PwC

Amsterdam, North Holland, Netherlands (On-Site)
6 Months ago
Google - Senior Software Engineer, Google Cloud

Google

Pune, Maharashtra, India (On-Site)
5 Months ago
Google - Staff Software Engineer, Security/Privacy, Google Cloud Security and Privacy

Google

Kirkland, Washington, United States (On-Site)
5 Months ago
NVIDIA - Principal Software Architect, GPU Networking Research

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
NVIDIA - Customization and Verification Manager

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
ByteDance - Backend Software Engineer - Global E-Commerce Supply Chain Merchant Platform

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Yokne'am Illit, North District, Israel

PLAYSTUDIOS - Marketing Data Engineer

PLAYSTUDIOS

Tel Aviv District, Israel (On-Site)
2 Months ago
SuperPlay - 3D Animator Disney Solitaire

SuperPlay

Tel Aviv District, Israel (On-Site)
3 Months ago
SciPlay - Art Direction Lead

SciPlay

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
PAPAYA - Customer Support Agent

PAPAYA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
Ludeo - Senior Front End Engineer

Ludeo

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
NVIDIA - Interconnect Hardware Test Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
Plarium - Survey Researcher

Plarium

Herzliya, Tel Aviv District, Israel (On-Site)
2 Months ago
SuperPlay - 2D ARTIST

SuperPlay

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
5 Months ago
NVIDIA - Senior Project Manager, ICPE

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
NVIDIA - Physical Design Power Optimization Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Luxoft - Automation Architect

Luxoft

Bengaluru, Karnataka, India (On-Site)
4 Months ago
The Walt Disney Company - Principal Software Engineer

The Walt Disney Company

Bristol, Connecticut, United States (On-Site)
2 Months ago
Take-Two Interactive - Senior Systems Engineer

Take-Two Interactive

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Meta - Production Engineer

Meta

Dublin, County Dublin, Ireland (On-Site)
5 Months ago
Metacore - DevOps Advocate

Metacore

Helsinki, Uusimaa, Finland (Hybrid)
5 Months ago
Wargaming - Senior Infrastructure Engineer (Python) (Game Engine Development Team)

Wargaming

Belgrade, Serbia (Hybrid)
4 Months ago
Buckman - Senior Lead Digital Innovation Engineer - Solution Architect

Buckman

Chennai, Tamil Nadu, India (On-Site)
5 Months ago
Kwalee - DevOps Engineer

Kwalee

Royal Leamington Spa, England, United Kingdom (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug