Engineering Farm Engineer

2 Months ago • 8 Years + • Software Development & Engineering

Job Summary

Job Description

The Engineering Farm Engineer at NVIDIA is responsible for architecting and maintaining solutions for large compute clusters, ensuring efficient operation and improving user experience for both customers and internal engineers. This role involves automating tasks, performance tuning, and proactive identification of potential outages. Responsibilities include maintaining server infrastructure, scaling systems sustainably, resolving production issues, conducting blameless postmortems, supporting on-call weeks, and designing solutions using efficient algorithms and regular SDLC processes. The ideal candidate will have strong experience in software design, algorithms, data structures, and one or more languages such as Python, Perl, Go, or Ruby, along with a systematic problem-solving approach and excellent communication skills.
Must have:
  • 8+ years experience in CS or related field
  • SW Design, Algorithms, Data Structures
  • Experience with Python, Perl, Go, or Ruby
  • SQL & NoSQL database understanding
  • System-level problem-solving skills
Good to have:
  • Experience with LSF and SLURM
  • Linux administration and automation
  • Experience mentoring junior engineers
  • Experience architecting scalable tools

Job Details

For two decades, we have pioneered visual computing, the art and science of computer graphics. With our invention of the GPU - the engine of modern visual computing - the field has expanded to encompass video games, movie production, product design, medical diagnosis, and scientific research. Today, we stand at the beginning of the next era, the AI computing era, ignited by a new computing model, GPU deep learning. This new model - where deep neural networks are trained to recognize patterns from massive amounts of data - has shown to be deeply effective at solving some of the most complex problems in everyday life.

Engineering Farm Engineer is responsible for architecting solutions around our large compute cluster to make it work efficiently and improve the user experience for customers as well as engineers supporting the cluster.  Much of our SW engineering work focuses on eliminating manual work through automation, performance tuning, and growing the efficiency of production systems. Practices such as limiting time spent on reactive operational work, blameless postmortems, and proactive identification of potential outages factor into iterative improvement that is key to product quality and interesting and dynamic day-to-day work.  We promote self-direction to work on meaningful projects, while we also strive to build an environment that provides the support and mentorship needed to learn and grow.

What you will be doing:

  • Maintain server infrastructure and services once they are live by measuring and monitoring availability, latency, and overall system health.

  • Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.

  • Work with different SMEs and help provide quality resolution to the production issues to the customer

  • Practice sustainable incident response and blameless postmortems.

  • Understand complex and vast infrastructure and support it during on-call weeks

  • Independently Architect and design solutions with SW engineering approach using the right and efficient algorithms, implemented with regular SDLC process that includes requirements gathering, SW design, testing, deployment, & release.  

  • Support large-scale server infrastructure with monitoring, logging, and alerting with promised uptime.

  • Engage in and improve the whole lifecycle of services—from inception and design through deployment, operation, and refinement.

  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity management, and launch reviews.

What we need to see:

  • BS degree with 8+ years of experience in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.

  • Experience with SW Design, Algorithms, data structures, and software design.

  • Experience in one or more of the following: Python, Perl, Go, or Ruby using an Object-oriented approach.

  • Experience in mentoring junior engineers or leading a team.

  • Basic understanding of SQL & NoSQL Data platforms, database queries, and data analysis.

  • Interest in crafting, analyzing, and fixing large-scale distributed systems.

  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.

  • Ability to debug and optimize code and automate routine tasks.

  • Ability to learn quickly and adapt to different platforms as per the needs of the project.

Ways to stand out of the crowd:

  • Demonstrated experience with architecting and building scalable and maintainable tools following SW best practices

  • Demonstrated experience with leading a project from inception to completion along with significant independent contribution

  • Good hands-on experience with schedulers like LSF and SLURM

  • Good understanding of Linux Administration or done automation around it

  • Experience in debugging infrastructure or UNIX-related issues

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Ansys - Lead R&D Engineer - EDA

Ansys

Noida, Uttar Pradesh, India (On-Site)
1 Month ago
tic toe games - Game Producer

tic toe games

Burbank, California, United States (On-Site)
2 Months ago
NC America llc - Customer Service Lead

NC America llc

Irvine, California, United States (On-Site)
2 Months ago
London stock Exchange - Sales Order Specialist

London stock Exchange

Beijing, China (On-Site)
3 Weeks ago
Tesla - Sales Trainer

Tesla

Berlin, Berlin, Germany (On-Site)
4 Months ago
eBay - MTS 2, Software Engineer

eBay

San Jose, California, United States (On-Site)
4 Weeks ago
Rockstar Games - Software Engineer (GO)

Rockstar Games

Leeds, England, United Kingdom (On-Site)
1 Month ago
Alphawave Semi - Senior Design Verification Engineer (HSI- High Speed Interfaces)

Alphawave Semi

Toronto, Ontario, Canada (On-Site)
1 Month ago
LTI Mindtree - Specialist - Software Engineering

LTI Mindtree

Chennai, Tamil Nadu, India (On-Site)
2 Months ago
Apple - Software Integrity Engineer

Apple

San Diego, California, United States (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Sandsoft Games - Senior Marketing Artist

Sandsoft Games

Riyadh, Riyadh Province, Saudi Arabia (Remote)
1 Month ago
OKX - Senior Agent, Customer Service

OKX

Istanbul, İstanbul, Türkiye (On-Site)
1 Month ago
Zuru - Quality Control Manager – Pet food

Zuru

Chon Buri, Thailand (On-Site)
3 Months ago
legion - Senior Software Engineer, Backend

legion

Bucharest, Bucharest, Romania (Hybrid)
2 Weeks ago
sound cloud - Senior Executive Assistant

sound cloud

New York, United States (On-Site)
4 Weeks ago
OKX - Manager, Customer Service

OKX

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
1 Month ago
Philips - Long Term Resource Planner 2 - ServiceMax

Philips

Bothell, Washington, United States (On-Site)
1 Month ago
Interactive Brokers - Information Security Controls Manager

Interactive Brokers

Greenwich, Connecticut, United States (Hybrid)
1 Month ago
SciPlay - Senior Software Engineer

SciPlay

Cedar Falls, Iowa, United States (Hybrid)
5 Months ago
Unity - Customer Experience Advisor

Unity

Tokyo, Japan (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

Capgemini - GL Accounting

Capgemini

Pune, Maharashtra, India (On-Site)
1 Month ago
PwC - Associate - Mumbai Shivaji Park - Technology Consulting

PwC

Mumbai, Maharashtra, India (On-Site)
9 Months ago
InMobiInMobi - Product Manager

InMobiInMobi

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Capgemini - Automation Engineer

Capgemini

Chennai, Tamil Nadu, India (On-Site)
1 Month ago
Glean - Software Engineer, Machine Learning (India)

Glean

Bengaluru, Karnataka, India (On-Site)
8 Months ago
Assystems - Tunnel Design Engineer

Assystems

Mumbai, Maharashtra, India (On-Site)
8 Months ago
Zscaler - Staff Application Security Engineer

Zscaler

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Cubic corporation - Software Architect

Cubic corporation

Hyderabad, Telangana, India (On-Site)
2 Weeks ago
Ethos Life - Senior Frontend Engineer

Ethos Life

Bengaluru, Karnataka, India (On-Site)
1 Month ago
DMG - Senior Staff Engineer

DMG

Bengaluru, Karnataka, India (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Software Development & Engineering Jobs

AECOM - Senior Resident Engineer V - Transportation

AECOM

Murray, Utah, United States (On-Site)
1 Week ago
The Walt Disney Company - Lead Software Engineer - Ad Platforms

The Walt Disney Company

Glendale, California, United States (On-Site)
2 Months ago
Alpha Sense - Technical Support Engineer

Alpha Sense

United States (Remote)
1 Month ago
Valve corporation - Mechanical Engineer

Valve corporation

Bellevue, Washington, United States (On-Site)
6 Months ago
Tesla - Manufacturing Engineer, Battery Cell

Tesla

Brandenburg, Germany (On-Site)
4 Months ago
Nordson Corporation - Senior Engineer, Electrical

Nordson Corporation

Amherst, Ohio, United States (On-Site)
1 Month ago
The Walt Disney Company - Senior Software Engineer - Activation

The Walt Disney Company

Santa Monica, California, United States (On-Site)
2 Months ago
Razer - Software Engineer

Razer

Shah Alam, Selangor, Malaysia (On-Site)
2 Weeks ago
Google - Software Engineer, People with Disabilities

Google

(On-Site)
7 Months ago
Inveniolsi - SAP BODS Consultant

Inveniolsi

Mumbai, Maharashtra, India (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Pune, Maharashtra, India (On-Site)

Taipei City, Taiwan (On-Site)

Beijing, Beijing, China (On-Site)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug