Principal Software Engineer - MLOps Platform

8+ Years • Research Development

Job Summary

Job Description

Autodesk is seeking an experienced Principal Software Engineer for the AI/ML Platform (AMP) team in Pune, India. This role involves designing, implementing, and managing software systems for the full ML development lifecycle, including model development studio, feature store, model serving, and observability. The ideal candidate will have a strong background in MLOps, data engineering, and DevOps, with expertise in building high-scale deployment architectures and observability solutions across multi-cloud environments. Responsibilities include system design, mentoring, and ensuring end-to-end ownership of components.
Must have:
  • Design, implement and manage software systems for the AI/ML Platform.
  • Orchestrate the full ML development lifecycle for partner teams.
  • Define multi-cloud architecture components leveraging AWS and Azure.
  • Build systems for monitoring AI/ML model performance, including drift detection, data fairness/bias, and anomalies.
  • Develop tools for building and deploying ML artifacts in production environments.
  • Automate and orchestrate tasks related to managing big data transformation and processing.
  • Build large-scale data stores for ML artifacts.
  • Design and implement low-latency, scalable prediction and inference services.
  • Collaborate across diverse teams including machine learning researchers, developers, product managers, software architects, and operations.
  • Take end-to-end ownership of components, including design, architecture, implementation, rollout, onboarding support, production on-call support, testing/verification, and investigations.
  • Bachelor’s degree in Computer Science or equivalent practical experience.
  • Over 8 years of experience in software development and engineering, delivering production systems and services.
  • Prior experience working on an MLOps team at the intersection of ML model deployment, DevOps, and data engineering.
  • Ability to fluently translate designs into high-quality code in Go, Python, or Java.
  • Knowledge of DevOps practices and of containerization and orchestration tools, such as CI/CD, Terraform, Docker, Kubernetes, and GitOps.
  • Demonstrated knowledge of distributed data processing frameworks, orchestrators, and data lake architectures, using technologies such as Spark, Airflow, and the Iceberg and Parquet formats.
  • Prior collaboration with data science teams to deploy their models and set up ML observability for inference-level monitoring.
  • Knowledge of AWS and/or Azure for architecting large-scale application deployments.
  • Excellent communication and collaboration skills.
Good to have:
  • Experience integrating with third-party vendors.
  • Experience in latency optimization with the ability to diagnose, tune, and enhance the efficiency of serving systems.
  • Familiarity with tools and frameworks for monitoring and managing the performance of AI/ML models in production (e.g., MLflow, Kubeflow, TensorBoard).
  • Familiarity with distributed model training/inference pipelines using KubeRay or an equivalent.
  • Exposure to leveraging GPU computing for AI/ML workloads, including experience with CUDA, OpenCL, or other GPU programming tools.
  • Exposure to ML libraries such as PyTorch, TensorFlow, XGBoost, pandas, and scikit-learn.
  • Exposure to building RAG-based applications in collaboration with product teams, data scientists, and AI engineers.
Perks:
  • Competitive compensation package
  • Annual cash bonuses
  • Stock grants
  • Comprehensive benefits package

Job Details

Job Requisition ID: 25WD85491

Position Overview

We are looking for an experienced Principal Software Engineer to join our platform team focusing on the AI/ML Platform (AMP). This team builds and maintains central components, such as the model development studio, feature store, model serving, and model observability, that fast-track the development of new ML/AI models. The ideal candidate has a background in MLOps, data engineering, and DevOps, with experience building high-scale deployment architectures and observability. As an important contributor to our engineering team, you will help shape the future of our AI/ML capabilities, delivering solutions that create value for our organization.

You will report directly to an Engineering Manager, and you will be based in Pune.

Responsibilities

  • System design: You will design, implement and manage software systems for the AI/ML Platform and orchestrate the full ML development lifecycle for the partner teams
  • Mentoring: You will share your knowledge and best practices and conduct design reviews to raise the team's level of expertise
  • Multi-cloud architecture: Define components that leverage the strengths of multiple cloud platforms (e.g., AWS, Azure) to optimize performance, cost, and scalability
  • AI/ML observability: You will build systems that monitor the performance of AI/ML models and surface insights about the underlying data, such as drift, data fairness/bias, and anomalies
  • ML Solution Deployment: You will develop tools for building and deploying ML artifacts in production environments and facilitate a smooth transition from development to deployment
  • Big Data Management: Automate and orchestrate tasks related to managing big data transformation and processing and build large-scale data stores for ML artifacts
  • Scalable Services: Design and implement low-latency, scalable prediction and inference services to support the diverse needs of our users
  • Cross-Functional Collaboration: Collaborate across diverse teams, including machine learning researchers, developers, product managers, software architects, and operations, fostering a collaborative and cohesive work environment
  • End-to-end ownership: You will take end-to-end ownership of components and work with other engineers on the team across design, architecture, implementation, rollout and onboarding support for partner teams, production on-call support, testing/verification, and investigations

Minimum Qualifications

  • Educational Background: Bachelor’s degree in Computer Science or equivalent practical experience
  • Experience: Over 8 years of experience in software development and engineering, delivering production systems and services
  • Prior experience working on an MLOps team at the intersection of ML model deployment, DevOps, and data engineering
  • Hands-on skills: Ability to fluently translate designs into high-quality code in Go, Python, or Java
  • Knowledge of DevOps practices and of containerization and orchestration tools, such as CI/CD, Terraform, Docker, Kubernetes, and GitOps
  • Demonstrated knowledge of distributed data processing frameworks, orchestrators, and data lake architectures, using technologies such as Spark, Airflow, and the Iceberg and Parquet formats
  • Prior collaboration with data science teams to deploy their models and set up ML observability for inference-level monitoring
  • Exposure to building RAG-based applications in collaboration with product teams, data scientists, and AI engineers
  • Demonstrated creative problem-solving skills, with the ability to break down problems into manageable components
  • Knowledge of AWS and/or Azure for architecting large-scale application deployments
  • Excellent communication and collaboration skills, fostering teamwork and effective information exchange

Preferred Qualifications

  • Experience integrating with third-party vendors
  • Experience in latency optimization with the ability to diagnose, tune, and enhance the efficiency of serving systems
  • Familiarity with tools and frameworks for monitoring and managing the performance of AI/ML models in production (e.g., MLflow, Kubeflow, TensorBoard)
  • Familiarity with distributed model training/inference pipelines using KubeRay or an equivalent
  • Exposure to leveraging GPU computing for AI/ML workloads, including experience with CUDA, OpenCL, or other GPU programming tools, to significantly enhance model training and inference performance
  • Exposure to ML libraries such as PyTorch, TensorFlow, XGBoost, pandas, and scikit-learn

About Autodesk

Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.

We take great pride in our culture here at Autodesk – it’s at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.

When you’re an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!

Salary transparency

Salary is one part of Autodesk’s competitive compensation package. Offers are based on the candidate’s experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.

Diversity & Belonging

We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging

Are you an existing contractor or consultant with Autodesk?

Please search for open jobs and apply internally (not on this external site).

About The Company

Autodesk is changing how the world is designed and made. Our technology spans architecture, engineering, construction, product design, manufacturing, media, and entertainment, empowering innovators everywhere to solve challenges big and small. From greener buildings to smarter products to more mesmerizing blockbusters, Autodesk software helps our customers to design and make a better world for all. Over 100 million people use Autodesk software like AutoCAD, Revit, Maya, 3ds Max, Fusion 360, SketchBook, and more to unlock their creativity and solve important design, business and environmental challenges. Our software runs on both personal computers and mobile devices and taps the infinite computing power of the cloud to help teams around the world collaborate, design, simulate and fabricate their ideas in 3D. We provide exceptional compensation/benefit packages and we’d love for you to join us. We’re proud to be an equal opportunity employer and we consider all qualified applicants without regard to race, gender, disability, veteran status or other protected category. To see our culture in action, check out #AutodeskLife. We are headquartered in the San Francisco Bay Area and have more than 10,000 employees worldwide.