Sr. Engineer ML Platform

1 Month ago • 10 Years +
Research Development

Job Description

As an engineer in this team, you will play an integral role as we build out our ML Platform & GenAI Studio from the ground up. You will collaborate closely with Data Platform Software Engineers, Data Scientists & Threat Analysts to design, implement, and maintain scalable ML pipelines for Data Preparation, Cataloguing, Feature Engineering, Model Training, and Model Serving that influence critical business decisions. You’ll be a key contributor in a production-focused culture that bridges the gap between model development and operational success. Future plans include generative AI investments for use cases such as modelling attack paths for IT assets.
Good To Have:
  • Java/Scala exposure
  • Go
  • Iceberg
  • Pinot or other time-series/OLAP-style database
  • Jenkins
  • Parquet
  • Protocol Buffers/GRPC
Must Have:
  • Help design, build and facilitate adoption of a modern ML platform including support for use cases like GenAI
  • Understand current ML workflows, anticipate future needs and identify common patterns and exploit opportunities to templatize into repeatable components for model development, deployment, and monitoring
  • Build a platform that scales to thousands of users and offers self-service capability to build ML experimentation, training and inference pipelines
  • Leverage workflow orchestration tools to deploy efficient and scalable execution of complex data and ML pipelines
  • Champion software development best practices around building distributed systems
  • Leverage cloud services like Kubernetes, blob storage, and queues in our cloud first environment
  • Develop and deploy machine learning solutions to production
  • Experience with ML Platform tools like Jupyter Notebooks, NVidia Workbench, MLFlow, Ray
  • Experience building data platform product(s) or features with Apache Spark, Flink or comparable tools
  • Proficiency in distributed computing and orchestration technologies (Kubernetes, Airflow)
  • Production experience with infrastructure-as-code tools such as Terraform, FluxCD
  • Expert level experience with Python
  • Expert level experience with containerization frameworks
  • Strong analytical and problem solving skills
  • Exceptional interpersonal and communication skills
Perks:
  • Remote-friendly and flexible work culture
  • Market leader in compensation and equity awards
  • Comprehensive physical and mental wellness programs
  • Competitive vacation and holidays for recharge
  • Paid parental and adoption leaves
  • Professional development opportunities for all employees regardless of level or role
  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
  • Vibrant office culture with world class amenities
  • Great Place to Work Certified™ across the globe

Add these skills to join the top 1% applicants for this job

communication
game-texts
protocol-buffers
model-serving
terraform
spark
kubernetes
python
scala
jenkins
java
machine-learning

About the Role:

As an engineer in this team, you will play an integral role as we build out our ML Platform & GenAI Studio from the ground up. Since the launch of ChatGPT, # Phishing attacks has increased by 138% and hence the ML platform is a critical capability for Crowdstrike in its fight against bad actors. For this mission we are building a team in Bangalore. You will collaborate closely with Data Platform Software Engineers, Data Scientists & Threat Analysts to design, implement, and maintain scalable ML pipelines that will be used for Data Preparation, Cataloguing, Feature Engineering, Model Training, and Model Serving that influence critical business decisions. You’ll be a key contributor in a production-focused culture that bridges the gap between model development and operational success. Future plans include generative AI investments for use cases such as modelling attack paths for IT assets.

Location: Bangalore

Candidates must be comfortable to visit office once a week

What You’ll Do:

  • Help design, build and facilitate adoption of a modern ML platform including support for use cases like GenAI
  • Understand current ML workflows, anticipate future needs and identify common patterns and exploit opportunities to templatize into repeatable components for model development, deployment, and monitoring
  • Build a platform that scales to thousands of users and offers self-service capability to build ML experimentation, training and inference pipelines
  • Leverage workflow orchestration tools to deploy efficient and scalable execution of complex data and ML pipelines
  • Champion software development best practices around building distributed systems
  • Leverage cloud services like Kubernetes, blob storage, and queues in our cloud first environment

What You’ll Need:

  • B.S. /MS in Computer Science or a related field and 10+ years related experience; or M.S. with 8+ years of experience;
  • 3+ years experience developing and deploying machine learning solutions to production. Familiarity with typical machine learning workflows from an engineering perspective (how they are built and used, not necessarily the theory); familiarity with supervised / unsupervised approaches: how, why, and when and labelled data is created and used
  • 3+ years experience with ML Platform tools like Jupyter Notebooks, NVidia Workbench, MLFlow, Ray etc.
  • Experience building data platform product(s) or features with (one of) Apache Spark, Flink or comparable tools
  • Proficiency in distributed computing and orchestration technologies (Kubernetes, Airflow, etc.)
  • Production experience with infrastructure-as-code tools such as Terraform, FluxCD
  • Expert level experience with Python; Java/Scala exposure is recommended. Ability to write Python interfaces to provide standardized and simplified interfaces for data scientists to utilize internal Crowdstrike tools
  • Expert level experience with containerization frameworks
  • Strong analytical and problem solving skills, capable of working in a dynamic environment
  • Exceptional interpersonal and communication skills. Work with stakeholders across multiple teams and synthesize their needs into software interfaces and processes.

Bonus Points:

Critical Skills Needed for Role:

  • Distributed Systems Knowledge
  • Data/ML Platform Experience

Experience with the Following is Desirable:

  • Go
  • Iceberg (highly desirable)
  • Pinot or other time-series/OLAP-style database
  • Jenkins
  • Parquet
  • Protocol Buffers/GRPC

Set alerts for more jobs like Sr. Engineer ML Platform
Set alerts for new jobs by Crowd Strick
Set alerts for new Research Development jobs in India
Set alerts for new jobs in India
Set alerts for Research Development (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙