Sr. Engineer- Machine Learning Platform (Remote, IND)

Crowd Strick

Job Summary

CrowdStrike is seeking an exceptional Senior ML Platform Engineer to join their elite team. This role involves building next-generation machine learning infrastructure to power CrowdStrike's threat detection and prevention capabilities. The engineer will design and implement enterprise-scale ML infrastructure, architect high-performance model serving solutions, and build robust, scalable systems for model training, serving, and inferencing, processing data at unprecedented scale. The position requires strong expertise in distributed systems and MLOps.

Must Have

  • Design and implement enterprise-scale ML infrastructure
  • Architect high-performance model serving solutions
  • Build robust, scalable systems for model training, serving and inferencing
  • Write clean code in any programming language (preferably Python)
  • Develop automated ML pipelines using state-of-the-art tools
  • Implement sophisticated monitoring and observability solutions
  • Optimize resource utilization across large scale distributed computing environments
  • Design fault-tolerant, highly available ML systems using Airflow or MLflow, HPC (SLURM)/GPU
  • 10+ years of software engineering experience with distributed systems
  • 4+ years of hands-on experience building ML platforms
  • Proven experience with Kubernetes, containerization, and cloud platforms
  • Strong background in performance optimization and scalability
  • Experience with Jupyter Hub, MLflow, or similar ML platforms

Good to Have

  • Contributions to open-source ML infrastructure projects
  • Experience with real-time, high-throughput inference systems
  • Track record of leading technical initiatives
  • Experience with large-scale data processing systems

Perks & Benefits

  • Remote-friendly and flexible work culture
  • Market leader in compensation and equity awards
  • Comprehensive physical and mental wellness programs
  • Competitive vacation and holidays for recharge
  • Paid parental and adoption leaves
  • Professional development opportunities for all employees
  • Employee Networks, geographic neighborhood groups, and volunteer opportunities
  • Vibrant office culture with world class amenities
  • Great Place to Work Certified™ across the globe

Job Description

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day and this traffic is growing daily. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

About the Role:

We're seeking an exceptional Senior ML Platform Engineer to join our elite team in building next-generation machine learning infrastructure. You'll be at the forefront of developing scalable ML platforms that power CrowdStrike's threat detection and prevention capabilities, processing data at unprecedented scale.

What You'll Do:

Architecture & Development

  • Design and implement enterprise-scale ML infrastructure.
  • Architect high-performance model serving solutions handling millions of predictions per second
  • Build robust, scalable systems for model training, serving and inferencing.
  • Ability to write clean code in any programming language (Preferably Python)
  • Lead technical decisions for critical ML platform components

MLOps & Infrastructure

  • Develop automated ML pipelines using “State of art” tools
  • Implement sophisticated monitoring and observability solutions
  • Optimize resource utilization across large scale distributed computing environments
  • Design fault-tolerant, highly available ML systems using Airflow or MLflow, HPC (SLURM)/ GPU.

What You'll Need:

  • 10+ years of software engineering experience with distributed systems
  • 4+ years of hands-on experience building ML platforms
  • Proven experience with Kubernetes, containerization, and cloud platforms
  • Strong background in performance optimization and scalability
  • Experience with Jupyter Hub, MLflow, or similar ML platforms

Technical Expertise:

  • Distributed Systems: Ray, Kubernetes, Docker or similar large scale distributed systems
  • ML Platforms: MLflow, Kubeflow, JupyterHub or any similar systems
  • Infrastructure: AWS/GCP/Azure/OCI
  • Observability: Prometheus, Grafana
  • CI/CD: GitLab, Jenkins

What Sets You Apart:

  • Contributions to open-source ML infrastructure projects
  • Experience with real-time, high-throughput inference systems
  • Track record of leading technical initiatives
  • Experience with large-scale data processing systems

Why CrowdStrike: https://www.crowdstrike.com/en-us/careers/blog/employee-experience-rose-steingart/

Impact:

Your work will directly strengthen CrowdStrike's core mission of stopping breaches by building and scaling the infrastructure that powers our next-generation ML capabilities. Join us in protecting our customers and shaping the future of cybersecurity.

#LI-DP1

#LI-Remote

Benefits of Working at CrowdStrike:

  • Remote-friendly and flexible work culture
  • Market leader in compensation and equity awards
  • Comprehensive physical and mental wellness programs
  • Competitive vacation and holidays for recharge
  • Paid parental and adoption leaves
  • Professional development opportunities for all employees regardless of level or role
  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
  • Vibrant office culture with world class amenities
  • Great Place to Work Certified™ across the globe

CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program.

CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements.

If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

14 Skills Required For This Role

Talent Acquisition Game Texts Gitlab Aws Azure Model Serving Prometheus Grafana Ci Cd Docker Kubernetes Python Jenkins Machine Learning

Similar Jobs