The Senior Data Scientist role focuses on Data Engineering, Workflow Orchestration, and MLOps. Responsibilities include working with cloud platforms like GCP, AWS, and Azure, utilizing orchestration tools such as Kubernetes, and deployment tools like Spinnaker and Helm Charts. The role requires expertise in data processing with Apache Spark/Beam, workflow orchestration with Google Cloud Composer/Airflow, and MLOps practices including model deployment, monitoring, versioning, experiment tracking, and implementing CI/CD pipelines for machine learning models.
Must Have:- Experience with Google Cloud Platform (GCP), AWS, or Microsoft Azure.
- Proficiency in Kubernetes for orchestration.
- Experience with Spinnaker or Helm Charts for deployment.
- Knowledge of Apache Spark or Apache Beam for data pipelines.
- Familiarity with Google Cloud Composer or Airflow for workflow orchestration.
- Experience with MLOps tools like TensorFlow, Serving TorchServe, MLflow, DVC, or TFX.
- Ability to implement CI/CD pipelines specifically for machine learning models.
- Understanding of distributed training, resource scaling, and automated model retraining.
- Experience with DevOps and CI/CD tools like GitHub Actions or TeamCity.