Senior DevOps Engineer (Brahma)

5 Minutes ago • 6 Years +
Devops

Job Description

At Brahma, we are building the future of AI-powered creative experiences, combining visual effects expertise with generative AI to empower enterprises and creators. This role involves owning the build, deploy, and runtime reliability across Brahma’s hybrid estate, delivering secure, scalable infrastructure for Gen AI based workflows and products. The Senior DevOps Engineer will partner with product and research teams to innovate and ship fast.
Good To Have:
  • Model serving stacks and GPU telemetry/optimization.
  • On-prem operations for GPU/CPU fleets.
  • HPC/VFX pipeline exposure; render farms; real-time engines.
  • Storage systems (S3/MinIO, Ceph/Lustre/NFS), CDN, caching strategies.
  • Messaging/streaming (Kafka) and workflow/orchestration (Argo, Airflow).
Must Have:
  • Design, implement, and operate Slurm and Kubernetes platforms.
  • Build CI/CD pipelines for services, model training, and serving.
  • Implement Infrastructure as Code with Terraform/Terragrunt.
  • Design and implement observability stacks; drive incident response.
  • Secure the stack with least privilege and secrets management.
  • Operate model-serving infrastructure; optimise GPU utilisation.
  • Drive cost visibility and efficiency; forecast capacity.
  • 6+ years in DevOps/SRE/Platform roles running production systems.
  • Expert with Kubernetes and containers.
  • Strong with Terraform and configuration management (Ansible).
  • Proficient with CI/CD (GitHub Actions/GitLab), release strategies.
  • Experience with observability in production (Prometheus/Grafana).
  • Linux mastery, shell scripting, and Python.
  • Cloud proficiency (AWS/GCP/Azure) and Security fundamentals.
  • Experience with data/media-heavy workloads or ML pipelines.

Add these skills to join the top 1% applicants for this job

real-time-vfx
github
game-texts
storytelling
gitlab
networking
incident-response
linux
aws
azure
model-serving
prometheus
ansible
terraform
grafana
ci-cd
kubernetes
python
shell
github-actions

Description

Position at DNEG

At Brahma, we are building the future of AI-powered creative experiences. Our team of engineers, researchers and creative technologists is building the operating system for the future of ethical digital storytelling. We combine award-winning visual effects expertise with cutting-edge generative AI to create tools that empower enterprises and creators to produce high-quality content at scale.

Key Purpose of the Job

Own build, deploy, and runtime reliability across Brahma’s hybrid estate. Deliver secure, scalable infrastructure for Gen AI based workflows and products across hybrid environments. Partner with infrastructure and multidisciplinary product and research teams to help them innovate and ship fast.

Key Responsibilities

  • Design, implement, and operate Slurm and Kubernetes-based platforms across cloud and on-prem GPU nodes, including autoscaling, rollout strategies, and multi-cluster operations.
  • Build CI/CD pipelines for services, model training, and model serving; standardise artifact/version management and environment promotion.
  • Implement Infrastructure as Code with Terraform/Terragrunt and configuration management; enforce drift detection and repeatable environments.
  • Design and implement observability stacks (metrics, logs, tracing); drive incident response and postmortems.
  • Secure the stack with least privilege, secrets management, network policy, and hardened baselines; support ISO/MPA controls with the security team.
  • Operate model-serving infrastructure for real-time and batch workloads; optimise GPU utilisation, concurrency, and latency.
  • Drive cost visibility and efficiency across compute, storage, and egress; forecast capacity and plan lifecycle of hardware and licenses.

Must Haves

  • 6+ years in DevOps/SRE/Platform roles running production systems.
  • Expert with Kubernetes and containers (runtime, scheduling, networking, autoscaling).
  • Strong with Terraform and at least one configuration management tool (Ansible preferred).
  • CI/CD (GitHub Actions [preferred] / GitLab), release strategies, and artifact registries.
  • Observability in production (Prometheus/Grafana preferred).
  • Linux mastery, shell scripting, and a high-level language (Python preferred).
  • Cloud proficiency (AWS/GCP/Azure) and Security fundamentals: IAM, secrets management, network segmentation, image provenance.
  • Experience with data-/media-heavy workloads or ML pipelines in production.
  • Location: EU/UK time zones (±2h).

Nice to Have

  • Model serving stacks and GPU telemetry/optimization.
  • On-prem operations for GPU/CPU fleets.
  • HPC/VFX pipeline exposure; render farms; real-time engines.
  • Storage systems (S3/MinIO, Ceph/Lustre/NFS), CDN, and caching strategies.
  • Messaging/streaming (Kafka) and workflow/orchestration (Argo, Airflow).

About You

  • Pragmatic and systems-thinking oriented.
  • Bias to automate and simplify.
  • Clear communication during incidents and reviews.
  • Ownership across design, operations, and quality.

About Us

We are DNEG, one of the world’s leading visual effects and animation companies for the creation of award-winning feature film, television, and multiplatform content. We employ more than 9,000 people with worldwide offices and studios across North America (Los Angeles, Montréal, Toronto, Vancouver), Europe (London), Asia (Bangalore, Mohali, Chennai, Mumbai) and Australia (Sydney).

At DNEG, we fundamentally believe that embracing our differences is a vital component of our collective success. We are committed to creating an equitable, diverse and inclusive work environment for our global teams, where everyone feels they matter and belong. We welcome and encourage applications from all, regardless of background, experience or disability. Please let us know if you need any adjustments or support during the application process, we will do our best to accommodate your needs. We look forward to meeting you!

Set alerts for more jobs like Senior DevOps Engineer (Brahma)
Set alerts for new jobs by DNEG
Set alerts for new Devops jobs in United Kingdom
Set alerts for new jobs in United Kingdom
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙