Senior DevOps Engineer (with Java)

3-5 Years • DevOps

Job Description

The AI Platform Team is seeking a highly motivated Senior DevOps Engineer with Java experience to drive major transformation within the AI organization. This role involves maintaining, deploying, and improving AI platform services with an emphasis on DevOps, SRE practices, and automation. The engineer will work closely with developers, infrastructure teams, and researchers to ensure robust, scalable, and highly available systems, supporting machine learning infrastructure globally.

About our customer:

Our customer is a company that inspires passion, courage, and imagination, where you can be part of the team shaping the future of global commerce. If you want to shape how millions of people buy, sell, connect, and share around the world, and if you’re interested in joining a purpose-driven community dedicated to creating an ambitious and inclusive workplace, then our customer is definitely a company you can be proud to be a part of.

Our customer is a global commerce leader where you can influence how the world buys, sells, and gives. You’ll be part of a work culture that’s been genuinely committed to diversity and inclusion since its founding over twenty-five years ago. Here, you can be yourself, do your best work along with a team of professionals, and have a meaningful impact on people across the globe. We seek people with drive, ideas, and a passion for helping small businesses succeed to help craft the future of our customer. Does this sound like you? If so, we’d love to talk to you!

About the team:

We are the AI Platform Team! We are looking for a highly motivated, self-reliant, experienced SRE and customer support engineer who is passionate about driving major transformation within the AI organization.

This role will support the AI Platform and the services that provide machine learning infrastructure to the customer's researchers and data scientists globally. You'll be expected to keep up with the latest technology developments, drive the implementation of DevOps practices across the organization, and provide customer support.

Role Overview:

This role focuses on maintaining, deploying, and improving AI platform services with emphasis on DevOps, SRE practices, and automation. You will work closely with developers, infrastructure teams, and researchers to ensure robust, scalable, and highly available systems.

Key Responsibilities (DevOps-heavy, ~60%):

  • Design, implement, and maintain CI/CD pipelines for AI platform services.
  • Manage and troubleshoot Kubernetes clusters, Docker containers, and cloud infrastructure.
  • Ensure high availability (99.999%), system reliability, and security across platforms.
  • Automate operational tasks, monitoring, and deployment workflows.
  • Collaborate with developers and infrastructure teams to deploy and scale services.
  • Analyze and resolve production issues, performance bottlenecks, and functional problems.
  • Define operational standards and versioning practices, and advise teams on DevOps best practices.
  • Prepare documentation and training materials, and provide technical support to platform users.

Development Responsibilities (~40%):

  • Design, build, and refactor services in Java / Spring framework.
  • Develop and maintain microservices, including service discovery, orchestration, and system APIs.
  • Work with relational (MySQL, PostgreSQL, Oracle, SQL Server) and NoSQL (MongoDB, Cassandra, Redis) databases.
  • Contribute to low-latency, high-throughput web services development.
  • Collaborate with AI platform developers to integrate applications into automated CI/CD workflows.

Required Skills & Experience:

  • Strong Java / Spring development experience (2–4 years).
  • Hands-on experience with Kubernetes, Docker, and Linux fundamentals.
  • Experience with CI/CD pipelines (Jenkins or similar), test automation, and DevOps practices.
  • Familiarity with metrics and monitoring tools (Prometheus, Grafana) is a plus.
  • Strong debugging and triaging skills, including JVM profiling, memory leak detection, and GC tuning.
  • Excellent communication and collaboration skills with cross-functional teams.
  • Strong organizational skills to manage multiple projects in a fast-paced environment.
  • Fluent in English (spoken and written).
  • Overall 3–5 years of relevant DevOps / SRE experience.

We offer*:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

*not applicable for freelancers
