Looking for a company that inspires passion, courage, and innovation? With our client, you can help shape the future of global commerce, influencing how millions of people buy, sell, connect, and share worldwide. Join a purpose-driven, inclusive team dedicated to making a meaningful impact globally.
About the team:We are the AI Platform Team, providing highly available, scalable, and automated machine learning infrastructure for researchers and data scientists globally. We are looking for a motivated, self-reliant SRE / DevOps engineer with Node.js experience to drive operational excellence, automation, and platform reliability.
Role Overview:This role focuses on maintaining, deploying, and improving AI/ML platform services with strong emphasis on DevOps, SRE practices, and automation. You will collaborate closely with developers, researchers, and infrastructure teams to ensure robust, scalable, and highly available systems.
Key Responsibilities (DevOps-heavy, ~60%):
- Design, implement, and maintain CI/CD pipelines for AI platform applications.
- Manage and troubleshoot Kubernetes clusters, Docker containers, and cloud infrastructure.
- Ensure high availability (99.999%), system reliability, and security across platforms.
- Automate operational tasks, monitoring, and deployment workflows.
- Collaborate with AI platform developers to deploy and scale services efficiently.
- Analyze and resolve production issues, performance bottlenecks, and functional problems.
- Define operational standards, versioning practices, and advise teams on DevOps best practices.
- Prepare documentation, training materials, and provide technical support to platform users.
Development Responsibilities (~40%):
- Design, build, and refactor services in React / Node.js.
- Integrate backend services with interactive UI components (Jupyter notebooks, developer tools).
- Contribute to developer productivity tools, such as VS Code plugins or ML workflow integrations.
- Collaborate with AI platform developers to integrate applications into automated CI/CD workflows.
Required Skills & Experience:
- Strong JavaScript / Node.js development experience (from 4 years of experience).
- Experience with web front-end / UI integration and ML workflow tools.
- Familiarity with Jupyter notebooks and developer productivity integrations.
- Solid understanding of Kubernetes, Docker, Linux fundamentals, and DevOps practices.
- Experience with CI/CD pipelines (Jenkins or similar), test automation, and monitoring.
- Strong debugging and triaging skills.
- Excellent communication and collaboration skills with cross-functional teams.
- Strong organizational skills to manage multiple projects in a fast-paced environment.
- Fluent in English (spoken and written).
- Overall from 5 years of relevant DevOps / SRE experience.
Why join our team?You will work at the intersection of AI/ML and global commerce, delivering high-impact solutions for millions of users. eBay provides an inclusive, diverse environment for growth, learning, and innovation.
We offer\*:
- Flexible working format - remote, office-based or flexible
- A competitive salary and good compensation package
- Personalized career growth
- Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
- Active tech communities with regular knowledge sharing
- Education reimbursement
- Memorable anniversary presents
- Corporate events and team buildings
- Other location-specific benefits
\*not applicable for freelancers