SDET III – Generative AI QA

1 Month ago • 7-9 Years
Automation

Job Description

Netomi is the leading agentic AI platform for enterprise customer experience, working with global brands like Delta Airlines and MetLife. We enable agentic automation at scale across the customer journey with our no-code platform. We are seeking a Senior SDET with Generative AI testing expertise to lead the development of automation frameworks for AI/ML-powered applications. This role ensures reliability, safety, and scalability of LLM-driven products while advancing traditional test automation for cloud-native systems.
Must Have:
  • Design and maintain Python/Java automation frameworks for web, API, and backend services.
  • Extend frameworks to test LLM integrations with prompt validation and response consistency.
  • Implement model benchmarking for generative AI features.
  • Integrate tests into CI/CD pipelines with cloud workflows.
  • Optimize performance testing for AI endpoints handling high-throughput inference.
  • Debug flaky tests in non-deterministic AI systems.
  • Mentor junior engineers on AI testing best practices.
  • Research tools like LangChain, synthetic data generators, or adversarial testing.
  • Advocate for ML-specific quality metrics beyond traditional pass/fail.

About the Company:

Netomi is the leading agentic AI platform for enterprise customer experience. We work with the largest global brands like Delta Airlines, MetLife, MGM, United, and others to enable agentic automation at scale across the entire customer journey. Our no-code platform delivers the fastest time to market, lowest total cost of ownership, and simple, scalable management of AI agents for any CX use case. Backed by WndrCo, Y Combinator, and Index Ventures, we help enterprises drive efficiency, lower costs, and deliver higher quality customer experiences.

Want to be part of the AI revolution and transform how the world’s largest global brands do business? Join us!

We’re seeking a Senior SDET with expertise in Generative AI testing to lead the development of cutting-edge automation frameworks for AI/ML-powered applications. You’ll ensure the reliability, safety, and scalability of LLM-driven products while advancing traditional test automation for cloud-native systems.

Responsibilities

AI-Aware Test Automation
  • Design and maintain Python/Java-based automation frameworks (Selenium, Playwright, TestNG/JUnit) for web, API, and backend services.
  • Extend frameworks to test LLM integrations (OpenAI, HuggingFace, RAG pipelines) with prompt validation, hallucination checks, and response consistency tests.
  • Implement model benchmarking (latency, accuracy, bias/drift detection) for generative AI features.

Quality Infrastructure
  • Integrate tests into CI/CD pipelines (Jenkins, GitHub Actions) with cloud workflows (AWS/GCP).
  • Optimize performance testing (JMeter/Locust) for AI endpoints handling high-throughput inference.
  • Debug flaky tests in non-deterministic AI systems.

Leadership & Innovation
  • Mentor junior engineers on AI testing best practices.
  • Research tools like LangChain, synthetic data generators, or adversarial testing techniques.
  • Advocate for ML-specific quality metrics beyond traditional pass/fail.

Requirements

  • 7–9 years in QA automation with strong Python/Java proficiency.
  • Hands-on experience with Selenium, Playwright, REST Assured, and CI/CD tools (Jenkins, Docker).
  • Solid understanding of SQL/NoSQL databases and cloud platforms (AWS/GCP).
  • Exposure to performance testing (JMeter, K6) and scalable test frameworks.
  • Experience with LLM testing (prompt engineering, output validation, rubric-based grading).
  • Familiarity with OpenAI APIs, HuggingFace, or LangChain.
  • Knowledge of synthetic test data generation for edge-case scenarios.
  • Autonomy – Thrive in fast-paced, AI-driven environments with minimal supervision.
  • Analytical Mindset – Debug complex failures in probabilistic AI systems.
  • Communication – Explain technical trade-offs to non-technical stakeholders.

Netomi is an equal opportunity employer committed to diversity in the workplace. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, disability, veteran status, and other protected characteristics.
