AI Test Engineer

1 Month ago • Upto 3 Years • Testing

Job Summary

Job Description

We’re looking for a creative and curious Backend & AI Agent Testing Engineer to join our team. You’ll work hands-on with our AI agents and backend services, writing code, debugging, scripting tests, and evaluating LLM prompts to ensure our systems behave reliably — even in unpredictable scenarios. This is not a traditional backend or QA role. You’ll quickly learn new tools, explore unfamiliar apps, and design tests from a user’s perspective to help tame and improve our cutting-edge AI systems.
Must have:
  • Write, debug, and maintain backend code in Python or JavaScript for robust systems.
  • Implement APIs and ensure authentication workflows work as expected.
  • Design and execute creative test strategies for AI agent behavior (LLM-based GenAI).
  • Evaluate AI agent outputs and prompts, contributing to LLM evaluation and metrics.
  • Write scripts and automation to test AI agents and backend workflows.
  • Build lightweight automation frameworks and develop or extend test infrastructure.
  • Dive into new, unfamiliar apps and services quickly, learning and building tests.
  • Anticipate edge cases and potential failure modes from a user's perspective.
  • Creatively manage and test AI systems that may behave inconsistently.
  • Work hands-on with integrations (HubSpot, Salesforce) and modern collaboration tools.
  • Test and validate backend workflows connecting critical user data across systems.
  • Collaborate closely with cross-functional teams in a startup environment.
  • Continuously learn new frameworks, tools, and approaches independently.
Good to have:
  • Experience with HubSpot or Salesforce integrations.
  • Familiarity with tools like Cursor or Windsurf.
  • Knowledge of modern automation frameworks (e.g., Selenium).
  • Experience in B2B product environments.
  • Experience with LLM evaluation frameworks, metrics, or MCPs.
Perks:
  • Work at the intersection of backend engineering, AI, and creative testing.
  • A fast-moving, supportive startup culture that values experimentation and creativity.
  • Opportunity to work with cutting-edge AI systems, tools, and frameworks.
  • Learn and grow rapidly alongside a talented and collaborative team.

Job Details

Description

Data Science problems are everywhere, but the talent is not. At ObviouslyAI, our vision is to turn every company into an AI company. We do this by providing businesses with access to world-class, on-demand data science talent that helps them solve real business problems. On the back end, we empower data scientists with a set of internal groundbreaking tools to help them deliver results in minutes, not months.

We’re a small, scrappy group of people with a strong bent toward failing fast, a bias for action, and attention to detail. We’re focused on doing the best work of our lives and believe in having a healthy separation of work and life. We keep working hours flexible and are building a team with business teams located in San Francisco, CA and engineering teams located in Bangalore, India.

Obviously AI is backed by some of the top venture capital firms in the US, and you’ll be on the ground floor of a fast-growing company with a big mission.

About You

We’re looking for a creative and curious Backend & AI Agent Testing Engineer to join our team. You’ll work hands-on with our AI agents and backend services, writing code, debugging, scripting tests, and evaluating LLM prompts to ensure our systems behave reliably — even in unpredictable scenarios.

This is not a traditional backend or QA role. You’ll quickly learn new tools, explore unfamiliar apps, and design tests from a user’s perspective to help tame and improve our cutting-edge AI systems.

Responsibilities

  • Build and Maintain
  • Write, debug, and maintain backend code in Python or JavaScript, building test cases and backend scripts to support robust and experimental systems.
  • Implement APIs and ensure authentication workflows work as expected, with exposure to integrations.
  • AI Agent Testing & Prompt Evaluation
  • Design and execute creative test strategies for AI agent behavior, particularly around LLM-based (GenAI) agents, ensuring systems behave reliably despite their unpredictable nature.
  • Evaluate AI agent outputs and prompts, contributing to LLM evaluation and metrics using tools like Deepchecks.
  • Scripting & Automation
  • Write scripts and automation to test AI agents and backend workflows.
  • Build lightweight automation frameworks and develop or extend test infrastructure; familiarity with Selenium or similar is a plus.
  • Experimentation & Exploration
  • Dive into new, unfamiliar apps and services quickly — learning them and building tests as if you were an end-user.
  • Get into the “user’s shoes” to anticipate edge cases and potential failure modes.
  • Tame the “beast” — creatively managing and testing AI systems that may behave inconsistently.
  • Tools & Integrations
  • Work hands-on with integrations (e.g., HubSpot, Salesforce — bonus points for experience here) and modern collaboration tools like Cursor or Windsurf.
  • Test and validate backend workflows that connect names, emails, and other critical user data across systems.
  • Collaboration & Continuous Learning
  • Collaborate closely with cross-functional teams in a startup environment, embracing rapid experimentation and iteration.
  • Continuously learn new frameworks, tools, and approaches without handholding, demonstrating a strong growth mindset.

Requirements

  • Experience Level: ~0–3 years in backend development or testing, ideally in a startup or experimental role.
  • Backend Engineering Basics: Able to write backend code, debug, and write test cases in Python or JavaScript.
  • Testing Creativity: Not a traditional tester — you think creatively, experiment boldly, and approach testing like teaching a child.
  • AI Agent Experience: Exposure to or experience testing and prompting AI agents, especially GenAI/LLM-based systems.
  • Automation & Scripting: Comfortable writing backend scripts, automating tests, and creating testing frameworks.
  • Integrations & APIs: Exposure to integrations (bonus if HubSpot, Salesforce), understanding of API interactions and authentication.
  • Tools: Familiarity with tools like Cursor or Windsurf preferred.
  • Knowledge of modern automation frameworks (e.g., Selenium).
  • Experience in B2B product environments.
  • LLM Evaluation & Metrics: Bonus if you’ve worked with LLM evaluation frameworks, metrics, or MCPs.
  • Mindset:
  • Curious and fast learner — you’re comfortable diving into completely unknown tools or apps and figuring them out.
  • Super creative in designing tests beyond “clicking around” — you understand that testing AI systems means “taming the beast.”
  • Willing to experiment, work on ambiguous problems, and wear multiple hats.

Benefits

  • Work at the intersection of backend engineering, AI, and creative testing.
  • A fast-moving, supportive startup culture that values experimentation and creativity.
  • Opportunity to work with cutting-edge AI systems, tools, and frameworks.
  • Learn and grow rapidly alongside a talented and collaborative team.

Similar Jobs

Morning Star - ServiceNow Software Engineer

Morning Star

Chicago, Illinois, United States (Hybrid)
1 Month ago
Doola - AI Engineering Manager

Doola

Bengaluru, Karnataka, India (Remote)
2 Months ago
deel. - Customer Onboarding Manager

deel.

United States (Remote)
4 Weeks ago
EMA - Product Manager

EMA

Bengaluru, Karnataka, India (Hybrid)
10 Months ago
Mercury - Deputy BSA Officer

Mercury

San Francisco, California, United States (Remote)
1 Month ago
Paper games - Test Development Engineer (2026 Autumn Campus Recruitment)

Paper games

Shanghai, China (On-Site)
4 Weeks ago
Next Level Business Services - Workday Integration Tester

Next Level Business Services

Menomonee Falls, Wisconsin, United States (On-Site)
10 Months ago
Nexon - QA Tester

Nexon

El Segundo, California, United States (Hybrid)
1 Month ago
Hitachi - Performance Testing

Hitachi

Pune, Maharashtra, India (Remote)
10 Months ago
Aerovect - AV Systems Test Engineer

Aerovect

Atlanta, Georgia, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Interface AI - Senior Product Designer

Interface AI

(Remote)
6 Months ago
2K - Manager, Social & Community Marketing

2K

Novato, California, United States (On-Site)
1 Month ago
Brillio - Technical Product Owner – Twilio & IVR Solutions

Brillio

Edison, New Jersey, United States (Remote)
1 Month ago
Rackspace Technology - Principal Backend Java Engineer

Rackspace Technology

United States (Hybrid)
2 Months ago
FlockSafety - Traveling Installation Technician

FlockSafety

Buffalo, New York, United States (On-Site)
1 Month ago
Ethos Life - Documentation Specialist

Ethos Life

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Addepar - Senior Product Designer - Portfolio Data Experience

Addepar

United Kingdom (Remote)
1 Month ago
zeta - Senior Associate - Taxation (Indirect Tax)

zeta

Mumbai, Maharashtra, India (On-Site)
1 Month ago
Autodesk - Partner Principle Solutions Executive

Autodesk

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Apple - Watch System Architect

Apple

Cupertino, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Brazil

bytedance - IT Support Engineer

bytedance

State Of São Paulo, Brazil (On-Site)
4 Months ago
Wildlife Studios - Backend Engineer

Wildlife Studios

São Paulo, State Of São Paulo, Brazil (On-Site)
1 Month ago
pipa studios - 2D Illustrator - Mid-Level

pipa studios

São Paulo, Brazil (On-Site)
1 Month ago
Axon - LATAM Business Controller

Axon

State Of São Paulo, Brazil (On-Site)
1 Month ago
PwC - Manager

PwC

Barueri, São Paulo, Brazil (On-Site)
1 Year ago
Google - Software Engineer, Black Community Inclusion

Google

Belo Horizonte, State Of Minas Gerais, Brazil (On-Site)
1 Month ago
ARVORE Immersive Experiences - Creative Director

ARVORE Immersive Experiences

São Paulo, State Of São Paulo, Brazil (Remote)
4 Months ago
PwC - Desenvolvedor Power BI | Senior Associate 2 [tag01]

PwC

São Paulo, State Of São Paulo, Brazil (On-Site)
10 Months ago
Epic Games - Tech Art Lead

Epic Games

Porto Alegre, State Of Rio Grande Do Sul, Brazil (On-Site)
5 Months ago
Sporty - Commercial Executive - Media Ad Sales

Sporty

São Paulo, Brazil (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Testing Jobs

Visa - Staff Software Engineer - Test Engineering

Visa

Atlanta, Georgia, United States (Hybrid)
1 Month ago
endava - .NET Automation tester

endava

Timișoara, Timiș, Romania (On-Site)
3 Months ago
kooapps - Quality Assurance Tester

kooapps

Makati City, Metro Manila, Philippines (On-Site)
1 Year ago
Ansys - Spring 2026 Intern - Software Development and Testing

Ansys

Burnaby, British Columbia, Canada (On-Site)
1 Month ago
Cubic corporation - Senior Systems Test Engineer

Cubic corporation

Hyderabad, Telangana, India (On-Site)
2 Months ago
Tesla - Senior High Voltage Battery Mechanical Test Engineer

Tesla

North Brabant, Netherlands (On-Site)
6 Months ago
zoox - Senior Site Reliability Engineer - Automation Test Frameworks

zoox

Foster City, California, United States (On-Site)
1 Year ago
Coherent corp. - Laser Test Engineer

Coherent corp.

Wilsonville, Oregon, United States (On-Site)
3 Months ago
London stock Exchange - Software Development Engineer in Test (SDET)

London stock Exchange

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Capgemini - Selenium + API Testing

Capgemini

Bengaluru, Karnataka, India (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Zams is an enterprise platform to build AI agents that automate back-office work. Backed by top Silicon Valley investors like B Capital Group, UTEC, TMV, Sequoia Scouts and Facebook. Trusted by hundreds of businesses worldwide.

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (Remote)

San Francisco, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Obviously A

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug