Founding Engineer — AI Research Agents (Full-Stack)

16 Minutes ago • All levels • $120,000 PA - $200,000 PA
Research Development

Job Description

Sixtyfour builds AI research agents capable of discovering, linking, and reasoning over comprehensive data about people and companies. This intelligence is then transformed into automated research workflows for sales, recruiting, and marketing. The role involves designing and shipping agentic systems, building search/indexing pipelines, owning robust web scraping, developing backend services, and scaling workloads on AWS + Docker.

About Sixtyfour

We build AI research agents that can discover, link, and reason over everything about people and companies. The platform turns that intelligence into automated research workflows for sales, recruiting, and marketing.

About the role

What you’ll do

  • Design and ship agentic systems (tool calling, multi-agent workflows, structured outputs) that reliably fetch, extract, and normalize data across the web and APIs.
  • Build and operate search/indexing pipelines on OpenSearch/Elasticsearch (schema design, analyzers, reindex/data migration strategies, relevance tuning).
  • Own robust web scraping: directory crawling, CAPTCHA handling, headless browsers, rotating proxies, anti-bot evasion, and backoff/retry policies.
  • Develop backend services in Python + FastAPI with clean contracts and strong observability.
  • Scale workloads on AWS + Docker (batch/queue workers, autoscaling, fault tolerance, cost control).
  • Parallelize external API requests safely (rate limits, idempotency, circuit breakers, retries, dedupe).
  • Integrate third-party APIs for enrichment and search; model and cache responses; manage schema evolution.
  • Transform and analyze data using Pandas (or similar) for normalization, QA, and reporting.
  • Pitch in across the stack, including billing (Stripe) and occasional front-end changes, to ship end-to-end features.

Minimum requirements

  • Hands-on experience with agentic architectures (tool calling, structured outputs/JSON, planning/execution loops) and prompt engineering.
  • Deep knowledge of OpenSearch/Elasticsearch: index design, analyzers, ingestion pipelines, snapshots, rolling upgrades, and zero-downtime reindexing/data migrations.
  • Proven web scraping expertise: solving CAPTCHAs, session/auth flows, proxy rotation, stealth techniques, and legal/ethical constraints.
  • AWS + Docker in production (at least two of: ECS/EKS, Lambda, SQS/SNS, Batch, Step Functions, CloudWatch).
  • Building high-throughput data/IO pipelines with concurrency (asyncio/multiprocessing), resilient retries, and rate-limit aware scheduling.
  • Integrating diverse external APIs (auth patterns, pagination, webhooks); designing stable interfaces and backfills.
  • Strong data wrangling with Pandas or equivalent; comfort with large CSV/Parquet workflows and memory/perf tuning.
  • Familiarity with Stripe (subscriptions, metered billing, webhooks) and basic front-end changes (React/TypeScript or similar).
  • Excellent ownership, product sense, and pragmatic debugging.

Nice to have

  • Entity resolution/record linkage at scale (probabilistic matching, blocking, deduping).
  • Experience with Langfuse, OpenTelemetry, or similar for tracing/evals; task queues (Celery/RQ), Redis, Postgres.
  • Search relevance (BM25/vector/hybrid), embeddings, and retrieval pipelines.
  • Playwright/Selenium, stealth browsers, anti-bot frameworks, CAPTCHA providers.
  • CI/CD, infrastructure as code (Terraform), and cost/perf observability.
  • Security & compliance basics for data handling and PII.

Technology

Language Models, OpenSearch/Elasticsearch, Next.js (TypeScript), Python, FastAPI, AWS, Docker, Celery workers, Playwright, Supabase, Stripe
