We are looking for a powerhouse engineer who bridges AI development and automated validation. This is a builder-tester role: you will own the full lifecycle of AI features, from writing core Python logic for agentic workflows and tuning prompts, to architecting robust evaluation frameworks that ensure those systems are enterprise-ready.

Core Responsibilities

AI Development & Engineering

Agentic Implementation: Design and implement autonomous agents using frameworks such as LangGraph, CrewAI, or AutoGen.
Prompt Engineering: Own the prompt lifecycle, design, version, and tune system prompts to minimize hallucinations and maximize intent recognition.
RAG Pipeline Development: Build and optimize Retrieval-Augmented Generation (RAG) components, including document ingestion, chunking strategies, and vector database indexing.
Feature Prototyping: Rapidly prototype AI-driven features in Python to validate feasibility before full-scale integration.
Data Curation: Build golden datasets and synthetic data generation scripts to train and evaluate models.

Advanced Validation & Quality Architecture

Automated Evaluation (Eval-as-Code): Build automated pipelines to measure LLM performance across metrics such as faithfulness, relevancy, and toxicity.
Non-Deterministic Testing: Develop strategies to test 'fuzzy' outputs using LLM-assisted evaluation (using one LLM to grade another).
Hybrid Framework Development: Design and maintain a dual-stack automation framework covering both backend/infrastructure and AI/ML validation.
End-to-End Orchestration: Integrate AI tests into the MLOps pipeline so that every model deployment or prompt change triggers a full regression of the agent's reasoning capabilities.
Mentorship & Standards: Serve as the team's subject matter expert, defining standards for writing inherently testable AI code.

Required Skills

5+ years as SDET/QA Automation/AI Engineer
2+ years testing AI/ML products
Production-quality Python code and SaaS testing background
LLMs: OpenAI, Anthropic, Mistral, Llama
Vector Databases: Pinecone, Milvus, Weaviate
Data Science Libraries: Pandas, NumPy
Performance Testing: JMeter, Locust
Message Queues: Kafka, RabbitMQ

This role is closed.See similar open roles

More jobs like this

Agent Harness Engineer

JAPAN AI (Geniee)

¥10M - ¥17M/yrAutoGenLangChain+1 more

🇯🇵

2026-06-13

AI Solutions Engineer

Neuron7

AgnoAutoGen+5 more

🇮🇳

2026-07-27

Senior AI Engineer — GenAI & Autonomous Agents

Jus Mundi

AutoGenLangChain+7 more

🇫🇷

2026-07-27

Ingénieur(e) IA confirmé(e)

ATEME

Claude Agent SDKCrewAI+6 more

🇫🇷

2026-06-13

Forward Deployed Software Engineer

Sarvam AI

RemoteCrewAILangChain+3 more

🇮🇳

2026-07-07

AI Engineer Senior (JavaScript/React)

CI&T

RemoteAutoGenCrewAI+5 more

mama health

Senior AI Developer (Customer Delivery)

CI&T

RemoteAutoGenCrewAI+10 more

🇧🇷

2026-06-16

Senior Agentic (AI) Engineer

Worth AI

RemoteAutoGenClaude Agent SDK+11 more

🇺🇸

2026-06-06

Senior AI Engineer

Yuno

CrewAILangChain+7 more

🇮🇳

2026-07-27

Senior AI Engineer

Reflow (Re:Build Manufacturing)

$143K - $215K/yrAutoGenCrewAI+5 more

🇺🇸

2026-06-06

Agent Harness Engineer

JAPAN AI

¥12M - ¥20M/yrAutoGenLangChain+2 more

🇯🇵

2026-06-06

Agent Harness Engineer

JAPAN AI

¥12M - ¥20M/yrRemoteAutoGenLangChain+1 more

🇯🇵

2026-06-28

Application/Agentic AI Engineer

NextGen Federal Systems

AutoGenCrewAI+3 more

🇺🇸

2026-06-13

LLM Solutions Architect

Xsolla

AutoGenCrewAI+5 more

🇨🇦 🇺🇸

2026-06-06

Staff Software Engineer, AI Engineering

Tebra

$216K - $224K/yrRemoteCrewAILangChain+4 more

🇺🇸

2026-07-20

Senior AI Engineer - Agentic AI @ COOA

ING

€66K - €106K/yrCrewAIGoogle ADK+3 more

🇳🇱

2026-06-28

AI Engineer

Cachet

CrewAILangChain+2 more

🇪🇪

2026-06-13

Senior Full Stack AI Engineer

Ombud

LangChainLangGraph+8 more

🇺🇸

2026-06-06

Senior Software Engineer, AI

Aircall

AutoGenLangChain+1 more

🇪🇸

2026-06-06

View all similar jobs →

Explore related roles

More AutoGen jobs More CrewAI jobs More LangChain jobs More LangGraph jobs More LlamaIndex jobs More Pinecone jobs

Get jobs like this weekly

Join 74 subscribers