At LeoTech, we are passionate about building software that solves real-world problems in the Public Safety sector. Our software has been used to help the fight against continuing criminal enterprises, drug trafficking organizations, identifying financial fraud, disrupting sex and human trafficking rings and focusing on mental health matters.

As an AI/LLM Evaluation & Alignment Engineer on our Data Science team, you will play a critical role in ensuring that our Large Language Model (LLM) and Agentic AI solutions are accurate, safe, and aligned with the unique requirements of public safety and law enforcement workflows. You will design and implement evaluation frameworks, guardrails, and bias-mitigation strategies.

Core Responsibilities:

Build and maintain evaluation frameworks for LLMs and generative AI systems for public safety use cases
Design guardrails and alignment strategies to minimize bias, toxicity, hallucinations
Partner with AI engineers and data scientists on evaluation metrics
Implement continuous evaluation pipelines integrated into CI/CD
Stress test models against edge cases and adversarial prompts
Ensure explainability, transparency, and auditability of AI outputs
Contribute to DevOps/MLOps workflows

Requirements:

Bachelor's/Master's in CS, AI, Data Science
3-5+ years ML/AI engineering, 2+ years on LLM evaluation/safety
Python proficiency with LangGraph, Strands Agents, Pydantic AI, LangChain, HuggingFace, PyTorch, LlamaIndex
DevOps/MLOps pipeline experience (Kubernetes, Terraform, ArgoCD, GitHub Actions)

Technologies: AWS (Bedrock, SageMaker, Lambda), Azure AI, Kubernetes, HuggingFace, OpenAI API, Anthropic, LangChain, LlamaIndex, Ragas, DeepEval, Langfuse, GuardrailsAI, Python, ElasticSearch, Kafka, Airflow.

Salary: $135,000-$160,000. Location: Austin, TX (Remote).

This role is closed.See similar open roles

More jobs like this

Senior Agentic (AI) Engineer

Worth AI

RemoteAutoGenClaude Agent SDK+11 more

🇺🇸

2026-06-06

Agent Harness Engineer

JAPAN AI

¥12M - ¥20M/yrRemoteAutoGenLangChain+1 more

🇯🇵

2026-06-28

Staff AI Engineer (Acquia DAM)

Acquia

$180K - $200K/yrRemoteCrewAILangChain+4 more

🇺🇸

2026-06-24

Senior Software Development Engineer (GenAI, Agentic AI)

Expedia Group

$185K - $258K/yrRemoteLangChainLangGraph+6 more

🇺🇸

2026-06-06

AI Engineer

Platinum Technologies

$160K - $180K/yrAgnoLangChain+10 more

🇺🇸

2026-06-06

Elastic AI Engineer

Elastic

CA$102K - CA$161K/yrRemoteLangChainLangGraph+2 more

🇨🇦

2026-05-21

Forward Deployed AI Engineer, Enterprise

Scale AI

$180K - $225K/yrLangChainLlamaIndex+2 more

Radical AI

Samsara

$102K - $171K/yrRemoteCrewAILangChain+6 more

🇺🇸

2026-05-19

AI Platform Engineer

OpenVPN

$140K - $150K/yrRemoteAWS BedrockVertex AI

🇺🇸

2026-05-06

Senior AI Developer (Customer Delivery)

CI&T

RemoteAutoGenCrewAI+10 more

🇧🇷

2026-06-16

Forward Deployed Engineer - Applied AI - Senior - Financial Services - Consulting

$107K - $201K/yrLangChainLangGraph+7 more

🇺🇸

2026-05-19

AI Engineer

Influur

$150K - $200K/yrRemoteCrewAILangChain+3 more

🇺🇸

2026-05-12

AI Engineer

Scotiabank

AutoGenLangChain+4 more

🇨🇦

2026-05-06

Staff Engineer, Agentic

Inflection AI

$350K - $500K/yrRemoteClaude Agent SDKLangGraph+4 more

🇺🇸

2026-06-24

AI Engineer

ATI Business Group

RemoteLangChainLangGraph+4 more

🇮🇩

2026-06-24

Staff AI Engineer

Acquia

$180K - $230K/yrRemoteCrewAILangChain+3 more

🇺🇸

2026-06-16

Agent Harness Engineer

JAPAN AI (Geniee)

¥10M - ¥17M/yrAutoGenLangChain+1 more

🇯🇵

2026-06-13

Senior Machine Learning Engineer, Zeitgeist, Personalization

Spotify

$184K - $263K/yrRemoteLangChainLlamaIndex+1 more

🇺🇸

2026-05-19

Full-Stack Product Engineer

LlamaIndex

$150K - $230K/yrLlamaIndex

🇺🇸

2026-05-19

View all similar jobs →

Explore related roles

More LangChain jobs More LangGraph jobs More LlamaIndex jobs More Pydantic AI jobs More AWS Bedrock jobs More Langfuse jobs

Get jobs like this weekly

Join 26 subscribers