Role Overview
We are seeking a Senior AI Engineer specializing in LLMs to lead the design, evaluation, and deployment of production-grade generative AI systems. You will own end-to-end LLM solutions, from prototyping to scalable production, while establishing best practices in evaluation, reliability, and responsible AI.

How will you make an impact?

Lead the design and development of LLM-powered applications (chatbots, copilots, agents, internal tools)
Own and evolve the LLM evaluation (evals) strategy, including designing gold-standard datasets and benchmarks, building automated eval pipelines and scoring systems, and defining metrics for factuality, grounding, robustness, and user impact
Diagnose and resolve complex failure modes (hallucinations, retrieval issues, agent breakdowns)
Optimize systems for latency, cost, scalability, and reliability in production
Mentor junior engineers and guide best practices in LLM development and evaluation
Collaborate cross-functionally with product, data, and leadership to shape AI strategy
Set standards for responsible AI, including safety, bias mitigation, and observability

What makes you a great fit?

5+ years of experience in software engineering, machine learning, and applied AI with a track record of driving projects to completion
Strong software engineering fundamentals (testing, modular design, dependency injection) in Python
A track record of taking AI and LLM-powered features from initial concept through deployment and long-term production maintenance
Experience implementing automated testing strategies for non-deterministic systems, and strong debugging and analytical skills for ambiguous model behavior
A strong understanding of prompt engineering and prompt lifecycle management, RAG architectures and retrieval evaluation, and LLM limitations and failure patterns
Solid experience using data analytics techniques (SQL, analysis and visualization) to inform product decisions
A strong product mindset: you understand our product and our customers' needs deeply enough to design the right solutions for them
Strong tech leadership and mentorship skills, and the ability to independently drive projects to completion
Clear communication of trade-offs, risks, and system performance to stakeholders

How can you earn bonus points?

Proven experience driving ambiguous projects to completion, mentoring teams, and communicating complex technical risks to stakeholders
The ability to design robust, production-grade evaluation at scale using advanced metrics and statistical validation
Deep expertise in model fine-tuning, adversarial red-teaming, and safety testing to protect the system from edge-case vulnerabilities

This role is closed.
