Staff AI Engineer | US | Remote

Grafana Labs
$175K - $220K/yr

AI Infrastructure

Agentic Frameworks

Tech Stack

About the Role

Grafana Labs is a remote-first, open-source powerhouse. More than 20M users of Grafana monitor everything from beehives to climate change. Grafana also helps more than 3,000 companies including Bloomberg, JPMorgan Chase, and eBay manage their observability strategies.

This is a remote opportunity for candidates in the U.S.

The Opportunity
Grafana Labs is seeking a Staff AI Engineer to own the AI agent infrastructure and automation platform that powers our Sales organization. You'll build multi-agent architectures, LLM integrations, and backend services that connect AI models to internal and third-party data platforms. You'll ship production systems that teams depend on daily. This is a high-autonomy role where you own the technical direction.

What You'll Be Doing

Agentic Systems & AI Infrastructure

  • Own end-to-end development of multi-agent AI systems, from architecture and implementation through testing, deployment, and ongoing operation
  • Build modular, composable agentic systems using orchestration frameworks (LangChain, CrewAI, Anthropic MCP, or similar) that operate 24/7 across teams
  • Develop reusable agentic skills that agents invoke across interfaces (Slack, dashboards, internal apps, CLIs)
  • Implement observability and feedback loops including logging, performance metrics, prompt iteration, model evaluation, and cost management
  • Establish governance and compliance standards for AI workflows including access controls, audit trails, PII handling, and human-in-the-loop escalation paths

Systems Integration & Backend Services

  • Build MCP servers, APIs, CLIs, and microservices connecting AI models to business systems (BigQuery, Slack, Salesforce, email, calendars, analytics tools)
  • Architect data flows for retrieval-augmented generation (RAG), connecting LLMs to internal knowledge bases, customer data, and real-time business context
  • Build serverless or containerized services (GCP Cloud Functions, Cloud Run) that scale with usage and integrate with Grafana's cloud infrastructure

Automation & Workflow Manufacturing

  • Partner with RevOps and GTM teams to scope automation problems
  • Deploy workflows using orchestration tools (n8n, Workato, or custom platforms)
  • Build self-service systems with documentation and enablement

Requirements

  • 8+ years software engineering experience (backend, systems integration, or data engineering)
  • 2+ years hands-on experience applying LLMs to production workflows
  • Strong proficiency in Python and JavaScript/Node.js
  • Hands-on experience with LLM frameworks and patterns including prompt engineering, RAG, function calling
  • Experience building and operating multi-agent systems at scale
  • Deep familiarity with Google Cloud Platform and BigQuery
  • Understanding of LLM failure modes and production mitigations

Bonus

  • Vector database experience (Pinecone, Weaviate, ChromaDB, pgvector)
  • Workflow automation platform experience (n8n, Prefect, Clay, PhantomBuster, Apify, Dust)
  • Familiarity with Model Context Protocol (MCP)
  • Observability tools for AI (LangSmith, Weights & Biases)
  • GTM platform experience (Salesforce, HubSpot, Outreach, Gainsight)

Benefits include 30 days annual leave, RSUs, in-person onboarding, equity and bonus eligibility.

US base compensation: $174,986 - $220,000.

Apply on Greenhouse
Apply on Greenhouse

More jobs like this

Explore related roles

Get jobs like this weekly