Senior AI Engineer (AI & Automation)

Grafana Labs
$154K - $185K/yr

About the Role

Grafana Labs, the company behind the open observability cloud, is founded on the principles of open source, open standards, open ecosystems, and open culture. We are a 100% remote company with 1,600+ team members across 40+ countries. This is a remote opportunity, and we are looking for candidates based in the U.S.

The Opportunity
Grafana Labs is seeking a Senior Engineer (AI & Automation) to own the AI agent infrastructure and automation platform that powers our Marketing Operations organization. You'll build multi-agent architectures, LLM integrations, and backend services that connect AI models to internal and third-party data platforms. You'll ship production systems that teams depend on daily.

This is a high-autonomy role. You'll identify the highest-leverage problems across Marketing, RevOps, and SDR teams, design the solutions, and ship them. You'll own the technical direction for the automation platform (data models, API contracts, shared libraries, reference architectures) and partner with Data Engineering, GTM Systems, and Field Operations to build scalable, self-service automation that eliminates manual work and drives operational efficiency.

What You'll Be Doing
Agentic Systems & AI Infrastructure

  • Own end-to-end development of multi-agent AI systems, from architecture and implementation through testing, deployment, and ongoing operation
  • Build modular, composable agentic systems using orchestration frameworks (LangChain, CrewAI, Anthropic MCP, or similar) that operate 24/7 across teams
  • Develop reusable agentic skills that agents invoke across interfaces (Slack, dashboards, internal apps, CLIs)
  • Implement observability and feedback loops including logging, performance metrics, prompt iteration, model evaluation, and cost management
  • Establish governance and compliance standards for AI workflows including access controls, audit trails, PII handling, and human-in-the-loop escalation paths

Systems Integration & Backend Services

  • Build MCP servers, APIs, CLIs, and microservices connecting AI models to business systems (BigQuery, Slack, CRMs, email, calendars, analytics tools)
  • Architect data flows for retrieval-augmented generation (RAG), connecting LLMs to internal knowledge bases, customer data, and real-time business context
  • Build serverless or containerized services (GCP Cloud Functions, Cloud Run) that scale with usage and integrate with Grafana's cloud infrastructure

Automation & Workflow Enablement

  • Partner with RevOps, Demand Generation, Regional Marketing, and SDR teams to scope high-impact automation problems, identify bottlenecks, and build solutions with measurable business outcomes
  • Design and deploy workflows using orchestration tools (n8n, Workato, or custom platforms) with CI/CD, testing, and production reliability standards
  • Build systems designed for self-service with documentation, playbooks, and enablement materials

What Makes You a Great Fit

  • 8+ years of software engineering experience with depth in backend development, systems integration, or data/analytics engineering
  • 2+ years hands-on experience applying LLMs/AI to production workflows, not just prototypes
  • Strong proficiency in Python and JavaScript/Node.js with Git-based workflows, code review practices, and testing discipline
  • Hands-on experience with LLM frameworks and patterns including prompt engineering, RAG, function calling/tool use, structured output parsing, and evaluation
  • Experience building and operating multi-agent systems at scale including agent decomposition, orchestration patterns (sequential chains, router/dispatcher, parallel fan-out), state management, and production monitoring
  • Deep knowledge of GCP, BigQuery, and serverless/containerized services
  • Understanding of LLM failure modes and production mitigations
  • Fluent with AI-assisted development tools (GitHub Copilot, Cursor, Claude Code)

Bonus Points

  • Vector databases (Pinecone, Weaviate, ChromaDB, Qdrant, pgvector)
  • Marketing/sales platform experience (Salesforce, HubSpot, Marketo, Outreach)
  • Frontend frameworks (React, Slack Block Kit)
  • AI observability tools (LangSmith, Weights & Biases)
  • Workflow orchestration platforms (n8n, Temporal, Prefect, Airflow)
  • Model Context Protocol (MCP) experience
  • B2B SaaS marketing/sales/customer success automation background
  • Open-source community involvement

The base compensation range for this role is USD $154,445 - USD $185,334. All of our roles include Restricted Stock Units (RSUs). 100% Remote, Global Culture. Global annual leave policy of 30 days per annum plus 3 Grafana Shutdown Days.
