- Jobs
- Datadog
- Staff GenAI Engineer - Application Performance Monitoring (APM)
Staff GenAI Engineer - Application Performance Monitoring (APM)
Tech Stack
Agent Workflow
Technical leader driving GenAI/ML projects from concept to production within APM. Builds automated investigation and triaging tools using agentic workflows for incident troubleshooting at scale.
About the Role
We're looking for a Staff Software Engineer with deep experience in GenAI/ML to join Datadog's Application Performance Monitoring (APM) team. APM is a product which provides deep visibility into applications, enabling users to identify performance bottlenecks, troubleshoot issues, and optimize services. With distributed tracing, profiling, out-of-the-box dashboards, and seamless correlation with other telemetry data, Datadog APM provides some of the deepest and most structured visibility into the health and performance of applications. This context sets us up for an opportunity to be the world leaders in agentic investigations and incident troubleshooting.
You'll act as a technical leader within the APM group, focused on agentic workflows. You'll lead efforts to design, train, evaluate, and deploy GenAI/ML models at scale. We're looking for a product-minded ML engineer with strong technical expertise, excellent communication skills, and a track record of driving impactful initiatives end to end.
At Datadog, we place value in our office culture. We operate as a hybrid workplace.
What You'll Do:
- Act as a technical leader within the APM organization, driving GenAI/machine learning projects from concept to production.
- Build and benchmark GenAI/ML models using state-of-the-art techniques.
- Collaborate with cross-functional teams to build automated investigation and triaging tools.
- Influence product direction by bringing a strong product mindset to your work, always advocating for the end user.
- Guide teams through ambiguity, scaling challenges, and evolving requirements with clear technical direction.
- Actively mentor engineers and influence engineering culture through leadership in design reviews, technical talks, and working groups.
Who You Are:
- BS/MS/PhD in a scientific field or equivalent experience
- 10+ years of relevant engineering experience, as well as experience acting as a technical lead
- Proven track record of leading large-scale GenAI/ML initiatives in a product-driven environment
- Significant experience in model deployment, development, training, fine-tuning, or evaluation
- Ability to drive initiatives across cross-functional teams, and solve ambiguous challenges
Benefits and Growth:
- Get to build tools for software engineers, just like yourself.
- Have a lot of influence on product direction and impact on the business.
- Work with skilled, knowledgeable, and kind teammates.
- Competitive global benefits.
- Continuous professional development.
The reasonably estimated yearly salary for this role at Datadog is: $234,000-$300,000 USD