Staff Applied Scientist - Agentic Interfaces
DatadogCloud Monitoring company
New York, United StatesLead
Data & AI
About the role
Define and implement evaluation strategies for AI agent integrations at Datadog.
- •Define what 'good' means for an Agentic interface at Datadog and build the measurement systems that make it true.
- •Key Responsibilities Own the evaluation strategy for Datadog's AI agent integrations.
- •Build the eval datasets, golden traces, and regression harnesses.
- •Drive measurable improvements to retrieval relevance, tool-selection accuracy, and context efficiency.
- •Requirements BS/MS/PhD in a scientific field, or equivalent experience. 10+ years of relevant engineering or applied science experience.
- •Proven track record of leading ML or GenAI initiatives in a product-driven environment.
Tech stack
PythonTensorFlowPyTorchscikit-learnNLPLLMs
Match insights
Tech:Python, TensorFlow, PyTorch, scikit-learn, NLP
Level:Lead