Skip to content
  • Auto
  • Light
  • Dark
DiscordForumGitHubSign up
Development Tools
Testing & evals
View as Markdown
Copy Markdown

Open in Claude
Open in ChatGPT

Letta Evals

Systematic testing for stateful AI agents. Validate changes, prevent regressions, and ship with confidence.

Test agent memory, tool usage, multi-turn conversations, and state evolution with automated grading and pass/fail gates.

Understand the building blocks of evaluations:

Choose how to score your agents: