stars: 2.9k
forks: 98
last update: Apr 27, 2026
license: MIT
version: v2.0.5
// Drops SKILL.md into ~/.claude/skills/
$ claude skills add evals-runner
// Run from any project directory
$ claude --skill evals-runner "help me ship this"
// Re-run with edits; Claude keeps the skill loaded
$ claude --skill evals-runner "now refactor it"

Build, run, and report on LLM evals. Pairwise comparisons, judges, regression detection.
$ cat reviews/