$ whoami evalforge — Creator
@evalforge
Prompt Eval Kit · v0.9.4
Design, run, and score prompt evaluations with variance-aware benchmarks and regression tracking.
Evals Runner · v2.0.5
Build, run, and report on LLM evals. Pairwise comparisons, judges, regression detection.
Are you EvalForge? Claim this profile.