capability

Medical RAG — groundedness & abstention

RAG · medical literature · zero-hallucination
framework · ragas
author · Vincent
cert · AG-26-0142
Verification report

Catches confident fabrication with fake citations. Scores groundedness, citation accuracy, and whether the agent abstains when evidence is missing.
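
As a rough illustration of two of the signals named above, the sketch below scores citation accuracy and abstention for a single response. It is not this pack's scoring code and not the ragas API. The function names, the document IDs, and the rule that an answer with no citations scores zero are all assumptions made for the example.

```python
def citation_accuracy(cited_ids: list[str], retrieved_ids: set[str]) -> float:
    """Fraction of cited source IDs that actually appear in the retrieved set.

    Assumption: an answer with no citations scores 0.0, since nothing
    in it can be checked against the evidence.
    """
    if not cited_ids:
        return 0.0
    supported = sum(1 for cid in cited_ids if cid in retrieved_ids)
    return supported / len(cited_ids)


def abstention_correct(abstained: bool, evidence_available: bool) -> bool:
    """The agent should abstain exactly when no supporting evidence exists."""
    return abstained == (not evidence_available)


# Hypothetical example: "doc-9" is cited but was never retrieved,
# which is the fake-citation pattern the description refers to.
retrieved = {"doc-1", "doc-2"}
accuracy = citation_accuracy(["doc-1", "doc-9"], retrieved)
print(accuracy)  # 0.5

# Evidence is missing and the agent answers anyway: abstention check fails.
print(abstention_correct(abstained=False, evidence_available=False))  # False
```

Groundedness itself is not covered here; checking whether each claim is supported by the retrieved text usually needs a claim-level comparison that this sketch does not attempt.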

No data leakage · 0.98
Ungameable · 0.95
Deterministic · 0.99
Discriminating power · 0.97
Standard coverage · 0.90
Discriminating power · reference panel
Reference agent · Known quality · Pack score
Grounded-RAG-ref · good · 0.94
Loose-RAG-ref · broken · 0.41
Fabricator-ref · sabotaged · 0.07

A good pack scores the known-good agent high and the sabotaged one near zero. That gap is the evidence the meter works.
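
The gap can be checked directly from the reference panel scores. The sketch below is a minimal illustration: the scores are copied from the panel, while the function names and the ordering check are assumptions. The report does not say how the 0.97 discriminating-power figure is derived, so this does not attempt to reproduce it.

```python
# Pack scores copied from the reference panel: agent -> (known quality, score).
PANEL = {
    "Grounded-RAG-ref": ("good", 0.94),
    "Loose-RAG-ref": ("broken", 0.41),
    "Fabricator-ref": ("sabotaged", 0.07),
}

# Assumed ordering of the quality labels, best first.
QUALITY_ORDER = ["good", "broken", "sabotaged"]


def scores_by_quality(panel):
    """Map each known-quality label to its pack score."""
    return {quality: score for quality, score in panel.values()}


def panel_gap(panel):
    """Score difference between the known-good and the sabotaged agent."""
    scores = scores_by_quality(panel)
    return scores["good"] - scores["sabotaged"]


def ranks_correctly(panel):
    """True if scores strictly decrease from good to broken to sabotaged."""
    scores = scores_by_quality(panel)
    ordered = [scores[q] for q in QUALITY_ORDER]
    return all(a > b for a, b in zip(ordered, ordered[1:]))


gap = panel_gap(PANEL)
ordered = ranks_correctly(PANEL)
print(f"gap={gap:.2f} ordered={ordered}")  # gap=0.87 ordered=True
```

A pack that scored all three agents within a few points of each other would show a gap near zero, which is the failure this panel is meant to expose.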

Guide · How to eval a RAG agent's groundedness
Benchmark · RAG agents benchmark
