paper

KR Labs at ArchEHR-QA 2025: A Verbatim Approach for Evidence-Based Question Answering

Ádám Kovács, Paul Schmitt, Gábor Recski ACL BioNLP 2025 (Shared Task) 2025-08-01 architecture

Abstract

Retrieval-augmented generation systems remain prone to fabrication. This paper presents a lightweight, domain-agnostic verbatim pipeline for evidence-grounded question answering: an extractor selects relevant source sentences, and an LLM drafts a question-specific template that is filled verbatim with extracted evidence. In the ArchEHR-QA 2025 shared task, the system scored 42.01%, ranked top-10 in core metrics, and outperformed the organiser's 70B Llama-3.3 baseline.

TL;DR

The verbatim pipeline constrains answer content through extraction + templating.
The paper reports a 42.01% ArchEHR-QA score, top-10 in core metrics.
The system outperformed the organiser's 70B Llama-3.3 baseline.
Open-source library at github.com/KRLabsOrg/verbatim-rag.

Resources

Open the PDF Open on arXiv Open the repo Open on Hugging Face

For the full text, citation list, and supplementary material, follow the paper link above. The library, models, and examples live in the open-source repository.

Cite

@inproceedings{kovacs-etal-2025-kr,
  title     = {{KR} Labs at {A}rch{EHR}-{QA} 2025: A Verbatim Approach for Evidence-Based Question Answering},
  author    = {Kovacs, Adam and Schmitt, Paul and Recski, Gabor},
  booktitle = {Proceedings of the 24th Workshop on Biomedical Language Processing (Shared Tasks)},
  year      = {2025},
  address   = {Vienna, Austria},
  publisher = {Association for Computational Linguistics},
  pages     = {69--74},
  url       = {https://aclanthology.org/2025.bionlp-share.8/},
  doi       = {10.18653/v1/2025.bionlp-share.8}
}

LettuceDetect: token-level hallucination detection for RAG outputs

Open the repo Subscribe to research updates

Abstract

TL;DR

Resources

Cite

Related