Research & Insights

Start here

A short selection from the research library while the full archive grows.

paper ACL BioNLP 2025 (Shared Task)
1 August 2025

KR Labs at ArchEHR-QA 2025: A Verbatim Approach for Evidence-Based Question Answering

Ádám Kovács, Paul Schmitt, Gábor Recski

The verbatim pipeline constrains answer content through extraction + templating.
paper Preprint
24 February 2025

LettuceDetect: token-level hallucination detection for RAG outputs

Ádám Kovács, Gábor Recski

Token-level hallucination detection for RAG, trained on RAGTruth.
paper Preprint
4 April 2026

Squeez: task-conditioned tool-output pruning for coding agents

Ádám Kovács

Task-conditioned pruning for noisy coding-agent tool output.

Published work

Papers, whitepapers, and posts, listed together because the argument, code, and deployments inform each other.

post
11 June 2026

ACL-Verbatim: Hallucination-Free Question Answering for NLP Researchers

Gábor Recski

Intro to ACL-Verbatim, a benchmark and a pipeline for hallucination-free question answering over the ACL Anthology, based on the VerbatimRAG architecture.
paper arxiv
20 May 2026

ACL-Verbatim: hallucination-free question answering for research

Gábor Recski, Szilveszter Tóth, Nadia Verdha, István Boros, Ádám Kovács

VerbatimRAG is applied to 110K+ papers of the ACL Anthology
post
10 April 2026

Squeez: task-conditioned tool-output pruning for coding agents

Ádám Kovács

Squeez is a benchmark, model, and CLI for reducing noisy coding-agent tool output to the smallest verbatim evidence block worth keeping.
paper Preprint
4 April 2026

Squeez: task-conditioned tool-output pruning for coding agents

Ádám Kovács

Task-conditioned pruning for noisy coding-agent tool output.
post
18 November 2025

Build hallucination-free RAG with Verbatim

Ádám Kovács

A practical introduction to VerbatimRAG, the KR Labs approach to retrieval where answers are assembled from source spans instead of generated freely from retrieved context.
post
31 August 2025

TinyLettuce: small hallucination detectors for RAG

Ádám Kovács

TinyLettuce compresses the LettuceDetect approach into smaller encoder models for deployments where latency, memory, and operating cost matter.
paper ACL BioNLP 2025 (Shared Task)
1 August 2025

KR Labs at ArchEHR-QA 2025: A Verbatim Approach for Evidence-Based Question Answering

Ádám Kovács, Paul Schmitt, Gábor Recski

The verbatim pipeline constrains answer content through extraction + templating.
post
19 May 2025

LettuceDetect goes multilingual: EuroBERT models on Hugging Face

Ádám Kovács

The multilingual LettuceDetect release extends token-level RAG hallucination detection to EuroBERT-backed models for major European languages.
post
28 February 2025

LettuceDetect: a framework for RAG hallucination detection

Ádám Kovács

An introduction to LettuceDetect as a token-level hallucination detection framework for retrieval-augmented generation systems.
paper Preprint
24 February 2025

LettuceDetect: token-level hallucination detection for RAG outputs

Ádám Kovács, Gábor Recski

Token-level hallucination detection for RAG, trained on RAGTruth.
paper Preprint
31 January 2022

POTATO: exPlainable infOrmation exTrAcTion framewOrk

Ádám Kovács, Gábor Recski

Open-source framework for explainable information extraction.

Student research

Theses supervised by KR Labs researchers and university collaborators. They extend the same research lines into retrieval, hallucination detection, rule learning, and explainable information extraction.

2026

thesis TU Wien

Retrieval Augmented Generation: A Multi-Stage Architecture for Verbatim Financial Question Answering

Selenge, supervised by Ádám Kovács
thesis TU Wien

Latency-Tiered Hallucination Detection: Optimizing Supervised-Unsupervised Pipelines for RAG Systems

Rathmayr, supervised by Ádám Kovács
thesis TU Wien

Real-time Prevention of Factual Hallucinations in Retrieval-Augmented Generation

Beccard, supervised by Ádám Kovács
thesis TU Wien

Evaluating Extraction-Based RAG: A Systematic Assessment of VerbatimRAG on the CLAPNQ Benchmark

Kunerth, supervised by Gábor Recski

2025

thesis TU Wien

Symbolic natural language inference for German open information extraction

Ristic, supervised by Gábor Recski
thesis TU Wien

Multilingual hallucination detection for RAG applications

Verdha, supervised by Ádám Kovács
thesis TU Wien

Large language model-based framework for open information extraction, triplet matching, and text comparison

Csakvari, supervised by Ádám Kovács
thesis TU Wien

Open information extraction for fact-checking large language models

Osmanaj, supervised by Ádám Kovács
thesis TU Wien

Rule learning for open information extraction

Sommer, supervised by Gábor Recski
thesis TU Wien

Rule-based open information extraction from German legal domain

Iszak, supervised by Gábor Recski

2024

thesis TU Wien

Advanced pattern matching in graph-based relation extraction: a methodical approach to improving XAI NLP systems

Piwonka, supervised by Gábor Recski
thesis TU Wien

Evaluating LIME-based explanations of relation extraction models

Beham, supervised by Gábor Recski

Projects and collaborations

Research projects that lay the foundations of our technologies, including several collaborations with universities and the public sector. For the productised stack, see Technology.

collaboration

TU Wien Verbatim Platform

A customized Verbatim Platform deployment for TU Wien research needs, based on the open VerbatimRAG methodology. The project covers requirements analysis, data-source selection, testing, deployment, maintenance, support, and documentation.

knowledge graphs

VerbatimKG with WU SemSys

A collaboration with the WU SemSys group extending VerbatimRAG toward question answering over knowledge graphs and knowledge-graph population from trusted text sources. The project is funded by the Vienna Business Agency.

legal NER

CLEAR

A consortium project using RuleChef to support transparent anonymization of text data. The research combines rule-based and machine-learning methods for named entity recognition in German legal and public-sector text.

How to cite

BibTeX for individual papers lives on each paper page. A generic software-citation entry for the practice as a whole is below.

bibtex

@software{krlabs-2026, author = {{KR Labs}}, title = {KR Labs: open-source libraries for verifiable AI}, year = {2026}, url = {https://krlabs.eu/} }

@software{krlabs-2026,
  author = {{KR Labs}},
  title  = {KR Labs: open-source libraries for verifiable AI},
  year   = {2026},
  url    = {https://krlabs.eu/}
}

Occasional notes when we release a paper, model card, repository, project update, or collaboration. A few times a year, written close to the work.

Read it, run it, cite it.

The code, the models, and the papers are open. The systems we deploy use the same architectures, and we support teams in defending them under audit.

Read the VerbatimRAG paper Explore the technology stack

2026

Retrieval Augmented Generation: A Multi-Stage Architecture for Verbatim Financial Question Answering

Latency-Tiered Hallucination Detection: Optimizing Supervised-Unsupervised Pipelines for RAG Systems

Real-time Prevention of Factual Hallucinations in Retrieval-Augmented Generation

Evaluating Extraction-Based RAG: A Systematic Assessment of VerbatimRAG on the CLAPNQ Benchmark