Start here

A short selection from the research library while the full archive grows.

  1. paper ACL BioNLP 2025 (Shared Task)

    The verbatim pipeline constrains answer content through extraction + templating.

  2. paper Preprint

    Token-level hallucination detection for RAG, trained on RAGTruth.

  3. paper Preprint

    Task-conditioned pruning for noisy coding-agent tool output.

Published work

Papers, whitepapers, and posts, listed together because the argument, code, and deployments inform each other.

  1. paper arxiv

    ACL-Verbatim: hallucination-free question answering for research

    VerbatimRAG is applied to 110K+ papers of the ACL Anthology

  2. post

    Squeez is a benchmark, model, and CLI for reducing noisy coding-agent tool output to the smallest verbatim evidence block worth keeping.

  3. paper Preprint

    Task-conditioned pruning for noisy coding-agent tool output.

  4. post

    A practical introduction to VerbatimRAG, the KR Labs approach to retrieval where answers are assembled from source spans instead of generated freely from retrieved context.

  5. post

    TinyLettuce compresses the LettuceDetect approach into smaller encoder models for deployments where latency, memory, and operating cost matter.

  6. paper ACL BioNLP 2025 (Shared Task)

    The verbatim pipeline constrains answer content through extraction + templating.

  7. post

    The multilingual LettuceDetect release extends token-level RAG hallucination detection to EuroBERT-backed models for major European languages.

  8. post

    An introduction to LettuceDetect as a token-level hallucination detection framework for retrieval-augmented generation systems.

  9. paper Preprint

    Token-level hallucination detection for RAG, trained on RAGTruth.

  10. paper Preprint

    Open-source framework for explainable information extraction.

Student research

Theses supervised by KR Labs researchers and university collaborators. They extend the same research lines into retrieval, hallucination detection, rule learning, and explainable information extraction.

2026

  1. thesis TU Wien

    Retrieval Augmented Generation: A Multi-Stage Architecture for Verbatim Financial Question Answering

    Selenge, supervised by Ádám Kovács

  2. thesis TU Wien

    Latency-Tiered Hallucination Detection: Optimizing Supervised-Unsupervised Pipelines for RAG Systems

    Rathmayr, supervised by Ádám Kovács

  3. thesis TU Wien

    Real-time Prevention of Factual Hallucinations in Retrieval-Augmented Generation

    Beccard, supervised by Ádám Kovács

  4. thesis TU Wien

    Evaluating Extraction-Based RAG: A Systematic Assessment of VerbatimRAG on the CLAPNQ Benchmark

    Kunerth, supervised by Gábor Recski

2025

  1. thesis TU Wien
  2. thesis TU Wien

    Multilingual hallucination detection for RAG applications

    Verdha, supervised by Ádám Kovács

  3. thesis TU Wien
  4. thesis TU Wien
  5. thesis TU Wien

    Rule learning for open information extraction

    Sommer, supervised by Gábor Recski

  6. thesis TU Wien

2024

  1. thesis TU Wien
  2. thesis TU Wien

Projects and collaborations

Research projects that lay the foundations of our technologies, including several collaborations with universities and the public sector. For the productised stack, see Technology.

collaboration

TU Wien Verbatim Platform

A customized Verbatim Platform deployment for TU Wien research needs, based on the open VerbatimRAG methodology. The project covers requirements analysis, data-source selection, testing, deployment, maintenance, support, and documentation.

knowledge graphs

VerbatimKG with WU SemSys

A collaboration with the WU SemSys group extending VerbatimRAG toward question answering over knowledge graphs and knowledge-graph population from trusted text sources. The project is funded by the Vienna Business Agency.

legal NER

CLEAR

A consortium project using RuleChef to support transparent anonymization of text data. The research combines rule-based and machine-learning methods for named entity recognition in German legal and public-sector text.

How to cite

BibTeX for individual papers lives on each paper page. A generic software-citation entry for the practice as a whole is below.

bibtex
@software{krlabs-2026, author = {{KR Labs}}, title = {KR Labs: open-source libraries for verifiable AI}, year = {2026}, url = {https://krlabs.eu/} }
@software{krlabs-2026,
  author = {{KR Labs}},
  title  = {KR Labs: open-source libraries for verifiable AI},
  year   = {2026},
  url    = {https://krlabs.eu/}
}

Hear when we publish or ship

Occasional notes when we release a paper, model card, repository, project update, or collaboration. A few times a year, written close to the work.

Read it, run it, cite it.

The code, the models, and the papers are open. The systems we deploy use the same architectures, and we support teams in defending them under audit.