Talmud-IR: A Talmud-Inspired Interface for Discussing RAG Response Quality

Wojciech Kusa; Niklas Deckers; Maik Fröbe; Laura Dietz; Birte Platow; Mark Sanderson

doi:10.1007/978-3-032-21321-1_21

Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review

Contributors

Wojciech Kusa, Research and Academic Computer Network (Author)
Niklas Deckers, University of Kassel (Author)
Maik Fröbe, Friedrich Schiller University Jena (Author)
Laura Dietz, University of New Hampshire (Author)
Birte Platow, Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI Dresden), Chair of Religious Education (Protestant) (Author)
Mark Sanderson, Royal Melbourne Institute of Technology University (Author)

Abstract

Retrieval-augmented generation (RAG) systems promise factually grounded answers, yet evaluating their quality remains difficult. Automated metrics and LLM-as-judge approaches offer scalability but risk circularity, benchmark leakage, and loss of diversity. Human assessors, meanwhile, often struggle to notice subtle omissions or hallucinations when responses appear linguistically fluent and confident. We present Talmud-IR, a novel user interface inspired by the dialogic structure of the Talmud. It visualizes RAG outputs as a central text surrounded by layers of evidence, commentary, and meta-assessment, enabling sustained human–LLM discussion about system quality and failure priorities. The prototype supports comparative RAG evaluation, collaborative exploration of “unknown unknowns,” and pedagogical use for teaching critical reading of AI-generated content.

Code and prototype: https://github.com/WojciechKusa/talmud-ir

Details

Original language	English
Title of host publication	Advances in Information Retrieval
Editors	Ricardo Campos, Adam Jatowt, Yanyan Lan, Mohammad Aliannejadi, Christine Bauer, Sean MacAvaney, Avishek Anand, Nan Bai, Masoud Mansoury, Zhaochun Ren, Suzan Verberne
Publisher	Springer Science and Business Media B.V.
Pages	148-153
Number of pages	6
ISBN (electronic)	978-3-032-21321-1
ISBN (print)	978-3-032-21320-4
Publication status	Published - Mar 2026
Peer-reviewed	Yes

Publication series

Series	Lecture Notes in Computer Science
Volume	16486 LNCS
ISSN	0302-9743

Conference

Title	48th European Conference on Information Retrieval
Abbreviated title	ECIR 2026
Conference number	48
Duration	29 March - 2 April 2026
Website	https://ecir2026.eu/
Location	Lijm & Cultuur
City	Delft
Country	Netherlands

Keywords

Exploratory Evaluation, LLM judge, RAG

Research Portal of the TU Dresden
