Talmud-IR: A Talmud-Inspired Interface for Discussing RAG Response Quality

Wojciech Kusa; Niklas Deckers; Maik Fröbe; Laura Dietz; Birte Platow; Mark Sanderson

doi:10.1007/978-3-032-21321-1_21

Talmud-IR: A Talmud-Inspired Interface for Discussing RAG Response Quality

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung

Beitragende

Wojciech Kusa - , Research and Academic Computer Network (Autor:in)
Niklas Deckers - , Universität Kassel (Autor:in)
Maik Fröbe - , Friedrich-Schiller-Universität Jena (Autor:in)
Laura Dietz - , University of New Hampshire (Autor:in)
Birte Platow - , Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI Dresden), Professur für Religionspädagogik (evangelisch) (Autor:in)
Mark Sanderson - , Royal Melbourne Institute of Technology University (Autor:in)

Abstract

Retrieval-augmented generation (RAG) systems promise factually grounded answers, yet evaluating their quality remains difficult. Automated metrics and LLM-as-judge approaches offer scalability but risk circularity, benchmark leakage, and loss of diversity. Human assessors, meanwhile, often struggle to notice subtle omissions or hallucinations when responses appear linguistically fluent and confident. We present Talmud-IR, a novel user interface inspired by the dialogic structure of the Talmud. It visualizes RAG outputs as a central text surrounded by layers of evidence, commentary, and meta-assessment, enabling sustained human–LLM discussion about system quality and failure priorities. The prototype supports comparative RAG evaluation, collaborative exploration of “unknown unknowns,” and pedagogical use for teaching critical reading of AI-generated content. Code and Prototype: https://github.com/WojciechKusa/talmud-ir

Details

Originalsprache	Englisch
Titel	Advances in Information Retrieval
Redakteure/-innen	Ricardo Campos, Adam Jatowt, Yanyan Lan, Mohammad Aliannejadi, Christine Bauer, Sean MacAvaney, Avishek Anand, Nan Bai, Masoud Mansoury, Zhaochun Ren, Suzan Verberne
Herausgeber (Verlag)	Springer Science and Business Media B.V.
Seiten	148-153
Seitenumfang	6
ISBN (elektronisch)	978-3-032-21321-1
ISBN (Print)	978-3-032-21320-4
Publikationsstatus	Veröffentlicht - März 2026
Peer-Review-Status	Ja

Publikationsreihe

Reihe	Lecture Notes in Computer Science
Band	16486 LNCS
ISSN	0302-9743

Konferenz

Titel	48th European Conference on Information Retrieval
Kurztitel	ECIR 2026
Veranstaltungsnummer	48
Dauer	29 März - 2 April 2026
Webseite	https://ecir2026.eu/
Ort	Lijm & Cultuur
Stadt	Delft
Land	Niederlande

Schlagworte

ASJC Scopus Sachgebiete

Schlagwörter

Exploratory Evaluation, LLM judge, RAG

Forschungsportal der TU Dresden