Talmud-IR: A Talmud-Inspired Interface for Discussing RAG Response Quality
Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung
Beitragende
Abstract
Retrieval-augmented generation (RAG) systems promise factually grounded answers, yet evaluating their quality remains difficult. Automated metrics and LLM-as-judge approaches offer scalability but risk circularity, benchmark leakage, and loss of diversity. Human assessors, meanwhile, often struggle to notice subtle omissions or hallucinations when responses appear linguistically fluent and confident. We present Talmud-IR, a novel user interface inspired by the dialogic structure of the Talmud. It visualizes RAG outputs as a central text surrounded by layers of evidence, commentary, and meta-assessment, enabling sustained human–LLM discussion about system quality and failure priorities. The prototype supports comparative RAG evaluation, collaborative exploration of “unknown unknowns,” and pedagogical use for teaching critical reading of AI-generated content. Code and Prototype: https://github.com/WojciechKusa/talmud-ir
Details
| Originalsprache | Englisch |
|---|---|
| Titel | Advances in Information Retrieval |
| Redakteure/-innen | Ricardo Campos, Adam Jatowt, Yanyan Lan, Mohammad Aliannejadi, Christine Bauer, Sean MacAvaney, Avishek Anand, Nan Bai, Masoud Mansoury, Zhaochun Ren, Suzan Verberne |
| Herausgeber (Verlag) | Springer Science and Business Media B.V. |
| Seiten | 148-153 |
| Seitenumfang | 6 |
| ISBN (elektronisch) | 978-3-032-21321-1 |
| ISBN (Print) | 978-3-032-21320-4 |
| Publikationsstatus | Veröffentlicht - März 2026 |
| Peer-Review-Status | Ja |
Publikationsreihe
| Reihe | Lecture Notes in Computer Science |
|---|---|
| Band | 16486 LNCS |
| ISSN | 0302-9743 |
Konferenz
| Titel | 48th European Conference on Information Retrieval |
|---|---|
| Kurztitel | ECIR 2026 |
| Veranstaltungsnummer | 48 |
| Dauer | 29 März - 2 April 2026 |
| Webseite | |
| Ort | Lijm & Cultuur |
| Stadt | Delft |
| Land | Niederlande |
Schlagworte
ASJC Scopus Sachgebiete
Schlagwörter
- Exploratory Evaluation, LLM judge, RAG