Beyond Boundaries: A Human-like Approach for Question Answering over Structured and Unstructured Information Sources

Jens Lehmann; Dhananjay Bhandiwad; Preetam Gattogi; Sahar Vahdati

doi:10.1162/tacl_a_00671

Research output: Contribution to journal › Research article › Contributed › Peer-reviewed

Contributors

Jens Lehmann, Amazon Development Center Germany GmbH, Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig (Author)
Dhananjay Bhandiwad, Department of Distributed and Data Intensive Computing (VDR), Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI Dresden) (Author)
Preetam Gattogi, Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI Dresden), Department of Distributed and Data Intensive Computing (VDR) (Author)
Sahar Vahdati, Department of Distributed and Data Intensive Computing (VDR), Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI Dresden) (Author)

Abstract

Answering factual questions from heterogeneous sources, such as graphs and text, is a key capacity of intelligent systems. Current approaches either (i) perform question answering over text and structured sources as separate pipelines followed by a merge step or (ii) provide an early integration, giving up the strengths of particular information sources. To solve this problem, we present "HumanIQ", a method that teaches language models to dynamically combine retrieved information by imitating how humans use retrieval tools. Our approach couples a generic method for gathering human demonstrations of tool use with adaptive few-shot learning for tool-augmented models. We show that HumanIQ confers significant benefits, including (i) reducing the error rate of our strongest baseline (GPT-4) by over 50% across 3 benchmarks, (ii) improving human preference over responses from vanilla GPT-4 (45.3% wins, 46.7% ties, 8.0% losses), and (iii) outperforming numerous task-specific baselines.

Details

Original language	English
Pages (from - to)	786-802
Number of pages	17
Journal	Transactions of the Association for Computational Linguistics
Volume	12
Publication status	Published - 11 June 2024
Peer-reviewed	Yes

External IDs

ORCID	/0000-0001-7047-3813/work/191041793

TU Dresden Research Portal
