Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-Based Causal Discovery
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
Causal discovery aims to estimate causal structures among variables based on observational data. Large Language Models (LLMs) offer a fresh perspective to tackle the causal discovery problem by reasoning on the metadata associated with variables rather than their actual data values, an approach referred to as knowledge-based causal discovery. In this paper, we investigate the capabilities of Small Language Models (SLMs, defined as LLMs with fewer than 1 billion parameters) with prompt-based learning for knowledge-based causal discovery. Specifically, we present “KG Structure as Prompt”, a novel approach for integrating structural information from a knowledge graph, such as common neighbor nodes and metapaths, into prompt-based learning to enhance the capabilities of SLMs. Experimental results on three types of biomedical and open-domain datasets under few-shot settings demonstrate the effectiveness of our approach, surpassing most baselines and even conventional fine-tuning approaches trained on full datasets. Our findings further highlight the strong capabilities of SLMs: in combination with knowledge graphs and prompt-based learning, SLMs demonstrate the potential to surpass LLMs with larger number of parameters. Our code and datasets are available on GitHub.1(https://github.com/littleflow3r/kg-structure-as-prompt)
Details
| Original language | English |
|---|---|
| Title of host publication | The Semantic Web |
| Editors | Gianluca Demartini, Katja Hose, Maribel Acosta, Matteo Palmonari, Gong Cheng, Hala Skaf-Molli, Nicolas Ferranti, Daniel Hernández, Aidan Hogan |
| Publisher | Springer Science and Business Media B.V. |
| Pages | 87-106 |
| Number of pages | 20 |
| ISBN (electronic) | 978-3-031-77844-5 |
| ISBN (print) | 978-3-031-77843-8 |
| Publication status | Published - Jun 2024 |
| Peer-reviewed | Yes |
Publication series
| Series | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 15231 LNCS |
| ISSN | 0302-9743 |
Conference
| Title | 23rd International Semantic Web Conference |
|---|---|
| Abbreviated title | ISWC 2024 |
| Conference number | 23 |
| Duration | 11 - 15 November 2024 |
| Website | |
| Location | Live! Casino & Hotel Maryland |
| City | Baltimore |
| Country | United States of America |
External IDs
| ORCID | /0000-0001-5458-8645/work/193180541 |
|---|
Keywords
Research priority areas of TU Dresden
ASJC Scopus subject areas
Keywords
- causal relation, knowledge graph, language model