Paths to Causality: Finding Informative Subgraphs Within Knowledge Graphs for Knowledge-Based Causal Discovery
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Abstract
Inferring causal relationships between variable pairs is crucial for understanding multivariate interactions in complex systems. Knowledge-based causal discovery, which involves inferring causal relationships by reasoning over the metadata of variables (e.g., names or textual context), offers a compelling alternative to traditional methods that rely on observational data. However, existing methods using Large Language Models (LLMs) often produce unstable and inconsistent results, compromising their reliability for causal inference. To address this, we introduce a novel approach that integrates Knowledge Graphs (KGs) with LLMs to enhance knowledge-based causal discovery. Our approach identifies informative metapath-based subgraphs within KGs and further refines the selection of these subgraphs using Learning-to-Rank-based models. The top-ranked subgraphs are then incorporated into zero-shot prompts, improving the effectiveness of LLMs in inferring the causal relationship. Extensive experiments on biomedical and open-domain datasets demonstrate that our method outperforms most baselines by up to 44.4 points in F1 scores, evaluated across diverse LLMs and KGs. Our code and datasets are available on GitHub: https://github.com/susantiyuni/path-to-causality
Details
| Original language | English |
|---|---|
| Title of host publication | KDD 2025 - Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining |
| Publisher | Association for Computing Machinery |
| Pages | 2778-2789 |
| Number of pages | 12 |
| ISBN (electronic) | 979-8-4007-1454-2 |
| Publication status | Published - 3 Aug 2025 |
| Peer-reviewed | Yes |
Conference
| Title | 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining |
|---|---|
| Abbreviated title | KDD 2025 |
| Conference number | 31 |
| Duration | 3 - 7 August 2025 |
| Location | Toronto Convention Centre |
| City | Toronto |
| Country | Canada |
External IDs
| ORCID | /0000-0001-5458-8645/work/193180546 |
|---|---|
Keywords
- causal discovery, knowledge graphs, large language models