Materializing Knowledge Bases via Trigger Graphs

Research output: Contribution to journalResearch articleContributedpeer-review

Contributors

Abstract

The chase is a well-established family of algorithms used to materialize Knowledge Bases (KBs) for tasks like query answering under dependencies or data cleaning. A general problem of chase algorithms is that they might perform redundant computations. To counter this problem, we introduce the notion of Trigger Graphs (TGs), which guide the execution of the rules avoiding redundant computations. We present the results of an extensive theoretical and empirical study that seeks to answer when and how TGs can be computed and what are the benefits of TGs when applied over real-world KBs. Our results include introducing algorithms that compute (minimal) TGs. We implemented our approach in a new engine, called GLog, and our experiments show that it can be significantly more efficient than the chase enabling us to materialize Knowledge Graphs with 17B facts in less than 40 min using a single machine with commodity hardware.

Details

Original languageEnglish
Pages (from-to)943–956
Number of pages14
JournalProceedings of the VLDB Endowment
Volume14
Issue number6
Publication statusPublished - 12 Apr 2021
Peer-reviewedYes

External IDs

Scopus 85102660603

Keywords

Library keywords