Explore Freddy: Fast word embeddings in database systems

Research output: Contribution to book/conference proceedings/anthology/reportConference contributionContributedpeer-review

Contributors

Abstract

Word embeddings encode a lot of semantic as well as syntactic features and therefore are useful in many tasks especially in Natural Language Processing and Information Retrieval. FREDDY (Fast woRd EmbedDings Database sYstems), an extended PostgreSQL database system, allowing the user to analyze structured knowledge in the database relations together with unstructured text corpora encoded as word embedding by introducing novel operations for similarity calculation and analogy inference. Approximation techniques support these operations to perform fast similarity computations on high-dimensional vector spaces. This demo allows exploring the powerful query capabilities of FREDDY on different database schemes and a variety of word embeddings generated on different text corpora. From a systems perspective, the user is able to examine the impact of multiple approximation techniques and their parameters for similarity search on query execution time and precision.

Details

Original languageEnglish
Title of host publicationDatenbanksysteme fur Business, Technologie und Web, BTW 2019 and 18. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2019
EditorsTorsten Grust, Felix Naumann, Alexander Bohm, Wolfgang Lehner, Theo Harder, Erhard Rahm, Andreas Heuer, Meike Klettke, Holger Meyer
PublisherGesellschaft fur Informatik (GI)
Pages529-532
Number of pages4
ISBN (electronic)9783885796831
Publication statusPublished - 2019
Peer-reviewedYes

Publication series

SeriesGI-Edition : lecture notes in informatics. Proceedings
VolumeP-289
ISSN1617-5468

Conference

TitleDatenbanksysteme fur Business, Technologie und Web, BTW 2019 and 18. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2019 - Database Systems for Business, Technology and Web, BTW 2019 and 18th Symposium of the GI Department "Databases and Information Systems", DBIS 2019
Duration4 - 8 March 2019
CityRostock
CountryGermany

External IDs

dblp conf/btw/GuntherTL19a
ORCID /0000-0001-8107-2775/work/142253493

Keywords

ASJC Scopus subject areas

Keywords

  • K nearest neighbor queries, Relational database, Word embeddings