ImitAL: Learned Active Learning Strategy on Synthetic Data

Julius Gonsior; Maik Thiele; Wolfgang Lehner

doi:10.1007/978-3-031-18840-4_4

ImitAL: Learned Active Learning Strategy on Synthetic Data

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung

Beitragende

Julius Gonsior - , Professur für Datenbanken (Autor:in)
Maik Thiele - , Hochschule für Technik und Wirtschaft (HTW) Dresden (Autor:in)
Wolfgang Lehner - , Professur für Datenbanken (Autor:in)

Abstract

Active Learning (AL) is a well-known standard method for efficiently obtaining annotated data by first labeling the samples that contain the most information based on a query strategy. In the past, a large variety of such query strategies has been proposed, with each generation of new strategies increasing the runtime and adding more complexity. However, to the best of our knowledge, none of these strategies excels consistently over a large number of datasets from different application domains. Basically, most of the existing AL strategies are a combination of the two simple heuristics informativeness and representativeness, and the big differences lie in the combination of the often conflicting heuristics. Within this paper, we propose ImitAL, a domain-independent novel query strategy, which encodes AL as a learning-to-rank problem and learns an optimal combination between both heuristics. We train ImitAL on large-scale simulated AL runs on purely synthetic datasets. To show that ImitAL was successfully trained, we perform an extensive evaluation comparing our strategy on 13 different datasets, from a wide range of domains, with 7 other query strategies.

Details

Originalsprache	Englisch
Titel	Discovery Science
Redakteure/-innen	Poncelet Pascal, Dino Ienco
Herausgeber (Verlag)	Springer Science and Business Media B.V.
Seiten	47-56
Seitenumfang	10
ISBN (Print)	978-3-031-18839-8
Publikationsstatus	Veröffentlicht - 2022
Peer-Review-Status	Ja

Publikationsreihe

Reihe	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band	13601 LNAI
ISSN	0302-9743

Konferenz

Titel	25th International Conference on Discovery Science, DS 2022
Dauer	10 - 12 Oktober 2022
Stadt	Montpellier
Land	Frankreich

Externe IDs

ORCID	/0000-0001-8107-2775/work/174431842
ORCID	/0000-0002-5985-4348/work/174432435

Schlagworte

ASJC Scopus Sachgebiete

Schlagwörter

Active learning, Annotation, Imitation learning, Learning to rank

Forschungsportal der TU Dresden

ImitAL: Learned Active Learning Strategy on Synthetic Data

Beitragende

Abstract

Details

Publikationsreihe

Konferenz

Externe IDs

Schlagworte

ASJC Scopus Sachgebiete

Schlagwörter

Verknüpfte Inhalte

ImitAL: Learned Active Learning Strategy on Synthetic Data