Comparing and Improving Active Learning Uncertainty Measures for Transformer Models

Research output: Contribution to book/conference proceedings/anthology/report (conference contribution, contributed, peer-reviewed)

Contributors

  • Julius Gonsior, Chair of Databases, TUD Dresden University of Technology (Author)
  • Christian Falkenberg, TUD Dresden University of Technology (Author)
  • Silvio Magino, TUD Dresden University of Technology (Author)
  • Anja Reusch, Chair of Databases, TUD Dresden University of Technology (Author)
  • Claudio Hartmann, Chair of Databases, TUD Dresden University of Technology (Author)
  • Maik Thiele, Dresden University of Applied Sciences (HTW) (Author)
  • Wolfgang Lehner, Chair of Databases, TUD Dresden University of Technology (Author)

Abstract

Despite achieving state-of-the-art results in nearly all Natural Language Processing applications, fine-tuning Transformer-encoder based language models still requires a significant amount of labeled data to achieve satisfactory results. A well-known technique to reduce the human effort of acquiring a labeled dataset is Active Learning (AL): an iterative process in which only the minimal number of samples is labeled. AL strategies require access to a quantified confidence measure of the model's predictions; a common choice is the softmax activation function of the final Neural Network layer. In this paper, we compare eight alternatives on seven datasets and show that the softmax function provides misleading probabilities. Our finding is that most of the methods primarily identify hard-to-learn-from samples (outliers) instead of samples that reduce the uncertainty of the learned language model, resulting in worse-than-random performance. As a solution, this paper proposes a heuristic to systematically exclude such samples, which improves various methods compared to the softmax function.
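To make the setup concrete, the following is a minimal sketch of softmax-based least-confidence uncertainty sampling combined with an exclusion heuristic of the kind the abstract describes. It is an illustrative assumption, not the authors' implementation: the function names, the least-confidence score, and the fixed exclusion count `n_exclude` are all hypothetical.

```python
# Sketch of uncertainty sampling with an outlier-exclusion heuristic.
# Assumptions (not from the paper): least-confidence as the uncertainty
# measure, and skipping a fixed number of the most uncertain samples
# on the hypothesis that they are hard-to-learn-from outliers.
import numpy as np

def softmax(logits: np.ndarray) -> np.ndarray:
    """Row-wise softmax over class logits (numerically stabilized)."""
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def select_batch(logits: np.ndarray, batch_size: int, n_exclude: int = 0) -> np.ndarray:
    """Return indices of the `batch_size` most uncertain unlabeled samples,
    after skipping the `n_exclude` most uncertain ones (suspected outliers)."""
    probs = softmax(logits)
    uncertainty = 1.0 - probs.max(axis=1)             # least-confidence score
    ranked = np.argsort(-uncertainty)                 # most uncertain first
    return ranked[n_exclude:n_exclude + batch_size]   # clip suspected outliers

# Example: 6 unlabeled samples, 3 classes; skip the single most uncertain
# sample, then query labels for the next 2.
rng = np.random.default_rng(0)
print(select_batch(rng.normal(size=(6, 3)), batch_size=2, n_exclude=1))
```

In each AL iteration, the selected indices would be labeled by a human annotator, moved to the training set, and the model fine-tuned before the next query round.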

Details

Original language: English
Title of host publication: Advances in Databases and Information Systems - 27th European Conference, ADBIS 2023, Proceedings
Editors: Alberto Abelló, Oscar Romero, Panos Vassiliadis, Robert Wrembel
Publisher: Springer Science and Business Media B.V.
Pages: 119-132
Number of pages: 14
ISBN (print): 9783031429132
Publication status: Published - 2023
Peer-reviewed: Yes

Publication series

Series: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13985 LNCS
ISSN: 0302-9743

Conference

Title: 27th European Conference on Advances in Databases and Information Systems, ADBIS 2023
Duration: 4 - 7 September 2023
City: Barcelona
Country: Spain

Keywords

  • Active Learning, Calibration, Deep Neural Networks, Softmax, Transformer, Uncertainty