Unsupervised Pretraining of Echo State Networks for Onset Detection

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/GutachtenBeitrag in Buch/Sammelband/GutachtenBeigetragenBegutachtung

Beitragende

Abstract

Note onset detection – the detection of the beginning of new note events – is a fundamental task for music analysis that can help to improve Automatic Music Transcription (AMT). The method for onset detection always follows a similar outline: An audio signal is transformed into an Onset Detection Function (ODF), which should have rather low values (i.e. close to zero) for most of the time, and pronounced peaks at onset times, which can then be extracted by applying peak picking algorithms on the ODF. Currently, Recurrent Neural Networks (RNNs) and Convolutional Neural Networks (CNNs) define the state of the art. In this paper, we build upon previous work about onset detection using Echo State Networks (ESNs) that have achieved comparable results to CNNs. We show that unsupervised pre-training of the ESN leads to similar results whilst reducing the model complexity.

Details

OriginalspracheEnglisch
Titel Lecture Notes in Computer Science
Seitenumfang12
PublikationsstatusVeröffentlicht - 7 Sept. 2021
Peer-Review-StatusJa

Externe IDs

Scopus 85115665352
ORCID /0000-0003-0167-8123/work/167214851

Schlagworte

Schlagwörter

  • Clustering, Echo State Networks, Note onset detection