Sabrina Spellman at SemEval-2023 Task 5: Discover the Shocking Truth Behind this Composite Approach to Clickbait Spoiling!

Simon Birkenheuer; Jonathan Drechsel; Paul Justen; Jimmy Pöhlmann; Julius Gonsior; Anja Reusch

doi:10.18653/v1/2023.semeval-1.134

Publication: Contribution to book/conference proceedings/anthology/report › Contribution to conference proceedings › Contributed › Peer-reviewed

Contributors

Simon Birkenheuer, Technische Universität Dresden (Author)
Jonathan Drechsel, Technische Universität Dresden (Author)
Paul Justen, Chair of Compiler Construction (cfaed) (Author)
Jimmy Pöhlmann, Chair of Databases (Author)
Julius Gonsior, Chair of Databases (Author)
Anja Reusch, Chair of Databases (Author)

Abstract

This paper describes an approach to automatically closing the knowledge gap of clickbait posts using a transformer model trained for question answering, augmented by a task-specific post-processing step. The work was part of the SemEval 2023 Clickbait shared task (Fröbe et al., 2023a), specifically Task 5. We devised strategies to fit the existing model to the task better, e.g. with different specialized models and a post-processor tailored to different inherent challenges of the task. Furthermore, we explored expanding the original training data using strategies from heuristic labeling and semi-supervised learning. With those adjustments, we improved on the baseline by 9.8 percentage points, reaching a BLEU-4 score of 48.0%.
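
For readers unfamiliar with the metric reported above, the following is a minimal sketch of how a BLEU-4 score can be computed for one generated spoiler against one reference. It assumes whitespace tokenization, a single reference and no smoothing; the function names `ngrams` and `bleu4` are illustrative, and this is not the shared task's official evaluation script, which may tokenize and smooth differently.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu4(candidate, reference):
    """Unsmoothed sentence-level BLEU-4 against a single reference.

    Assumption: both strings are tokenized on whitespace.
    """
    cand, ref = candidate.split(), reference.split()
    if not cand:
        return 0.0
    log_precisions = []
    for n in range(1, 5):
        cand_counts, ref_counts = ngrams(cand, n), ngrams(ref, n)
        total = sum(cand_counts.values())
        # Clipped matches: a candidate n-gram counts at most as often
        # as it occurs in the reference.
        matched = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        if total == 0 or matched == 0:
            return 0.0  # without smoothing, any zero precision gives BLEU = 0
        log_precisions.append(math.log(matched / total))
    # Brevity penalty: penalize candidates shorter than the reference.
    bp = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / len(cand))
    # Geometric mean of the 1- to 4-gram precisions, scaled by the penalty.
    return bp * math.exp(sum(log_precisions) / 4)


score = bleu4("the cat sat on the mat", "the cat sat on a mat")
```

In this sketch an exact match scores 1.0 and a candidate that shares no 4-gram with the reference scores 0.0, which is why short spoilers are hard to score well without smoothing.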

Details

Original language	English
Title	Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
Editors	Atul Kr. Ojha, A. Seza Dogruoz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
Publisher	Association for Computational Linguistics (ACL)
Pages	969-977
Number of pages	9
ISBN (electronic)	9781959429999
Publication status	Published - 2023
Peer-review status	Yes

Workshop

Title	17th International Workshop on Semantic Evaluation
Short title	SemEval 2023
Event number	17
Description	co-located with the 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023
Duration	13 - 14 July 2023
Website	https://semeval.github.io/SemEval2023/
Venue	Westin Harbour Castle & Online
City	Toronto
Country	Canada

External IDs

ORCID	/0000-0002-5985-4348/work/174432436
