Enhancing Text Classification in Natural Language Processing: A Comparative Study of Transformer Models and the Potential of Few-Shot Learning

Syed Hur Abbas; Gitanjali Wadhwa; Sandra Hummel

doi:10.1007/978-3-032-00793-3_29

Enhancing Text Classification in Natural Language Processing: A Comparative Study of Transformer Models and the Potential of Few-Shot Learning

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung

Beitragende

Syed Hur Abbas - , Abteilung Responsible AI (Autor:in)
Gitanjali Wadhwa - , Abteilung Responsible AI (Autor:in)
Sandra Hummel - , Abteilung Responsible AI, Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI Dresden) (Autor:in)

Abstract

This research focuses on enhancing machine comprehension in multilingual Natural Language Processing (NLP) environments. Despite advancements in pre-trained models, creating custom models still requires more resources. To overcome this, the study explores FewShot Learning (FSL), a Meta-Learning approach inspired by human learning efficiency, asserting that machines can adeptly learn from minimal examples and task descriptions. The methodology unfolds in two dimensions: practical application and theoretical exploration. In practical terms, Python constructs models using pre-trained frameworks—BERT, DistilBERT, ELECTRA, and MiniLM. These Few-Shot models undergo meticulous evaluation for efficiency and accuracy, especially in processing unseen data with minimal support for contextual understanding. Simultaneously, the theoretical facet involves a comprehensive literature review, shedding light on FSL's effectiveness in diverse NLP contexts. Preliminary findings suggest FSL's promising role in addressing multilingual challenges in NLP, acknowledging limitations in complex linguistic scenarios. This study offers valuable insights into FSL's practical applications and limitations, laying the foundation for future investigations. The nuanced exploration contributes to a balanced understanding of FSL's applicability, potentially guiding the development of more advanced and resource-efficient NLP models.

Details

Originalsprache	Englisch
Titel	Applied Artificial Intelligence
Redakteure/-innen	Ravindra Hegadi, Gaurav Gupta, KC Santosh
Herausgeber (Verlag)	Springer Science and Business Media B.V.
Seiten	366-380
Seitenumfang	15
ISBN (elektronisch)	978-3-032-00793-3
ISBN (Print)	978-3-032-00792-6
Publikationsstatus	Veröffentlicht - 2026
Peer-Review-Status	Ja

Publikationsreihe

Reihe	Communications in Computer and Information Science
Band	2621 CCIS
ISSN	1865-0929

Konferenz

Titel	1st International Conference on Applied Artificial Intelligence
Kurztitel	2AI 2024
Veranstaltungsnummer	1
Dauer	2 - 4 Juli 2024
Ort	Shoolini University
Stadt	Solan
Land	Indien

Externe IDs

ORCID	/0000-0001-5272-9811/work/215835607

Schlagworte

ASJC Scopus Sachgebiete

Schlagwörter

Artificial Intelligence, Few-shot learning (FSL), Large Language Model (LLM), Natural Language Model (NLP)

Forschungsportal der TU Dresden