Enhancing Text Classification in Natural Language Processing: A Comparative Study of Transformer Models and the Potential of Few-Shot Learning

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/GutachtenBeitrag in KonferenzbandBeigetragenBegutachtung

Abstract

This research focuses on enhancing machine comprehension in multilingual Natural Language Processing (NLP) environments. Despite advancements in pre-trained models, creating custom models still requires more resources. To overcome this, the study explores FewShot Learning (FSL), a Meta-Learning approach inspired by human learning efficiency, asserting that machines can adeptly learn from minimal examples and task descriptions. The methodology unfolds in two dimensions: practical application and theoretical exploration. In practical terms, Python constructs models using pre-trained frameworks—BERT, DistilBERT, ELECTRA, and MiniLM. These Few-Shot models undergo meticulous evaluation for efficiency and accuracy, especially in processing unseen data with minimal support for contextual understanding. Simultaneously, the theoretical facet involves a comprehensive literature review, shedding light on FSL's effectiveness in diverse NLP contexts. Preliminary findings suggest FSL's promising role in addressing multilingual challenges in NLP, acknowledging limitations in complex linguistic scenarios. This study offers valuable insights into FSL's practical applications and limitations, laying the foundation for future investigations. The nuanced exploration contributes to a balanced understanding of FSL's applicability, potentially guiding the development of more advanced and resource-efficient NLP models.

Details

OriginalspracheEnglisch
TitelApplied Artificial Intelligence
Redakteure/-innenRavindra Hegadi, Gaurav Gupta, KC Santosh
Herausgeber (Verlag)Springer Science and Business Media B.V.
Seiten366-380
Seitenumfang15
ISBN (elektronisch)978-3-032-00793-3
ISBN (Print)978-3-032-00792-6
PublikationsstatusVeröffentlicht - 2026
Peer-Review-StatusJa

Publikationsreihe

ReiheCommunications in Computer and Information Science
Band2621 CCIS
ISSN1865-0929

Konferenz

Titel1st International Conference on Applied Artificial Intelligence
Kurztitel2AI 2024
Veranstaltungsnummer1
Dauer2 - 4 Juli 2024
OrtShoolini University
StadtSolan
LandIndien

Externe IDs

ORCID /0000-0001-5272-9811/work/215835607

Schlagworte

Schlagwörter

  • Artificial Intelligence, Few-shot learning (FSL), Large Language Model (LLM), Natural Language Model (NLP)