Enhancing Text Classification in Natural Language Processing: A Comparative Study of Transformer Models and the Potential of Few-Shot Learning

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Abstract

This research focuses on enhancing machine comprehension in multilingual Natural Language Processing (NLP) environments. Despite advancements in pre-trained models, creating custom models still requires more resources. To overcome this, the study explores FewShot Learning (FSL), a Meta-Learning approach inspired by human learning efficiency, asserting that machines can adeptly learn from minimal examples and task descriptions. The methodology unfolds in two dimensions: practical application and theoretical exploration. In practical terms, Python constructs models using pre-trained frameworks—BERT, DistilBERT, ELECTRA, and MiniLM. These Few-Shot models undergo meticulous evaluation for efficiency and accuracy, especially in processing unseen data with minimal support for contextual understanding. Simultaneously, the theoretical facet involves a comprehensive literature review, shedding light on FSL's effectiveness in diverse NLP contexts. Preliminary findings suggest FSL's promising role in addressing multilingual challenges in NLP, acknowledging limitations in complex linguistic scenarios. This study offers valuable insights into FSL's practical applications and limitations, laying the foundation for future investigations. The nuanced exploration contributes to a balanced understanding of FSL's applicability, potentially guiding the development of more advanced and resource-efficient NLP models.

Details

Original languageEnglish
Title of host publicationApplied Artificial Intelligence
EditorsRavindra Hegadi, Gaurav Gupta, KC Santosh
PublisherSpringer Science and Business Media B.V.
Pages366-380
Number of pages15
ISBN (electronic)978-3-032-00793-3
ISBN (print)978-3-032-00792-6
Publication statusPublished - 2026
Peer-reviewedYes

Publication series

SeriesCommunications in Computer and Information Science
Volume2621 CCIS
ISSN1865-0929

Conference

Title1st International Conference on Applied Artificial Intelligence
Abbreviated title2AI 2024
Conference number1
Duration2 - 4 July 2024
LocationShoolini University
CitySolan
CountryIndia

External IDs

ORCID /0000-0001-5272-9811/work/215835607

Keywords

Keywords

  • Artificial Intelligence, Few-shot learning (FSL), Large Language Model (LLM), Natural Language Model (NLP)