Enhancing Text Classification in Natural Language Processing: A Comparative Study of Transformer Models and the Potential of Few-Shot Learning
Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung
Beitragende
Abstract
This research focuses on enhancing machine comprehension in multilingual Natural Language Processing (NLP) environments. Despite advancements in pre-trained models, creating custom models still requires more resources. To overcome this, the study explores FewShot Learning (FSL), a Meta-Learning approach inspired by human learning efficiency, asserting that machines can adeptly learn from minimal examples and task descriptions. The methodology unfolds in two dimensions: practical application and theoretical exploration. In practical terms, Python constructs models using pre-trained frameworks—BERT, DistilBERT, ELECTRA, and MiniLM. These Few-Shot models undergo meticulous evaluation for efficiency and accuracy, especially in processing unseen data with minimal support for contextual understanding. Simultaneously, the theoretical facet involves a comprehensive literature review, shedding light on FSL's effectiveness in diverse NLP contexts. Preliminary findings suggest FSL's promising role in addressing multilingual challenges in NLP, acknowledging limitations in complex linguistic scenarios. This study offers valuable insights into FSL's practical applications and limitations, laying the foundation for future investigations. The nuanced exploration contributes to a balanced understanding of FSL's applicability, potentially guiding the development of more advanced and resource-efficient NLP models.
Details
| Originalsprache | Englisch |
|---|---|
| Titel | Applied Artificial Intelligence |
| Redakteure/-innen | Ravindra Hegadi, Gaurav Gupta, KC Santosh |
| Herausgeber (Verlag) | Springer Science and Business Media B.V. |
| Seiten | 366-380 |
| Seitenumfang | 15 |
| ISBN (elektronisch) | 978-3-032-00793-3 |
| ISBN (Print) | 978-3-032-00792-6 |
| Publikationsstatus | Veröffentlicht - 2026 |
| Peer-Review-Status | Ja |
Publikationsreihe
| Reihe | Communications in Computer and Information Science |
|---|---|
| Band | 2621 CCIS |
| ISSN | 1865-0929 |
Konferenz
| Titel | 1st International Conference on Applied Artificial Intelligence |
|---|---|
| Kurztitel | 2AI 2024 |
| Veranstaltungsnummer | 1 |
| Dauer | 2 - 4 Juli 2024 |
| Ort | Shoolini University |
| Stadt | Solan |
| Land | Indien |
Externe IDs
| ORCID | /0000-0001-5272-9811/work/215835607 |
|---|
Schlagworte
ASJC Scopus Sachgebiete
Schlagwörter
- Artificial Intelligence, Few-shot learning (FSL), Large Language Model (LLM), Natural Language Model (NLP)