Enhancing Text Classification in Natural Language Processing: A Comparative Study of Transformer Models and the Potential of Few-Shot Learning
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
This research focuses on enhancing machine comprehension in multilingual Natural Language Processing (NLP) environments. Despite advancements in pre-trained models, creating custom models still requires more resources. To overcome this, the study explores FewShot Learning (FSL), a Meta-Learning approach inspired by human learning efficiency, asserting that machines can adeptly learn from minimal examples and task descriptions. The methodology unfolds in two dimensions: practical application and theoretical exploration. In practical terms, Python constructs models using pre-trained frameworks—BERT, DistilBERT, ELECTRA, and MiniLM. These Few-Shot models undergo meticulous evaluation for efficiency and accuracy, especially in processing unseen data with minimal support for contextual understanding. Simultaneously, the theoretical facet involves a comprehensive literature review, shedding light on FSL's effectiveness in diverse NLP contexts. Preliminary findings suggest FSL's promising role in addressing multilingual challenges in NLP, acknowledging limitations in complex linguistic scenarios. This study offers valuable insights into FSL's practical applications and limitations, laying the foundation for future investigations. The nuanced exploration contributes to a balanced understanding of FSL's applicability, potentially guiding the development of more advanced and resource-efficient NLP models.
Details
| Original language | English |
|---|---|
| Title of host publication | Applied Artificial Intelligence |
| Editors | Ravindra Hegadi, Gaurav Gupta, KC Santosh |
| Publisher | Springer Science and Business Media B.V. |
| Pages | 366-380 |
| Number of pages | 15 |
| ISBN (electronic) | 978-3-032-00793-3 |
| ISBN (print) | 978-3-032-00792-6 |
| Publication status | Published - 2026 |
| Peer-reviewed | Yes |
Publication series
| Series | Communications in Computer and Information Science |
|---|---|
| Volume | 2621 CCIS |
| ISSN | 1865-0929 |
Conference
| Title | 1st International Conference on Applied Artificial Intelligence |
|---|---|
| Abbreviated title | 2AI 2024 |
| Conference number | 1 |
| Duration | 2 - 4 July 2024 |
| Location | Shoolini University |
| City | Solan |
| Country | India |
External IDs
| ORCID | /0000-0001-5272-9811/work/215835607 |
|---|
Keywords
ASJC Scopus subject areas
Keywords
- Artificial Intelligence, Few-shot learning (FSL), Large Language Model (LLM), Natural Language Model (NLP)