Publication: Automated Knowledge Synthesis: An LLM-refined Framework for Evolutionary Topic Modeling
| dc.contributor.author | Amer Abu-Jassar | |
| dc.contributor.author | Mohammad Hamdan | |
| dc.contributor.author | Nowfal Aweisi | |
| dc.contributor.author | Slisareko, R. | |
| dc.contributor.author | Deineko, Z. | |
| dc.contributor.author | Lyashenko, V. | |
| dc.date.accessioned | 2026-05-06T12:31:53Z | |
| dc.date.issued | 2026 | |
| dc.description.abstract | Traditional topic modeling methods, such as Latent Dirichlet Allocation (LDA) and Non-negative Matrix Factorization (NMF), are limited by their context ignorance, static nature, and low interpretability. Building upon the hybrid approach LDA+NMF+class-based Term Frequency–Inverse Document Frequency (c-TF-IDF), a new formalized framework – Dynamic Contextual Topic Modeling with Large Language Model (LLM) Refinement (DCTM-LLM) – is presented. This LLM-refined framework integrates transformer embeddings for the detection of dynamic semantic clusters and leverages an LLM for their subsequent refinement and the synthesis of high-level narratives. Experiments on a corpus of 35,000 arXiv abstracts (cs.AI (Computer Science: Artificial Intelligence), 2015–2025) showed that DCTM-LLM achieves a Normalized Pointwise Mutual Information (NPMI) of 0.53, a Silhouette score of 0.62, an Adjusted Rand Index (ARI) of 0.55, and Topic Diversity at 10 of 0.88. Crucially, with a Bidirectional Encoder Representations from Transformers (BERT)-based score (BERTScore) F1 of 0.89, the method significantly outperforms Dynamic BERTopic (0.62) and the hybrid LDA, NMF, and c-TF-IDF approach (0.65). Thus, the proposed approach shifts the paradigm of topic modeling from keyword extraction toward automated knowledge synthesis. | |
| dc.identifier.citation | Automated Knowledge Synthesis: An LLM-refined Framework for Evolutionary Topic Modeling / Amer Abu-Jassar, Mohammad Hamdan, Nowfal Aweisi, R. Slisareko, Z. Deineko, V. Lyashenko // International Journal of Intelligent Engineering and Systems. 2026. Vol. 19, No. 4. P. 215-230. DOI: 10.22266/ijies2026.0430.13. | |
| dc.identifier.doi | https://doi.org/10.22266/ijies2026.0430.13 | |
| dc.identifier.issn | 2185-310X | |
| dc.identifier.uri | https://openarchive.nure.ua/handle/document/34418 | |
| dc.language.iso | en_US | |
| dc.publisher | INASS | |
| dc.subject | Large language model | |
| dc.subject | Narrative synthesis | |
| dc.title | Automated Knowledge Synthesis: An LLM-refined Framework for Evolutionary Topic Modeling | |
| dc.type | Article | |
| dspace.entity.type | Publication |
Files
Original bundle
- Name:
- Lyash-INASS-2026.pdf
- Size:
- 886.96 KB
- Format:
- Adobe Portable Document Format
License bundle
- Name:
- license.txt
- Size:
- 10.74 KB
- Format:
- Item-specific license agreed upon to submission
- Description: