Abstract
In many domains that employ machine learning models, both high performing and interpretable models are needed. A typical machine learning task is text classification, where models are hardly interpretable. Topic models, used as topic embeddings, carry the potential to better understand the decisions made by text classification algorithms. With this goal in mind, we propose two new fuzzy topic models; FLSA-W and FLSA-V. Both models are derived from the topic model Fuzzy Latent Semantic Analysis (FLSA). After training each model ten times, we use the mean coherence score to compare the different models with the benchmark models Latent Dirichlet Allocation (LDA) and FLSA. Our proposed models generally lead to higher coherence scores and lower standard deviations than the benchmark models. These proposed models are specifically useful as topic embeddings in text classification, since the coherence scores do not drop for a high number of topics, as opposed to the decay that occurs with LDA and FLSA.
Original language | English |
---|---|
Title of host publication | 2021 IEEE Symposium Series on Computational Intelligence (SSCI) |
Publisher | Institute of Electrical and Electronics Engineers |
Number of pages | 8 |
ISBN (Electronic) | 978-1-7281-9048-8 |
DOIs | |
Publication status | Published - 24 Jan 2022 |
Event | IEEE Symposium Series on Computational Intelligence, IEEE SSCI 2021 - Online, Orlando, United States Duration: 5 Dec 2021 → 7 Dec 2021 |
Conference
Conference | IEEE Symposium Series on Computational Intelligence, IEEE SSCI 2021 |
---|---|
Abbreviated title | IEEE SSCI 2021 |
Country/Territory | United States |
City | Orlando |
Period | 5/12/21 → 7/12/21 |
Keywords
- Topic modeling
- text classification
- fuzzy modeling
- explainable AI
- natural language processing
- Explainable AI
- NLP
- Text Classification
- Fuzzy Modelling
- Topic Models