|
1 | 1 | # KeyNMF |
2 | 2 |
|
3 | | -KeyNMF is a topic model that relies on contextually sensitive embeddings for keyword retrieval and term importance estimation, |
| 3 | +KeyNMF (Kristensen-McLachlan et al., 2024) is a topic model that relies on contextually sensitive embeddings for keyword retrieval and term importance estimation, |
4 | 4 | while taking inspiration from classical matrix-decomposition approaches for extracting topics. |
5 | 5 |
|
6 | 6 | <figure> |
@@ -513,6 +513,44 @@ model = KeyNMF(10, encoder=encoder) |
513 | 513 |
|
514 | 514 | Setting the default prompt to `query` is especially important, when you are precomputing embeddings, as `query` should always be your default prompt to embed documents with. |
515 | 515 |
|
| 516 | +## Citation |
| 517 | + |
| 518 | +Please cite Turftopic and Kristensen-McLachlan et al. (2024) when using KeyNMF: |
| 519 | + |
| 520 | +```bibtex |
| 521 | +@article{ |
| 522 | + Kardos2025, |
| 523 | + title = {Turftopic: Topic Modelling with Contextual Representations from Sentence Transformers}, |
| 524 | + doi = {10.21105/joss.08183}, |
| 525 | + url = {https://doi.org/10.21105/joss.08183}, |
| 526 | + year = {2025}, |
| 527 | + publisher = {The Open Journal}, |
| 528 | + volume = {10}, |
| 529 | + number = {111}, |
| 530 | + pages = {8183}, |
| 531 | + author = {Kardos, Márton and Enevoldsen, Kenneth C. and Kostkan, Jan and Kristensen-McLachlan, Ross Deans and Rocca, Roberta}, |
| 532 | + journal = {Journal of Open Source Software} |
| 533 | +} |
| 534 | +
|
| 535 | +@inproceedings{keynmf, |
| 536 | + title = "Context is Key(NMF):: Modelling Topical Information Dynamics in Chinese Diaspora Media", |
| 537 | + abstract = "Does the People{\textquoteright}s Republic of China (PRC) interfere with European elections through ethnic Chinese diaspora media? This question forms the basis of an ongoing research project exploring how PRC narratives about European elections are represented in Chinese diaspora media, and thus the objectives of PRC news media manipulation. In order to study diaspora media ef{\"I}ciently and at scale, it is necessary to use techniques derived from quantitative text analysis, such as topic modelling. In this paper, we present a pipeline for studying information dynamics in Chinese media. Firstly, we present KeyNMF, a new approach to static and dynamic topic modelling using transformer-based contextual embedding models. We provide benchmark evaluations to demonstrate that our approach is competitive on a number of Chinese datasets and metrics. Secondly, we integrate KeyNMF with existing methods for describing information dynamics in complex systems. We apply this pipeline to data from five news sites, focusing on the period of time leading up to the 2024 European parliamentary elections. Our methods and results demonstrate the effectiveness of KeyNMF for studying information dynamics in Chinese media and lay groundwork for further work addressing the broader research questions.", |
| 538 | + keywords = "Chinese, contextual topic models, information dynamics, keywords, novelty", |
| 539 | + author = "Kristensen-McLachlan, \{Ross Deans\} and Hicke, \{Rebecca Marie Matouschek\} and M{\'a}rton Kardos and Mette Thun{\o}", |
| 540 | + year = "2024", |
| 541 | + month = dec, |
| 542 | + language = "English", |
| 543 | + volume = "3834", |
| 544 | + series = "CEUR Workshop Proceedings", |
| 545 | + publisher = "CEUR-WS", |
| 546 | + pages = "829--847", |
| 547 | + editor = "Haverals, \{Wouter \} and Koolen, \{Marijn \} and Thompson, \{Laure \}", |
| 548 | + booktitle = "Proceedings of the Computational Humanities Research Conference 2024", |
| 549 | + address = "Germany", |
| 550 | +} |
| 551 | +``` |
| 552 | + |
| 553 | + |
516 | 554 | ## API Reference |
517 | 555 |
|
518 | 556 | ::: turftopic.models.keynmf.KeyNMF |
0 commit comments