Abstract
Nonnegative matrix factorization (NMF) is a popular approach to model data, however, most models are unable to flexibly take into account multiple matrices across sources and time or apply only to integer-valued data. We introduce a probabilistic, Gaussian Process-based, more inclusive NMF-based model which jointly analyzes nonnegative data such as text data word content from multiple sources in a temporal dynamic manner. The model collectively models observed matrix data, source-wise latent variables, and their dependencies and temporal evolution with a full-fledged hierarchical approach including flexible nonparametric temporal dynamics. Experiments on simulated data and real data show the model out-performs, comparable models. A case study on social media and news demonstrates the model discovers semantically meaningful topical factors and their evolution.
Original language | English |
---|---|
Title of host publication | CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management |
Publisher | ACM |
Pages | 1035-1043 |
Number of pages | 9 |
ISBN (Electronic) | 978-1-4503-6859-9 |
DOIs | |
Publication status | Published - Oct 2020 |
Publication type | A4 Article in conference proceedings |
Event | ACM International Conference on Information & Knowledge Management - Duration: 19 Oct 2020 → 23 Oct 2020 |
Conference
Conference | ACM International Conference on Information & Knowledge Management |
---|---|
Abbreviated title | CIKM |
Period | 19/10/20 → 23/10/20 |
Publication forum classification
- Publication forum level 2