Generative Modeling for Maximizing Precision and Recall in Information Visualization

Jaakko Peltonen, Samuel Kaski

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    5 Citations (Scopus)

    Abstract

    Information visualization has recently been formulated as an information retrieval problem, where the goal is to find similar data points based on the visualized nonlinear projection, and the visualization is optimized to maximize a compromise between (smoothed) precision and recall. We turn the visualization into a generative modeling task where a simple user model parameterized by the data coordinates is optimized, neighborhood relations are the observed data, and straightforward maximum likelihood estimation corresponds to Stochastic Neighbor Embedding (SNE). While SNE maximizes pure recall, adding a mixture component that “explains away” misses allows our generative model to focus on maximizing precision as well. The resulting model is a generative solution to maximizing tradeoffs between precision and recall. The model outperforms earlier models in terms of precision and recall and in external validation by unsupervised classification.
    Original languageEnglish
    Title of host publicationProceedings of AISTATS 2011, the 14th International Conference on Artificial Intelligence and Statistics
    PublisherJMLR
    Pages579-587
    Number of pages9
    Volume15
    Publication statusPublished - 2011
    Publication typeA4 Article in conference proceedings

    Publication forum classification

    • No publication forum level

    Fingerprint

    Dive into the research topics of 'Generative Modeling for Maximizing Precision and Recall in Information Visualization'. Together they form a unique fingerprint.

    Cite this