Scalable optimization of neighbor embedding for visualization

Zhirong Yang, Jaakko Peltonen, Samuel Kaski

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    35 Citations (Scopus)

    Abstract

    Neighbor embedding (NE) methods have found their use in data visualization but are limited in big data analysis tasks due to their O(n2) complexity for n data samples. We demonstrate that the obvious approach of subsampling produces inferior results and propose a generic approximated optimization technique that reduces the NE optimization cost to O(n log n). The technique is based on realizing that in visualization the embedding space is necessarily very low-dimensional (2D or 3D), and hence efficient approximations developed for n-body force calculations can be applied. In gradient-based NE algorithms the gradient for an individual point decomposes into "forces" exerted by the other points. The contributions of close-by points need to be computed individually but far-away points can be approximated by their "center of mass", rapidly computable by applying a recursive decomposition of the visualization space into quadrants. The new algorithm brings a significant speed-up for medium-size data, and brings "big data" within reach of visualization.

    Original languageEnglish
    Title of host publication30th International Conference on Machine Learning, ICML 2013
    PublisherInternational Machine Learning Society (IMLS)
    Pages786-794
    Number of pages9
    EditionPART 1
    Publication statusPublished - 2013
    Publication typeA4 Article in conference proceedings
    Event30th International Conference on Machine Learning, ICML 2013 -
    Duration: 1 Jan 2013 → …

    Conference

    Conference30th International Conference on Machine Learning, ICML 2013
    Period1/01/13 → …

    Publication forum classification

    • Publication forum level 2

    Fingerprint

    Dive into the research topics of 'Scalable optimization of neighbor embedding for visualization'. Together they form a unique fingerprint.

    Cite this