Fast kernel matrix computation for big data clustering

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    11 Citations (Scopus)

    Abstract

    Kernel k-Means is a basis for many state of the art global clustering approaches. When the number of samples grows too big, however, it is extremely time-consuming to compute the entire kernel matrix and it is impossible to store it in the memory of a single computer. The algorithm of Approximate Kernel k-Means has been proposed, which works using only a small part of the kernel matrix. The computation of the kernel matrix, even a part of it, remains a significant bottleneck of the process. Some types of kernel, however, can be computed using matrix multiplication. Modern CPU architectures and computational optimization methods allow for very fast matrix multiplication, thus those types of kernel matrices can be computed much faster than others.

    Original languageEnglish
    Title of host publicationProcedia Computer Science
    PublisherElsevier
    Pages2445-2452
    Number of pages8
    Volume51
    Edition1
    DOIs
    Publication statusPublished - 2015
    Publication typeA4 Article in conference proceedings
    EventInternational Conference on Computational Science, ICCS 2002 - Amsterdam, Netherlands
    Duration: 21 Apr 200224 Apr 2002

    Conference

    ConferenceInternational Conference on Computational Science, ICCS 2002
    Country/TerritoryNetherlands
    CityAmsterdam
    Period21/04/0224/04/02

    Keywords

    • Approximate
    • Big data
    • Clustering
    • Kernel k-Means
    • Kernel matrix

    ASJC Scopus subject areas

    • General Computer Science

    Fingerprint

    Dive into the research topics of 'Fast kernel matrix computation for big data clustering'. Together they form a unique fingerprint.

    Cite this