Multi-modal subspace learning with dropout regularization for cross-modal recognition and retrieval

Guanqun Cao, Muhammad Adeel Waris, Alexandros Iosifidis, Moncef Gabbouj

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    3 Citations (Scopus)

    Abstract

    There has been a surge of efforts in cross-modal recognition and retrieval in recent multimedia research. Towards this goal, we investigate a multi-modal subspace learning algorithm together with the Dropout regularizer. Inspired by the regularization for neural networks, we propose to aritificially remove the effect of certain amount of feature bins using the probabilistic approach to prevent the linear subspace learning from over-fitting. The novel regularizer is well integrated into the multi-modal learning algorithm which maximizes the between-class scatter while minimizing the within-class scatter in the projected latent space. The new objective function can be solved efficiently as the generalized eigenvalue problem. Experimental results have shown that superior performance can be obtained in both face-sketch recognition and cross-modal retrieval applications.
    Original languageEnglish
    Title of host publication2016 6th International Conference on Image Processing Theory, Tools and Applications (IPTA)
    PublisherIEEE
    Pages1-6
    Number of pages6
    ISBN (Electronic)978-1-4673-8910-5
    ISBN (Print)978-1-4673-8911-2
    DOIs
    Publication statusPublished - Dec 2016
    Publication typeA4 Article in conference proceedings
    EventInternational Conference on Image Processing Theory, Tools and Applications -
    Duration: 1 Jan 1900 → …

    Publication series

    Name
    ISSN (Electronic)2154-512X

    Conference

    ConferenceInternational Conference on Image Processing Theory, Tools and Applications
    Period1/01/00 → …

    Keywords

    • Eigenvalues and eigenfunctions
    • Face recognition
    • Feature extraction
    • Image retrieval
    • Learning systems
    • Linear programming
    • Multimedia communication
    • cross-modal retrieval
    • face-sketch recognition
    • subspace learning

    Publication forum classification

    • Publication forum level 1

    Fingerprint

    Dive into the research topics of 'Multi-modal subspace learning with dropout regularization for cross-modal recognition and retrieval'. Together they form a unique fingerprint.

    Cite this