MVDA: a multi-view genomic data integration methodology

Angela Serra, Michele Fratello, Vittorio Fortino, Giancarlo Raiconi, Roberto Tagliaferri, Dario Greco

Research output: Contribution to journalArticleScientificpeer-review

59 Citations (Scopus)

Abstract

BACKGROUND: Multiple high-throughput molecular profiling by omics technologies can be collected for the same individuals. Combining these data, rather than exploiting them separately, can significantly increase the power of clinically relevant patients subclassifications.

RESULTS: We propose a multi-view approach in which the information from different data layers (views) is integrated at the levels of the results of each single view clustering iterations. It works by factorizing the membership matrices in a late integration manner. We evaluated the effectiveness and the performance of our method on six multi-view cancer datasets. In all the cases, we found patient sub-classes with statistical significance, identifying novel sub-groups previously not emphasized in literature. Our method performed better as compared to other multi-view clustering algorithms and, unlike other existing methods, it is able to quantify the contribution of single views on the final results.

CONCLUSION: Our observations suggest that integration of prior information with genomic features in the subtyping analysis is an effective strategy in identifying disease subgroups. The methodology is implemented in R and the source code is available online at http://neuronelab.unisa.it/a-multi-view-genomic-data-integration-methodology/ .

Original languageEnglish
Pages (from-to)261
JournalBMC Bioinformatics
Volume16
DOIs
Publication statusPublished - 19 Aug 2015
Externally publishedYes
Publication typeA1 Journal article-refereed

Keywords

  • Algorithms
  • Cluster Analysis
  • Genomics/methods
  • MicroRNAs/genetics
  • Sequence Analysis, RNA

Fingerprint

Dive into the research topics of 'MVDA: a multi-view genomic data integration methodology'. Together they form a unique fingerprint.

Cite this