Efficient mining of concept-hierarchy aware distinguishing sequential patterns

Chengxin He, Lei Duan, Guozhu Dong, Jyrki Nummenmaa, Tingting Wang, Tinghai Pang

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Distinguishing sequential patterns are sequential patterns that have much higher frequencies in one target group of sequences (concerning a given phenomenon of interest) than in a contrasting group of sequences. Distinguishing sequential patterns are useful for many machine learning tasks, as well as for the explanation and characterization of the phenomenon underlying the target group of sequences. However, previous studies on mining distinguishing sequential patterns did not consider the hierarchical relationship among elements in sequences. To fill the gap, this paper investigates the mining of distinguishing sequential patterns in the presence of concept hierarchies among sequence elements. The associated patterns will be called concept-hierarchy aware distinguishing sequential patterns (hDSPs). After presenting the challenges on mining hDSPs, we present hDSP-Miner, a method with effective pruning techniques, for mining hDSPs. Our empirical study using real-world protein sequences demonstrates that hDSP-Miner is effective and efficient, and it can discover more novel distinguishing sequential patterns than previous algorithms for mining distinguishing sequential patterns.

Original languageEnglish
Article number109710
JournalKnowledge-Based Systems
Volume255
DOIs
Publication statusPublished - 14 Nov 2022
Publication typeA1 Journal article-refereed

Keywords

  • Concept hierarchy
  • Contrast mining
  • Distinguishing sequential pattern
  • Domain knowledge

Publication forum classification

  • Publication forum level 1

ASJC Scopus subject areas

  • Software
  • Management Information Systems
  • Information Systems and Management
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Efficient mining of concept-hierarchy aware distinguishing sequential patterns'. Together they form a unique fingerprint.

Cite this