CSM-SD: Methodology for contrast set mining through subgroup discovery

Petra Kralj Novak, Nada Lavrač, Dragan Gamberger, Antonija Krstačić

Research output: Contribution to journalArticlepeer-review

Abstract (may include machine translation)

This paper addresses a data analysis task, known as contrast set mining, whose goal is to find differences between contrasting groups. As a methodological novelty, it is shown that this task can be effectively solved by transforming it to a more common and well-understood subgroup discovery task. The transformation is studied in two learning settings, a one-versus-all and a pairwise contrast set mining setting, uncovering the conditions for each of the two choices. Moreover, the paper shows that the explanatory potential of discovered contrast sets can be improved by offering additional contrast set descriptors, called the supporting factors. The proposed methodology has been applied to uncover distinguishing characteristics of two groups of brain stroke patients, both with rapidly developing loss of brain function due to ischemia:those with ischemia caused by thrombosis and by embolism, respectively.

Original languageEnglish
Pages (from-to)113-122
Number of pages10
JournalJournal of Biomedical Informatics
Volume42
Issue number1
DOIs
StatePublished - Feb 2009
Externally publishedYes

Keywords

  • Brain ischemia
  • Contrast set mining
  • Descriptive rules
  • Subgroup discovery
  • Supporting factors

Fingerprint

Dive into the research topics of 'CSM-SD: Methodology for contrast set mining through subgroup discovery'. Together they form a unique fingerprint.

Cite this