Semantic data mining of financial news articles

Anže Vavpetič, Petra Kralj Novak, Miha Grčar, Igor Mozetič, Nada Lavrač

Research output: Contribution to Book/Report types; Conference contribution; peer-review

Abstract

Subgroup discovery aims at constructing symbolic rules that describe statistically interesting subsets of instances with a chosen property of interest. Semantic subgroup discovery extends standard subgroup discovery approaches by exploiting ontological concepts in rule construction. Compared to the previously developed semantic data mining systems SDM-SEGS and SDM-Aleph, this paper presents Hedwig, a general-purpose semantic subgroup discovery system that takes as input training examples encoded in RDF and constructs relational rules by effective top-down search of ontologies, also encoded as RDF triples. The effectiveness of the system is demonstrated through an application in the financial domain, with the goal of analyzing financial news in search of interesting vocabulary patterns that reflect credit default swap (CDS) trend reversal for financially troubled countries. The approach is showcased by analyzing over 8 million news articles collected over a period of eighteen months. The paper exemplifies the results by showing rules that reflect interesting news topics characterizing Portugal CDS trend reversal, expressed as conjunctions of terms describing concepts at different levels of the concept hierarchy.

Original language: English
Title of host publication: Discovery Science - 16th International Conference, DS 2013, Proceedings
Publisher: Springer Verlag
Pages: 294-307
Number of pages: 14
ISBN (Print): 9783642408960
State: Published - 2013
Externally published: Yes
Event: 16th International Conference on Discovery Science, DS 2013 - Singapore, Singapore
Duration: 6 Oct 2013 to 9 Oct 2013

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 8140 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 16th International Conference on Discovery Science, DS 2013
Country/Territory: Singapore
City: Singapore
Period: 6/10/13 to 9/10/13

Keywords

  • credit default swap
  • financial crisis
  • ontology
  • semantic data mining
  • subgroup discovery
