TY - JOUR
T1 - Extraction of temporal networks from term co-occurrences in online textual sources
AU - Popović, Marko
AU - Štefančić, Hrvoje
AU - Sluban, Borut
AU - Novak, Petra Kralj
AU - Grčar, Miha
AU - Mozetič, Igor
AU - Puliga, Michelangelo
AU - Zlatić, Vinko
N1 - Publisher Copyright:
© 2014 Popović et al.
PY - 2014/12/3
Y1 - 2014/12/3
N2 - A stream of unstructured news can be a valuable source of hidden relations between different entities, such as financial institutions, countries, or persons. We present an approach to continuously collect online news, recognize relevant entities in them, and extract time-varying networks. The nodes of the network are the entities, and the links are their co-occurrences. We present a method to estimate the significance of co-occurrences, and a benchmark model against which their robustness is evaluated. The approach is applied to a large set of financial news, collected over a period of two years. The entities we consider are 50 countries which issue sovereign bonds, and which are insured by Credit Default Swaps (CDS) in turn. We compare the country co-occurrence networks to the CDS networks constructed from the correlations between the CDS. The results show relatively small, but significant overlap between the networks extracted from the news and those from the CDS correlations.
AB - A stream of unstructured news can be a valuable source of hidden relations between different entities, such as financial institutions, countries, or persons. We present an approach to continuously collect online news, recognize relevant entities in them, and extract time-varying networks. The nodes of the network are the entities, and the links are their co-occurrences. We present a method to estimate the significance of co-occurrences, and a benchmark model against which their robustness is evaluated. The approach is applied to a large set of financial news, collected over a period of two years. The entities we consider are 50 countries which issue sovereign bonds, and which are insured by Credit Default Swaps (CDS) in turn. We compare the country co-occurrence networks to the CDS networks constructed from the correlations between the CDS. The results show relatively small, but significant overlap between the networks extracted from the news and those from the CDS correlations.
UR - http://www.scopus.com/inward/record.url?scp=84916220779&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0099515
DO - 10.1371/journal.pone.0099515
M3 - Article
C2 - 25470498
AN - SCOPUS:84916220779
SN - 1932-6203
VL - 9
JO - PLoS ONE
JF - PLoS ONE
IS - 12
M1 - e99515
ER -