A new approach to estimate neighborhood socioeconomic status using supermarket transactions and GNNs

Eduardo Cruz*, Monica Villavicencio, Carmen Vaca, Lisette Espín-Noboa, Nervo Verdezoto

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract (may include machine translation)

Ending poverty in all its forms everywhere remains the number one Sustainable Development Goal of the United Nations 2030 Agenda. Governments face challenges in measuring socioeconomic status with fine spatial resolution because traditional data collection methods, such as censuses and surveys, are time-consuming, labor-intensive, performed at long intervals, and cover only a limited population. This work is a data-driven study to analyze the digital traces left by humans in supermarket transactions and model the relationship between consumption behavior and the average per capita income, proposing a proxy to estimate socioeconomic status at the urban neighborhood level. We analyze more than 20 million supermarket shopping transactions in Guayaquil, the most populated city in Ecuador. Using customer consumption data, we created a basket graph and fed it into a graph neural network to predict neighborhood socioeconomic status. The model was trained with spectral and spatial convolutional filters using cross-validation to select the best approach for the prediction. The results show that the Chebyshev spectral convolutional filter has the highest predictive power to predict the socioeconomic status of the neighborhood, with R2=0.91. Our proposed approach contributes to measuring socioeconomic status at the neighborhood level to support policymakers in making informed decisions about resource allocation according to the needs of different geographical areas.

Original languageEnglish
Article number3
JournalEPJ Data Science
Volume14
Issue number1
DOIs
StatePublished - 17 Jan 2025

Keywords

  • Basket graph
  • Graph neural network
  • Item embedding
  • Neighborhood socioeconomic status
  • Per capita income
  • Spectral convolutional filter

Fingerprint

Dive into the research topics of 'A new approach to estimate neighborhood socioeconomic status using supermarket transactions and GNNs'. Together they form a unique fingerprint.

Cite this