When dialects collide: how socioeconomic mixing affects language use

Thomas Louf*, Jose J. Ramasco, David Sanchez, Marton Karsai

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract (may include machine translation)

The socioeconomic background of people and how they use standard forms of language are not independent, as demonstrated in various sociolinguistic studies. However, the extent to which these correlations may be influenced by the mixing of people from different socioeconomic classes remains relatively unexplored from a quantitative perspective. In this work we leverage geotagged tweets and transferable computational methods to map deviations from standard English across eight UK metropolitan areas. We combine these data with high-resolution income maps to assign a proxy socioeconomic indicator to home-located users. Strikingly, we find a consistent pattern suggesting that the more different socioeconomic classes mix, the less interdependent the frequency of their departures from standard grammar and their income become. Further, we propose an agent-based model of linguistic variety adoption that sheds light on the mechanisms that produce the observations seen in the data.
Original languageEnglish
Article number47
Number of pages21
JournalEPJ Data Science
Volume14
Issue number1
DOIs
StatePublished - 10 Jul 2025

Keywords

  • Agent-based modeling
  • Computational sociolinguistics
  • Dialects
  • Social media data
  • Socioeconomic status

Fingerprint

Dive into the research topics of 'When dialects collide: how socioeconomic mixing affects language use'. Together they form a unique fingerprint.

Cite this