A nationwide dataset of de-identified activity spaces derived from geotagged social media data

Ate Poorthuis, Qingqing Chen, Matthew Zook

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

In this article, we present a historical dataset of activity spaces, originally based on publicly posted and geotagged social media sent within the United States from 2012 to 2019. The dataset, which contains approximately 2 million users and 1.2 billion data points, is de-identified and spatially aggregated to enable ethical and broad sharing across the research community. By publishing the dataset, we hope to help researchers to quickly access and filter data to study people’s activity spaces across a range of places. In this article, we first describe the construction and characteristics of this dataset and then highlight certain limitations of the data through an illustrative analysis of potential bias—an important consideration when using data not collected through representative sampling. Our goal is to empower researchers to create novel, insightful research projects of their own design based on this dataset.

Original languageEnglish
Pages (from-to)2264-2275
Number of pages12
JournalEnvironment and Planning B: Urban Analytics and City Science
Volume51
Issue number9
DOIs
StatePublished - Nov 2024

Bibliographical note

Publisher Copyright:
© The Author(s) 2024.

Keywords

  • Activity spaces
  • geotagged social media
  • human mobility
  • location-based services

ASJC Scopus subject areas

  • Geography, Planning and Development
  • Architecture
  • Urban Studies
  • Nature and Landscape Conservation
  • Management, Monitoring, Policy and Law

Fingerprint

Dive into the research topics of 'A nationwide dataset of de-identified activity spaces derived from geotagged social media data'. Together they form a unique fingerprint.

Cite this