Process / pipelineData collection

Longitudinal Web Scraping — Repeated Automated Collection of Web Data Over Time

Longitudinal web scraping is a data collection technique that uses automated scripts to extract content from websites at multiple, predefined time points. By revisiting the same web sources repeatedly, researchers build a time-series dataset that captures how online content, prices, discourse, or behavior evolves. It is widely used in computational social science, economics, political science, health research, and digital humanities to study change without relying on retrospective self-report.

Find Topic with PaperMindSoonVideoSoon

Read the full method

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Salganik, M. J. (2018). Bit by Bit: Social Research in the Digital Age. Princeton University Press. ISBN: 978-0691158648
  2. Luscombe, A., Dick, K., & Walby, K. (2022). Algorithmic thinking in the public interest: navigating technical, legal, and ethical challenges in government web scraping. Quality & Quantity, 56(3), 1781–1802. DOI: 10.1007/s11135-021-01164-0

Related methods

ScholarGateLongitudinal Web Scraping (Longitudinal Web Scraping for Research). Retrieved 2026-06-04 from https://scholargate.app/en/survey-methodology/longitudinal-web-scraping