Process / pipelineData collection
Longitudinal Web Scraping — Repeated Automated Collection of Web Data Over Time
Longitudinal web scraping is a data collection technique that uses automated scripts to extract content from websites at multiple, predefined time points. By revisiting the same web sources repeatedly, researchers build a time-series dataset that captures how online content, prices, discourse, or behavior evolves. It is widely used in computational social science, economics, political science, health research, and digital humanities to study change without relying on retrospective self-report.
Find Topic with PaperMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Salganik, M. J. (2018). Bit by Bit: Social Research in the Digital Age. Princeton University Press. ISBN: 978-0691158648
- Luscombe, A., Dick, K., & Walby, K. (2022). Algorithmic thinking in the public interest: navigating technical, legal, and ethical challenges in government web scraping. Quality & Quantity, 56(3), 1781–1802. DOI: 10.1007/s11135-021-01164-0 ↗