Σύγκριση μεθόδων
Εξετάστε τις επιλεγμένες μεθόδους δίπλα-δίπλα· οι γραμμές που διαφέρουν επισημαίνονται.
| Απομακρυσμένη συλλογή δεδομένων από τον Ιστό (Remote Web Scraping)× | Απόξεση Ιστού (Web Scraping)× | |
|---|---|---|
| Πεδίο | Μεθοδολογία Επισκοπήσεων | Μεθοδολογία Επισκοπήσεων |
| Οικογένεια | Process / pipeline | Process / pipeline |
| Έτος προέλευσης≠ | 2000s–2010s (cloud infrastructure era) | Late 1990s–2000s |
| Δημιουργός≠ | Distributed computing and web automation communities | Early internet practitioners; systematised in research contexts from the late 1990s onward |
| Τύπος≠ | Automated remote data collection technique | Automated digital data collection technique |
| Θεμελιώδης πηγή | Mitchell, R. (2018). Web Scraping with Python: Collecting More Data from the Modern Web (2nd ed.). O'Reilly Media. ISBN: 978-1491985571 | Mitchell, R. (2018). Web Scraping with Python: Collecting More Data from the Modern Web (2nd ed.). O'Reilly Media. ISBN: 978-1491985571 |
| Εναλλακτικές ονομασίες | cloud web scraping, server-side scraping, remote automated data extraction, distributed web scraping | web harvesting, screen scraping, web crawling, automated data extraction |
| Συναφείς≠ | 3 | 5 |
| Σύνοψη≠ | Remote web scraping is a data collection approach in which automated scripts or bots harvest publicly accessible web content — text, tables, metadata, or links — running on remote servers or cloud infrastructure rather than on the researcher's local machine. This separation allows continuous, large-scale, or geographically distributed crawling that local setups cannot sustain, making it particularly suited to longitudinal or high-volume data collection tasks. | Web scraping is a computational data collection technique in which software automatically retrieves and extracts structured or semi-structured content from websites. Widely used in social science, computational linguistics, economics, and information science, it enables researchers to assemble large datasets from publicly accessible web sources — such as news archives, social media platforms, government portals, and online marketplaces — that would be impractical to collect manually. |
| ScholarGateΣύνολο δεδομένων ↗ |
|
|