Dataset
Using libquery and libquery_extensions (see querier), we query the data sources, filter the images, and process the metadata. This way, we construct a dataset of 13K old visualization. The data processing scripts and the constructed dataset can be found in this repository.
The dataset we have constructed can be downloaded with our Python package oldvis_dataset (see downloader).
Data Sources
As of Jun 20, 2023, our dataset consists of old visualization from seven data sources. The number of old visualizations we obtained from each data source is listed in the following table.
Data Source | #Entries |
---|---|
David Rumsey Map Collection | 7816 |
Internet Archive | 2985 |
Gallica | 2090 |
Telefact | 225 |
Library of Congress | 212 |
British Library | 132 |
Alabama Maps | 51 |