Skip to content
On this page

Dataset

Using libquery and libquery_extensions (see querier), we query the data sources, filter the images, and process the metadata. This way, we construct a dataset of 13K old visualization. The data processing scripts and the constructed dataset can be found in this repository.

The dataset we have constructed can be downloaded with our Python package oldvis_dataset (see downloader).

Data Sources

As of Jun 20, 2023, our dataset consists of old visualization from seven data sources. The number of old visualizations we obtained from each data source is listed in the following table.

Data Source#Entries
David Rumsey Map Collection7816
Internet Archive2985
Gallica2090
Telefact225
Library of Congress212
British Library132
Alabama Maps51