What is the OldVis project?
The OldVis project is a digital library dedicated to old visualizations. The digital library is built by assembling information related to old visualizations scattered in existing digital libraries such as David Rumsey Map Collection and Internet Archive. It has indexed over 13K old visualizations published before 1950.
Components
The OldVis project consists of several components:
Repository | Description |
---|---|
gallery | An online gallery of old visualizations (details). |
oldvis_dataset | A Python package to download old visualizations (details). |
dataset | Data curation scripts in Python and the built dataset (details). |
libquery | A Python package to query digital libraries (details). |
libquery_extensions | A Python package that extends libquery (details). |
libprocess | A Python package to process query result of libquery (details). |
libprocess_extensions | A Python package to process query result of libquery_extensions (details). |
image classification labeler | An interactive data labeling system to label image classification. |
image segmentation labeler | An interactive data labeling system to label image segmentation. |
docs | This documentation website. |
This documentation provides an overview of the project implementation. Detailed documentation can be found in each repository.
Stack
The stack of the OldVis project consists of four layers (from bottom to top):
- Data Collection: fetch metadata and images with libquery/libquery_extensions and process metadata with libprocess/libprocess_extensions.
- Data Labeling: label images with image classification labeler and image segmentation labeler systems.
- Dataset: the data curation scripts and built dataset and the dataset downloader oldvis_dataset.
- Gallery: the metadata and image gallery.
Copyright
Notice
We do not hold the copyright of the images of old visualizations indexed by the OldVis project. We have stored the rights information obtained from the data sources when available. When the rights information requires attribution (e.g., when the image is licensed under CC BY-NC-SA 3.0), you should attribute the original data source instead of us. If you want to redistribute the images, you may check whether the images are in the public domain according to copyright lengths and regulations in their country.