What is the OldVis project?
The OldVis project is a digital collection dedicated to historical visualizations. The digital collection is built by assembling information related to old visualizations, which are scattered across existing digital libraries such as the David Rumsey Map Collection and the Internet Archive. As of August 17, 2023, it has indexed over 13,000 historical visualizations published before 1950.
Mission
Historical visualizations have played crucial roles in science and communication. For example, John Snow's Cholera Map (1854) laid the foundation for epidemiology. These visualizations represent important milestones in the history of data visualization.
While historical visualizations are valuable records that offer insights into how data was communicated and interpreted across different eras, they have remained scattered across various digital libraries, making systematic study difficult. The OldVis project aims to address this gap in digital humanities and visualization research: the lack of a large-scale, dedicated collection of historical visualizations.
By creating a comprehensive digital collection, the OldVis project aims to:
- Gather scattered historical visualizations into a unified, searchable collection.
- Provide researchers, educators, and enthusiasts with easy access to historical visualization materials.
- Enable systematic analysis of visualization techniques, design patterns, and their evolution over time.
Components
The OldVis project consists of several components:
| Repository | Description |
|---|---|
| gallery | An online gallery of old visualizations (details). |
| oldvis_dataset | A Python package to download old visualizations (details). |
| dataset | Data curation scripts in Python and the built dataset (details). |
| libquery | A Python package to query digital libraries (details). |
| libquery_extensions | A Python package that extends libquery (details). |
| libprocess | A Python package to process query results from libquery (details). |
| libprocess_extensions | A Python package to process query results from libquery_extensions (details). |
| image classification labeler | An interactive data labeling system for image classification. |
| image segmentation labeler | An interactive data labeling system for image segmentation. |
| docs | This documentation website. |
This documentation provides an overview of the project's implementation. Detailed information can be found in each repository.
Stack
The stack of the OldVis project consists of four layers (from bottom to top):
- Data Collection: fetch metadata and images with libquery/libquery_extensions and process metadata with libprocess/libprocess_extensions.
- Data Labeling: label images with the image classification labeler, image segmentation labeler, and image taxonomy labeler systems.
- Dataset: the data curation scripts and the built dataset, as well as the dataset downloader oldvis_dataset.
- Gallery: the metadata and image gallery.
Copyright
Notice
We do not hold the copyright to the images of old visualizations indexed by the OldVis project. We have stored the rights information obtained from the data sources when available. When the rights information requires attribution (e.g., when the image is licensed under CC BY-NC-SA 3.0), you should attribute the original data source instead of us. If you want to redistribute the images, you should check whether they are in the public domain according to the copyright lengths and regulations in their country.