Historical Printed Ornaments: Dataset and Tasks

ICDAR 2024

Sayan K Chaki Sonat Baltaci Elliot Vincent Rémi Emonet Fabienne Vial-Bonacci Christelle Bahier-Porte Mathieu Aubry Thierry Fournel
Dataset & Code ROIi Project ArXiv Bibtex Poster

Abstract

This paper aims to develop the study of historical printed ornaments with modern unsupervised computer vision. We highlight three complex tasks that are of critical interest to book historians: clustering, element discovery, and unsupervised change localization. For each of these tasks, we introduce an evaluation benchmark, and we adapt and evaluate state-of-the-art models. Our Rey's Ornaments dataset is designed to be a representative example of a set of ornaments historians would be interested in. It focuses on an XVIIIth century bookseller, Marc-Michel Rey, providing a consistent set of ornaments with a wide diversity and representative challenges. Our results highlight the limitations of state-of-the-art models when faced with real data and show simple baselines such as k-means or congealing can outperform more sophisticated approaches.

Dataset and Tasks

Our Rey's Ornaments dataset is composed of three parts, giving insights into our three tasks:

Qualitative Results

Block Ornaments and Clustering

Composite Ornaments and Element Discovery

Vignettes and Unsupervised Change Localization

Project Links

Bibtex

@inproceedings{chaki2024historical,
title={Historical Printed Ornaments: Dataset and Tasks},
author={Chaki, Sayan and Baltaci, Sonat and Vincent, Elliot and Emonet, Rémi and Vial-Bonacci, Fabienne and Bahier-Porte, Christelle and Aubry, Mathieu and Fournel, Thierry},
booktitle={ICDAR},
year={2024}}

Acknowledgement

This work was funded by ANR ROIi project ANR-20-CE38-0005. S. Baltaci, E. Vincent, and M. Aubry were supported by ERC project DISCOVER funded by the European Union’s Horizon Europe Research and Innovation program under grant agreement No. 101076028 and ANR VHS project ANR-21-CE38-0008. We thank Silya Ounoughi, Thomas Gautrais, and Vincent Ventresque for their work in the collection and annotation of the datasets, and Ségolène Albouy, Raphaël Baena, Syrine Kalleli, Ioannis Siglidis, Gurjeet Singh, Andrea Morales and Malamatenia Vlachou for valuable feedbacks.

References

[1] Monnier, T., Groueix, T., Aubry, M.: Deep Transformation-Invariant Clustering. In: NeurIPS (Oct 2020)
[2] MacQueen, J., et al.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability. vol. 1, pp. 281–297. Oakland, CA, USA (1967)
[3] Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., et al.: Learning transferable visual models from natural language supervision. In: International conference on machine learning. pp. 8748–8763. PMLR (2021)
[4] Cox, M., Sridharan, S., Lucey, S., Cohn, J.: Least squares congealing for unsu- pervised alignment of images. In: 2008 IEEE Conference on Computer Vision and Pattern Recognition. pp. 1–8 (2008).
[5] Kingma, D.P., Welling, M.: Auto-encoding variational bayes. In: 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings (2014)