A new gesture-based browsing experience for art historical research:

paper, specified "short paper"
Authorship
  1. 1. Valentine Bernasconi

    Digital Visual Studies, University of Zurich, Switzerland

Work text
This plain text was ingested for the purpose of full-text search, not to preserve original formatting or readability. For the most complete copy, refer to the original conference program.


Introduction
Thanks to the increase of digital artworks collections and the latest sophistication of computer vision techniques, there is now the possibility to computationaly apprehend large corpuses of paintings and to propose different research perspectives to the field of art history. Recent works on pattern recognition, allowing similar patterns extraction from sets of images, enable to better understand the importance of replicas within the work of an artist (Shen et al., 2019), as well as the circulation of artistic practices in the past (Lenardo, di et al., 2016). Such techniques fostered the creation of innovative browsing tools based on similarity detection from areas of images and dedicated to the practice of art history (Seguin, 2018). The specific task of automated body pose estimation also allows new methodologies for the comparison of gestures in corpuses of paintings and drawings (Impett, 2020). Applied more specifically to the hand, automated gesture retrieval can ease the task of the art historian to analyze an important number of hand gestures and their possible underlying meaning in relation to their context (Bell et al., 2013). Thus, hands and gestures can be characterised geometrically over large sets of paintings
and considered through similarities or specific movements, enlighting their complex role in the pictorial practice(Impett and Süsstrunk, 2016). However, in spite of such developments, most navigation systems publicly available require some previous knowledge of the data and are still based on textual research

For examples see
and
https://www.europeana.eu/fr/collections

. Furthermore, other browsing approaches that rely on formal qualities

See projects proposed on the open platform

or propose new interactive systems based on gestures recognition

See the ArtLens Wall from the Cleveland Museum of Art

(Derry et al., 2021) rather aim at fostering the curiosity of a non-expert audience in a context of cultural heritage preservation, than promoting significant discoveries for researchers. It is based on this need for an innovative browsing tool and to better apprehend a corpus of painted hands for a research project on computational and historical analysis of hands and gestures in Early Modern time that the Gestures for Artwork Browsing (GAB) project was developed.

A gesture-based tool

Figure 1: Results page of the GAB application with details for each hand

The objectives of the main research are to better understand hand poses, their signification and implications in possible gestures from a dataset of Italian paintings from the 14th to the 18th Century. Based on a subset of the digitized Fototeca from the Bibliotheca Hertziana

, a collection of 5’993 hands was created by using the
OpenPose

See

model (Cao et al., 2021). The results of the model are a set of coordinates from keypoints describing the pose of each body detected on an image. From this information, the hands were automatically cropped. The browsing tool consists of recording a hand gesture performed by the user in front of a webcamera for five seconds. The keypoints of the hand detected
The library mediapipe was used for the real time extraction of keypoints, see
(Zhang et al., 2020)
at each frame are then analysed in order to retrieve, via a K-NN algorithm, similar hand poses from the collection. Finally, a .gif animation is created, reproducing the hand gesture of the user with painted hands from the Early Modern time. Apart from the animation, the user also has the possibility to see the detail of each hand used and the painting it belongs to with corresponding metadata for a better contextualisation and further investigation.

The research scope
The idea to translate a gesture to an animation has several research scopes, one of which is the possibility to show different representations of similar hand poses. It proposes a new context, different from the one of a single painting where still hands summarise a longer action, and allows researchers to envision these hands under a new perspective. Even though the fundamental meaning of a gesture greatly relies on the iconographic context and the character performing it (Burke, 1992; Dimova, 2020), the idea here is to understand and question the significance of the depicted moments in order to outline specific codes and conventions of representation from the time they were painted. Furthermore, the fact that a hand gesture can or cannot be translated with painted hands not only provides information on the practices of the studied period, but also on the divergences from contemporary ones. Overall, there is a great potential to learn from the data through the browsing process and to develop an understanding of the kind of poses that belong to the narrative spectrum of Early Modern art. However, it is important to acknowledge that the collection used is a subset of paintings from the Italian Early Modern period and results have to be apprehended with this curatorial aspect in mind. Any outcome cannot be considered as a ground truth as it is the result of multiple selections that occurred throughout the shaping and digitization process of the collection. In addition to the curation, the performance of the
OpenPose model primarily used for hands detection impacts on the results proposed, with an estimated accuracy of 32%. Many hands are thereby missing, whereas others, in spite of manual corrections, have misplaced keypoints, resulting in different pose interpretations from the machine than what they actually represent.

Bibliography

Bell, P., Schlecht, J. and Ommer, B. (2013). Nonverbal Communication in Medieval Illustrations Revisited by Computer Vision and Art History.
Visual Resources Journal, Special Issue on Digital Art History,
29. Taylor & Francis: 26–37.

Burke, P. (1992). The language of gesture in early modern Italy.
A Cultural History of Gesture. Cornell University Press, p. 14.

Cao, Z., Hidalgo, G., Simon, T., Wei, S.-E. and Sheikh, Y. (2021). OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields.
IEEE Transactions on Pattern Analysis and Machine Intelligence,
43(1): 172–86 doi:10.1109/TPAMI.2019.2929257.

Derry, L., Duhaime, D., Kruguer, J., Rodighiero, D., Schnapp, J. and Pietsch, C. (2021).
Surprise Machines. (Information+ Conference). Online https://vimeo.com/595473865 (accessed 4 February 2022).

Dimova, T. (2020).
Le Langage Des Mains Dans L’art: Histoire, Significations Et Usages Des Chirogrammes Picturaux Aux XVIIe Et XVIIIe Siecles. 1er édition. Brepols Publishers.

Impett, L. (2020). Analyzing Gesture in Digital Art History.
The Routledge Companion to Digital Humanities and Art History. Routledge.

Impett, L. and Süsstrunk, S. (2016). Pose and Pathosformel in Aby Warburg’s Bilderatlas.
Undefined /paper/Pose-and-Pathosformel-in-Aby-Warburg%27s-Bilderatlas-Impett-S%C3%BCsstrunk/f3a34525fa7021322f132c80c9517f240cf1e742 (accessed 3 June 2021).

Lenardo, I. di, Seguin, B. L. A. and Kaplan, F. (eds). (2016).
Visual Patterns Discovery in Large Databases of Paintings.

Seguin, B. (2018). The Replica Project: Building a visual search engine for art historians.
XRDS: Crossroads, The ACM Magazine for Students,
24(3): 24–29 doi:10.1145/3186653.

Shen, X., Efros, A. A. and Aubry, M. (2019). Discovering Visual Patterns in Art Collections With Spatially-Consistent Feature Learning.
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach, CA, USA: IEEE, pp. 9270–79 doi:10.1109/CVPR.2019.00950. https://ieeexplore.ieee.org/document/8954148/ (accessed 18 October 2021).

Zhang, F., Bazarevsky, V., Vakunov, A., Tkachenka, A., Sung, G., Chang, C.-L. and Grundmann, M. (2020). MediaPipe Hands: On-device Real-time Hand Tracking.
ArXiv:2006.10214 [Cs] http://arxiv.org/abs/2006.10214 (accessed 21 December 2021).

If this content appears in violation of your intellectual property rights, or you see errors or omissions, please reach out to Scott B. Weingart to discuss removing or amending the materials.

Conference Info

In review

ADHO - 2022
"Responding to Asian Diversity"

Tokyo, Japan

July 25, 2022 - July 29, 2022

361 works by 945 authors indexed

Held in Tokyo and remote (hybrid) on account of COVID-19

Conference website: https://dh2022.adho.org/

Contributors: Scott B. Weingart, James Cummings

Series: ADHO (16)

Organizers: ADHO