DOI: 10.1098/rsos.251515 ISSN: 2054-5703

Dynamic presentation in 3D modulates face similarity judgements: a human-aligned encoding model approach

Simon M. Hofmann, Anthony Ciston, Abhay Koushik, Felix Klotzsche, Martin Hebart, Klaus-Robert Mueller, Arno Villringer, Nico Scherf, Anna Hilsmann, Vadim Nikulin, Michael Gaebler

Abstract

Face perception dynamically unfolds in three-dimensional space, yet, experimental paradigms predominantly rely on static 2D images, limiting insights into real-world face processing. We conducted a pre-registered study comparing face similarity judgements in static 2D and dynamic 2.5D (here, 3D) conditions using a triplet odd-one-out task in 2605 participants (yielding data from 323 400 unique trials). Behavioural similarity matrices revealed a strong cross-condition correlation (R2D~3D = 0.93, p < 0.001), suggesting perceptual invariance, that is, consistency across modalities. However, human-aligned sparse (VICE) and deep (VGG-Face) encoding models trained to map face stimuli to behavioural judgements uncovered condition-specific weighting of facial geometry: while chin–cheek distance, eye size and nose shape dominated similarity judgements in both conditions, face width–height ratio and upper face length gained more perceptual relevance in 3D. Importantly, the richer information in dynamic stimuli significantly reduced choice variance, indicating lower perceptual demand than in static 2D during similarity judgements. Employing a representational alignment framework, our approach reveals both shared cognitive processing and representational differences between the static 2D and dynamic 2.5D conditions, motivating more naturalistic experimental paradigms that reflect real-world perception. Our open large-scale dataset and encoding models enable further advances in face perception research across biological and computational systems.

More from our Archive