Can you feel what I am saying? Speech-based vibrotactile stimulation enhances the cortical tracking of attended speech in a multi-talker background
I. Sabina Răutu, Mathieu Bourguignon, Vincent Wens, Veikko Jousmäki, Julie Bertels, Xavier De TiègeAbstract
In environments with multiple talkers, humans can ‘tune in’ to a speaker of interest while ignoring competing voices. In such conditions, however, auditory cortices track the attended speech envelope rhythms (cortical tracking of speech, CTS) less accurately than in quiet, hindering intelligibility. Visual speech cues (e.g., lip movements) can enhance this CTS, but it remains unclear whether other non-auditory sensory cues, such as tactile input, provide comparable benefits through similar neural mechanisms. Here, using magnetoencephalography, we quantified CTS as the coherence between the speech temporal envelope of the attended speaker and brain responses in supratemporal auditory areas at syllabic (4–8 Hz), word (1–4 Hz) and phrasal/sentential (<1 Hz) frequencies. Participants listened to connected speech presented alone, together with synchronous or asynchronous speech-based vibrations, or with the corresponding speaker video, in both quiet and a multi-talker background. We hypothesized that, in the presence of competing background speakers, speech-based vibrotactile stimulation improves comprehension by enhancing CTS of the attended speaker and modulating auditory-seeded functional brain connectivity with extra-auditory neocortical brain areas. Results revealed that synchronous vibrotactile stimulation improved comprehension in the multi-talker background and increased syllabic CTS at right auditory cortex, with this CTS increase magnitude correlating with comprehension performance. Audio-tactile CTS enhancement was accompanied by stronger beta-band auditory cortex connectivity with ipsilateral angular and ventral inferior temporal gyri, alongside reduced alpha-band coupling with the precuneus. These findings suggest that vibrotactile input can support speech-in-noise processing by impacting both local auditory cortical activity and auditory-seeded long-range functional connectivity.