Enlighten Research Data

In this section

A comprehensive multimodal dataset for contactless lip reading and acoustic analysis

Abbasi, Q. , Ge, Y., Chong, T., Haobo, L., Zikang, C., Wang, J., Wenda, L., Cooper, J. , Chetty, K., Faccio, D. and Imran, M. (2023) A comprehensive multimodal dataset for contactless lip reading and acoustic analysis. [Data Collection]

Datacite DOI: 10.6084/m9.figshare.c.6942846.v1

Collection description

Nowadays, non-privacy small-scale motion detection has attracted an increasing number of researches of remote sensing in speech recognition. These new modalities target to enhance and restore the speech information from speakers from multiple types of data. In this paper, we propose a dataset contains 7.5 GHz Channel Impulse Response (CIR) data from Ultra-Wideband (UWB) radar, 77 GHz frequency modulated continuous wave (FMCW) data from millimeter-wave (mmWave) radar and laser data. Meanwhile, a depth camera is adopted to record the subjects’ landmarks of lip and voice. Approximately 6 hours of annotated speech profiles are provided, which are collected from 20 participants speaking 5 vowels, 15 words and 16 sentences. The dataset has been validated for and is potential for the researches of lip reading and multi-modalities speech recognition.

Funding:

Engineering and Physical Sciences Research Council (EPSRC) [EP/T021020/1]
Engineering and Physical Sciences Research Council (EPSRC) [EP/T021063/1]

College / School:

College of Science and Engineering > School of Engineering

Date Deposited:

09 Nov 2023 11:52

Statement on legal, ethical and access issues:

This is open data and is available under Creative Commons Licence (CCBY)

URI:

https://researchdata.gla.ac.uk/id/eprint/1408

Additional details

Available Files

There are no files for this dataset available to download.

Repository Staff Only: Update this record

Cite this record

Abbasi, Q. , Ge, Y., Chong, T., Haobo, L., Zikang, C., Wang, J., Wenda, L., Cooper, J. , Chetty, K., Faccio, D. and Imran, M. (2023); A comprehensive multimodal dataset for contactless lip reading and acoustic analysis

FigShare

DOI: 10.6084/m9.figshare.c.6942846.v1

Retrieved: 2024-10-22

Altmetric

Enlighten Research Data

A comprehensive multimodal dataset for contactless lip reading and acoustic analysis

Collection description

Available Files

Library