You are in:Home/Publications/Lamiaa A. Elrefaei, Tahani Q. Alhassan, Shefaa S. Omar, An Arabic Visual Dataset for Visual Speech Recognition, Procedia Computer Science, Volume 163, 2019, Pages 400-409, ISSN 1877-0509, https://doi.org/10.1016/j.procs.2019.12.122.

Prof. lamiaa Elrefaei :: Publications:

Title:
Lamiaa A. Elrefaei, Tahani Q. Alhassan, Shefaa S. Omar, An Arabic Visual Dataset for Visual Speech Recognition, Procedia Computer Science, Volume 163, 2019, Pages 400-409, ISSN 1877-0509, https://doi.org/10.1016/j.procs.2019.12.122.
Authors: Not Available
Year: 2020
Keywords: Not Available
Journal: Not Available
Volume: Not Available
Issue: Not Available
Pages: Not Available
Publisher: Not Available
Local/International: International
Paper Link:
Full paper Not Available
Supplementary materials Not Available
Abstract:

Visual speech recognition (VSR) has received increasing attention in recent decades due to its potential uses in many applications. As for any recognition system, useful materials for training and testing are required. For VSR system development, the training and testing materials are videos representing the visual speech of the words. This paper presents the Arabic Visual Speech Dataset (AVSD) for visual speech recognition. The dataset contains 1100 videos for 10 daily communication words collected from 22 speakers and recorded using smartphones’ cameras in high-resolution and high-framerate. The process of building the dataset, including design, acquisition, post-processing phases are described in the paper. Finally, the results of evaluating AVSD using a VSR system are presented and discussed.

Google ScholarAcdemia.eduResearch GateLinkedinFacebookTwitterGoogle PlusYoutubeWordpressInstagramMendeleyZoteroEvernoteORCIDScopus