Multimodal Biometrics Recognition from Facial Video with Missing Modalities Using Deep Learning

Sayan Maity; Mohamed Abdel-Mottaleb; Shihab S. Asfour

doi:10.3745/JIPS.02.0129

Journal article

Open access

Sayan Maity, Mohamed Abdel-Mottaleb and Shihab S. Asfour

JIPS (Journal of Information Processing Systems), 16(1), pp. 6-29

2020-02

DOI: https://doi.org/10.3745/JIPS.02.0129

Abstract

Computer Science

Biometric identification using multiple modalities has attracted the attention of many researchers because it produces more robust and trustworthy results than single-modality biometrics. In this paper, we present a novel multimodal recognition system that trains a deep learning network to automatically learn features after extracting multiple biometric modalities from a single data source, i.e., facial video clips. Utilizing the different modalities present in the facial video clips, i.e., left ear, left profile face, frontal face, right profile face, and right ear, we train supervised denoising auto-encoders to automatically extract robust and non-redundant features. The automatically learned features are then used to train modality-specific sparse classifiers to perform the multimodal recognition. Moreover, the proposed technique has proven robust when some of the above modalities were missing during testing. The proposed system has three main components: detection, which consists of modality-specific detectors that automatically detect images of the different modalities present in facial video clips; feature selection, which uses a supervised denoising sparse auto-encoder network to capture discriminative representations that are robust to illumination and pose variations; and classification, which consists of a set of modality-specific sparse representation classifiers for unimodal recognition, followed by score-level fusion of the recognition results of the available modalities. Experiments conducted on the constrained facial video dataset (WVU) and the unconstrained facial video dataset (HONDA/UCSD) resulted in Rank-1 recognition rates of 99.17% and 97.14%, respectively. The multimodal recognition accuracy demonstrates the superiority and robustness of the proposed approach irrespective of the illumination, non-planar movement, and pose variations present in the video clips, even when modalities are missing.

KCI Citation Count: 0
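
As an illustration of the final stage described in the abstract, score-level fusion over whichever modalities are available, the following is a minimal sketch only. The abstract does not state the fusion rule, so plain averaging of per-subject scores is an assumption made here, and the function name `fuse_scores`, the modality keys, and the score values are all hypothetical rather than taken from the paper.

```python
def fuse_scores(scores_by_modality):
    """Fuse per-subject match scores across the modalities that are present.

    scores_by_modality maps a modality name to a list of per-subject scores,
    or to None when that modality was not detected in the video clip.
    ASSUMPTION: fusion is a simple mean; the paper's actual rule may differ.
    """
    # Keep only the modalities that were actually detected.
    available = [s for s in scores_by_modality.values() if s is not None]
    if not available:
        raise ValueError("no modality available for fusion")

    n_subjects = len(available[0])
    if any(len(s) != n_subjects for s in available):
        raise ValueError("all score lists must cover the same subjects")

    # Average the scores subject by subject over the available modalities.
    fused = [sum(column) / len(available) for column in zip(*available)]

    # Rank-1 decision: the subject with the highest fused score.
    best = max(range(n_subjects), key=fused.__getitem__)
    return best, fused


# Hypothetical scores for three enrolled subjects; two modalities are missing.
example_scores = {
    "left_ear": None,
    "left_profile_face": [0.2, 0.7, 0.1],
    "frontal_face": [0.1, 0.8, 0.1],
    "right_profile_face": None,
    "right_ear": [0.5, 0.3, 0.2],
}
best_subject, fused_scores = fuse_scores(example_scores)
```

Because missing modalities are simply skipped before averaging, the same call works whether all five modalities or only a subset are detected in a clip.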

Files and links (1)

url

https://doi.org/10.3745/JIPS.02.0129

Published (Version of record) Open

Metrics

12 Record Views

14 Times Cited - Web of Science

Details

Title: Multimodal Biometrics Recognition from Facial Video with Missing Modalities Using Deep Learning
Creators: Sayan Maity - (University of Miami)
Mohamed Abdel-Mottaleb - (University of Miami)
Shihab S. Asfour - (University of Miami)
Publication Details: JIPS (Journal of Information Processing Systems), 16(1), pp. 6-29
Publisher: Korea Information Processing Society (한국정보처리학회)
Academic Unit: Leadership Department; College of Engineering; CoE - Electrical & Computer Engineering
Language: English
Resource Type: Journal article
Record Identifier: 991031576022402976