Cyclic style generative adversarial network for near infrared and visible light face recognition

Fangzheng Huang,Xikai Tang,Chao Li,D. Ban

Published 2023 in Applied Soft Computing

ABSTRACT

Face recognition in the visible light (VIS) spectrum has been widely utilized in many practical applications. With the development of the deep learning method, the recognition accuracy and speed have already reached an excellent level, where face recognition can be applied in various circumstances. However, in some extreme situations, there are still problems that face recognition cannot guarantee performance. One of the most signifcant cases is under poor illumination. Lacking light sources, images cannot show the true identities of detected people. To address such a problem, the near infrared (NIR) spectrum ofers an alternative solution to face recognition in which face images can be captured clearly. Studies have been made in recent years, and current near infrared and visible light (NIR-VIS) face recognition methods have achieved great performance. In this thesis, I review current NIR-VIS face recognition methods and public NIR-VIS face datasets. I frst list public NIR-VIS face datasets that are used in most research. For each dataset, I represent their characteristics, including the number of subjects, collection environment, resolution of images, and whether paired or not. Also, I conclude evaluation protocols for each dataset, helping with further analyzing of performances. Then, I classify current NIR-VIS face recognition methods into three categories, image synthesis-based methods, subspace learning-based methods, and invariant feature-based methods. The contribution of each method is concisely explained. Additionally, I make comparisons between current NIR-VIS face recognition methods and propose my own opinion on the advantages and disadvantages of these methods. To improve the shortcomings of current methods, this thesis proposes a new model, Cyclic Style Generative Adversarial Network (CS-GAN), which is a combination of image synthesis-based method and subspace learning-based method. The proposed CS-GAN improves the visualization results of image synthesis between the NIR domain and VIS domain as well as recognition accuracy. The CS-GAN is based on the Style-GAN 3 network which was proposed in 2021. In the proposed model, there are two generators from pre-trained Style-GAN 3 which generate images in the NIR domain and VIS domain, respectively. The generators consist of a mapping network and synthesis network, where the mapping network disentangles the latent code for reducing correlation between features, and the synthesis network synthesizes face images through progressive growing training. The generators have diferent fnal layers, a to-RGB layer for the VIS domain and a tograyscale layer for the NIR domain. Generators are embedded in a cyclic structure, in which latent codes are sent into the synthesis network in the other generator for recreated images, and recreated images are compared with real images which in the same domain to ensure domain consistency. Besides, I apply the proposed cyclic subspace learning. The

PUBLICATION RECORD

Publication year
2023
Venue
Applied Soft Computing
Publication date
2023-11-01
Fields of study
Computer Science, Engineering
Identifiers
DOI 10.2139/ssrn.4387135
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Real Time Fatigue Detection Using Shape Predictor 68 Face Landmarks Algorithm
2022cited by this paper
Orthogonal Modality Disentanglement and Representation Alignment Network for NIR-VIS Face Recognition
2022cited by this paper
Joint Feature Distribution Alignment Learning for NIR-VIS and VIS-VIS Face Recognition
2021cited by this paper
Linear Cross-Modal Hash Encoding (LCMHE) for Visual and Near-Infrared Face Recognition
2021cited by this paper
Dual Face Alignment Learning Network for NIR-VIS Face Recognition
2021cited by this paper
Towards NIR-VIS Masked Face Recognition
2021influential reference
Alias-Free Generative Adversarial Networks
2021cited by this paper
Face Recognition in the Dark: A Unified Approach for NIR- VIS and VIS- NIR Face Matching
2020cited by this paper
Non-Visual to Visual Translation for Cross-Domain Face Recognition
2020influential reference
Re-Ranking High-Dimensional Deep Local Representation for NIR-VIS Face Recognition
2019cited by this paper
Analyzing and Improving the Image Quality of StyleGAN
2019cited by this paper
Adversarial Cross-Spectral Face Completion for NIR-VIS Face Recognition
2019influential reference
Dictionary Alignment With Re-Ranking for Low-Resolution NIR-VIS Face Recognition
2019cited by this paper
Facial feature embedded CycleGAN for VIS–NIR translation
2019cited by this paper
Cross-Spectral Face Hallucination via Disentangling Independent Factors
2019cited by this paper
Asymmetric Cyclegan for Unpaired NIR-to-RGB Face Image Translation
2019influential reference
Image-Image Translation to Enhance Near Infrared Face Recognition
2019cited by this paper
A Style-Based Generator Architecture for Generative Adversarial Networks
2018influential reference
ArcFace: Additive Angular Margin Loss for Deep Face Recognition
2018cited by this paper
GENERATIVE ADVERSARIAL NETS
2018cited by this paper
FPGA accelerates deep residual learning for image recognition
2017cited by this paper
Wasserstein Generative Adversarial Networks
2017cited by this paper
Progressive Growing of GANs for Improved Quality, Stability, and Variation
2017cited by this paper
Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization
2017influential reference
Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition
2017influential reference
Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks
2017cited by this paper
Adversarial Discriminative Heterogeneous Face Recognition
2017influential reference
Deep Heterogeneous Face Recognition Networks Based on Cross-Modal Distillation and an Equitable Distance Metric
2017influential reference
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
2016cited by this paper
Image-to-Image Translation with Conditional Adversarial Networks
2016cited by this paper
Heterogeneous Face Recognition with CNNs
2016influential reference
Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding
2016influential reference
Seeing the Forest from the Trees: A Holistic Approach to Near-Infrared Heterogeneous Face Recognition
2016cited by this paper
Transferring deep representation for NIR-VIS heterogeneous face recognition
2016influential reference
Deep Residual Learning for Image Recognition
2015cited by this paper
NIR-VIS heterogeneous face recognition via cross-spectral joint dictionary learning and reconstruction
2015cited by this paper
A Light CNN for Deep Face Representation With Noisy Labels
2015influential reference
Learning Compact Binary Face Descriptor for Face Recognition
2015influential reference
Very deep convolutional neural network based image classification using small training sample size
2015influential reference
Adam: A Method for Stochastic Optimization
2014influential reference
Shared representation learning for heterogenous face recognition
2014cited by this paper
Going deeper with convolutions
2014cited by this paper
On Effectiveness of Histogram of Oriented Gradient Features for Visible to Near Infrared Face Matching
2014cited by this paper
Optimal UV spaces for facial morphable model construction
2014cited by this paper
Matching NIR face to VIS face using multi-feature based MSDA
2014cited by this paper
Matching NIR Face to VIS Face Using Transduction
2014influential reference
Learning Face Representation from Scratch
2014influential reference
Deep Learning Face Attributes in the Wild
2014cited by this paper
The CASIA NIR-VIS 2.0 Face Database
2013cited by this paper
ImageNet classification with deep convolutional neural networks
2012cited by this paper
Transductive VIS-NIR face matching
2012cited by this paper
Computer Vision: Models, Learning, and Inference
2012cited by this paper
Heterogeneous Face Recognition: Matching NIR to Visible Light Images
2010cited by this paper
Weight-Based Facial Expression Recognition from Near-Infrared Video Sequences
2009influential reference
Learning mappings for face synthesis from near infrared to visual light images
2009cited by this paper
Heterogeneous Face Recognition from Local Structures of Normalized Appearance
2009cited by this paper
A 3D Face Model for Pose and Illumination Invariant Face Recognition
2009cited by this paper
Fusion of Multi-directional Rotation Invariant Uniform LBP Features for Face Recognition
2009influential reference
Multi-view facial expression recognition
2008cited by this paper
Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments
2008influential reference
Face Detection Based on Multi-Block LBP Representation
2007cited by this paper
Face Matching Between Near Infrared and Visible Light Images
2007cited by this paper
Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions
2007cited by this paper
Integrating structured biological data by Kernel Maximum Mean Discrepancy
2006influential reference
K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation
2005cited by this paper
Distinctive Image Features from Scale-Invariant Keypoints
2004influential reference
Categorical Data Analysis
2003cited by this paper
Web image retrieval re-ranking with relevance model
2003cited by this paper
Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns
2002influential reference
Nonlinear dimensionality reduction by locally linear embedding.
2000cited by this paper
A decision-theoretic generalization of on-line learning and an application to boosting
1997cited by this paper
Introduction to statistical pattern recognition (2nd ed.)
1990influential reference
A Theory for Multiresolution Signal Decomposition: The Wavelet Representation
1989cited by this paper
Long-term potentiation and memory.
1987cited by this paper
Parallel Distributed Processing: Explorations in the Microstructure of Cognition : Psychological and Biological Models
1986influential reference
Analysis of a complex of statistical variables into principal components.
1933cited by this paper

CITED BY

RGB-to-NIR Facial Image Translation via Lightweight Generative Adversarial Network
2026cites this paper
A survey on AI-empowered task-oriented sensing, communication, and computation in 6G networks
2026cites this paper
Research on Low-Light Image Enhancement Algorithm Based on Generative Adversarial Networks
2025cites this paper
Face Protection Based on Optimizing Latent Code Space With Multiple Loss Function Constraints
2025cites this paper
LW-DCGAN: a lightweight deep convolutional generative adversarial network for enhancing occluded face recognition
2024cites this paper
Human Emotion Recognition with an Advanced Vision Transformer Model
year unknowncites this paper