ICASSp 1
ICASSp 1
ABSTRACT faces with high accuracy and speed without employing any
specialized hardware or parallel processing. Given detected
Face recognition systems are designed to handle well-aligned
faces and their landmarks, we present an algorithm to align
images captured under controlled situations. However real-
these faces and use a modified version of a state-of-the-art al-
world images present varying orientations, expressions, and
gorithm to recognize faces. Our algorithm is able to identify
illumination conditions. Traditional face recognition algo-
pictures in almost real-time on a simple PC. Except for deep
rithms perform poorly on such images. In this paper we
learning based schemes, our method outperforms all other
present a method for face recognition adapted to real-world
schemes we are aware of in terms of recognition accuracy.
conditions that can be trained using very few training exam-
The rest of this paper is organized as follows: Section 2
ples and is computationally efficient. Our method consists
briefly reviews prominent state-of-the-art methods for face
of performing a novel alignment process followed by classi-
recognition, Section 3 presents our approach, Section 4 shows
fication using sparse representation techniques. We present
the results of our experiments on datasets for face recognition,
our recognition rates on a difficult dataset that represents
and Section 5 concludes the paper.
real-world faces where we significantly outperform state-of-
the-art methods.
2. PREVIOUS WORK
Index Terms— Face recognition, sparse representation,
alignment, mesh warping, facial landmarks. The initial methods developed for face recognition used in-
dividual features on the faces, such as eyes, mouth or nose
1. INTRODUCTION to perform identification [2]. However such methods did not
lead to good results because of the variability of poses and the
Face recognition is probably one of the most prominent areas low amount of information used.
of research in imaging and has a wide range of real-world From the 90s, new methods that use global features of the
applications including surveillance, access control, identity faces were developed. For example, Turk and Pentland pro-
authentication [1], and photo-management. Face recognition posed EigenFaces [3] that uses Principal Component Analy-
systems either perform face verification, i.e., classify a pair sis (PCA). Other methods like Fisherfaces [4] or Laplacian-
of pictures as belonging to the same individual or not, or per- faces [5] extract features from face images and perform near-
form face identification, i.e., put a label on an unknown face est neighbor identification using Euclidean distance measure.
with respect to some training set. In this paper, we address Baback et al. [6] use a bayesian approach where a probabilis-
the latter problem of face identification. tic similarity measure is used to perform classification.
During the last thirty years automatic face recognition has Wright et al. applied the ideas of sparse coding to face
seen considerable progress. Despite this, face recognition is recognition: they proposed the Sparse Representation based
a very challenging problem when the training examples are Classification (SRC) scheme [7], a dictionary learning based
few and the conditions of capture are unconstrained, resulting approach to recognize faces. This method, which can be seen
in face images varying widely in orientation, expression, and as an improvement over the previous ones, is far more robust
illumination. and is able to handle occlusions and corruption of face im-
In our work, we focus on the difficult problem of recog- ages. The SRC algorithm led to other approaches [8, 9, 10,
nizing faces captured in uncontrolled environments. We im- 11] that use sparsity and improve the robustness in dealing
pose additional constraints on the number of training samples with face alignment and pose variation issues.
and on computational efficiency without needing any special- Following the success of the use of sparsity in face recog-
ized hardware. This rules out deep learning approaches which nition Zhang et al. [12] questioned if sparsity was the key to
are data and computation hungry. the success of the SRC algorithm. They concluded that it is
Our contribution is a face recognition scheme that per- the use of collaborative representation (i.e., using an overcom-
forms automatic face alignment and recognition of detected plete dictionary) and not the sparsity constraint that improves
1483
(a) Reference face 1 (b) Triangle mesh (c) LFW sample (d) Face landmarks (e) Mesh warping (f) Aligned image
Fig. 1: The steps of alignment using our method on LFW sample image of G.W. Bush. Output (f) is cropped and resized after alignment.
4. RESULTS
1484
2. Applying RSC algorithm on the faces detected on the ing samples is very fast since the size of the dictionary reduces
LFWa images. significantly.
3. Applying our modified RSC algorithm, after performing
our alignment step on the LFWa images.
Method Rate
The images from the LFWa dataset are 250 × 250 pixels in
NN 10.6%
size. For the first experiment, we resize them to 50 × 50 with-
SRC [7] 22.3%
out any other modifications. For the second experiment we
ESRC [11] 26.7%
detect faces in the LFWa images and we resize the face re-
PCRC [13] 25.0 ± 1.8%
gion of the images to size 50 × 50. In the third experiment,
SVDL [10] 30.2%
which corresponds to our algorithm, the final aligned images
Ours 33.3 ± 3.4%
are of size 30×30. The results are summarized in the Table 1.
Our time 0.02 s
Exp. 1 Exp. 2 Exp. 3
Recognition rate (%) 19.6 28.8 76.4 Table 3: Recognition rates on the LFWa dataset for the extreme
Time for one image (s) 3.2 3.0 1.6 case of using a single training sample per person.
1485
6. REFERENCES [12] Lei Zhang, Meng Yang, and Xiangchu Feng, “Sparse
representation or collaborative representation: Which
[1] Michel Owayjan, Amer Dergham, Gerges Haber, Nidal helps face recognition?,” in 2011 International Confer-
Fakih, Ahmad Hamoush, and Elie Abdo, “Face recog- ence on Computer Vision. IEEE, 2011, pp. 471–478.
nition security system,” in New Trends in Network-
ing, Computing, E-learning, Systems Sciences, and En- [13] Pengfei Zhu, Lei Zhang, Qinghua Hu, and Simon C. K.
gineering, pp. 343–348. Springer, 2015. Shiu, “Multi-scale patch based collaborative represen-
tation for face recognition with margin distribution op-
[2] Woodrow Wilson Bledsoe, “Man-machine facial recog- timization,” ECCV, 2012.
nition,” Panoramic Research Inc., 1966. [14] Volker Blanz and Thomas Vetter, “Face Recognition
[3] Matthew Turk and Alex Pentland, “Eigenfaces for Based on Fitting a 3D Morphable Model,” IEEE Trans-
Recognition,” Journal of Cognitive Neuroscience, vol. actions on Pattern Analysis and Machine Intelligence,
3, no. 1, 1991. vol. 25, no. 9, pp. 1063–1074, September 2003.
[15] Xiangyu Zhu, Zhen Lei, Junjie Yan, Dong Yi, and
[4] Peter N. Belhumeur, João P. Hespanha, and David J. Stan Z Li, “High-fidelity pose and expression normal-
Kriegman, “Eigenfaces vs. Fisherfaces: Recognition ization for face recognition in the wild,” in Proceedings
Using Class Specific Linear Projection,” IEEE Trans- of the IEEE Conference on Computer Vision and Pattern
actions on Pattern Analysis and Machine Intelligence, Recognition, 2015, pp. 787–796.
vol. 19, no. 7, pp. 711–720, 1997.
[16] Florian Schroff, Dmitry Kalenichenko, and James
[5] Xiaofei He, Shuicheng Yan, Yuxiao Hu, Partha Niyogi, Philbin, “FaceNet: A Unified Embedding for Face
and Hong-Jiang Zhang, “Face recognition using lapla- Recognition and Clustering,” 2015.
cianfaces,” IEEE Transactions on Pattern Analysis and
Machine Intelligence, vol. 27, pp. 328–340, 2005. [17] Yi Sun, Ding Liang, Xiaogang Wang, and Xiaoou Tang,
“DeepID3: Face Recognition with Very Deep Neural
[6] Baback Moghaddam, Tony Jebara, and Alex Pentland, Networks,” February 2015.
“Bayesian face recognition,” Pattern Recognition, vol.
[18] Paul Viola and Michael Jones, “Robust real-time object
33, pp. 1771–1782, 2000.
detection,” in IJCV, 2001.
[7] John Wright, Allen Y. Yang, Arvind Ganesh, S. Shankar [19] Vahid Kazemi and Josephine Sullivan, “One Millisec-
Sastry, and Yi Ma, “Robust Face Recognition via Sparse ond Face Alignment with an Ensemble of Regression
Representation,” IEEE Transactions on pattern analysis Trees,” in CVPR. 2014, IEEE Computer Society.
and machine intelligence, vol. 31, no. 2, pp. 210–227,
February 2009. [20] Davis E. King, “Dlib-ml: A Machine Learning Toolkit,”
Journal of Machine Learning Research, vol. 10, pp.
[8] Andrew Wagner, John Wright, Arvind Ganesh, Zhou Zi- 1755–1758, 2009.
han, Hossein Mobahi, and Yi Ma, “Towards a Practical
[21] Gary B. Huang and Vidit Jain, “Unsupervised joint
Face Recognition System: Robust Alignment and Illu-
alignment of complex images,” in ICCV, 2007.
mination by Sarse Representation,” .
[22] A.M Martinez and R. Benavente, “The AR Face
[9] Meng Yang, Jian Yang, and David Zhang, “Robust Database,” Tech. Rep. 24, CVC, June 1998.
Sparse Coding for Face Recognition,” IEEE Conference
on Computer Vision and Pattern Recognition (CVPR), [23] Gary B. Huang, Manu Ramesh, Tamara Berg, and
pp. 625–632, June 2011. Erik Learned-Miller, “Labeled Faces in the Wild:
A Database for Studying Face Recognition in Uncon-
[10] Meng Yang, Luc Van Gool, and Lei Zhang, “Sparse strained Environments,” Tech. Rep. 07-49, University
variation dictionary learning for face recognition with of Massachusetts, Amherst, October 2007.
a single training sample per person,” in The IEEE In-
[24] Gary B. Huang Erik Learned-Miller, “Labeled Faces
ternational Conference on Computer Vision, December
in the Wild: Updates and New Reporting Proce-
2013.
dures,” Tech. Rep. UM-CS-2014-003, University of
[11] Weihong Deng, Jiani Hu, and Jun Guo, “Extended SRC: Massachusetts, Amherst, May 2014.
Undersampled Face Recognition via Intraclass Variant [25] Lior Wolf, Tal Hassner, and Yaniv Taigman, “Similarity
Dictionary,” IEEE Transactions on Pattern Analysis and scores based on background samples,” Asian Confer-
Machine Intelligence, vol. 34, no. 9, pp. 1864–1870, ence on Computer Vision, September 2009.
September 2012.
1486