Clustering Face Images with Application to Image Retrieval
in Large Databases
Florent Perronnin and Jean-Luc Dugelay
Institut Eurécom
Multimedia Communications Department
2229 route des Crêtes, BP 193
06904 Sophia-Antipolis Cédex, FRANCE
ABSTRACT
In this article, we evaluate the effectiveness of a pre-classification scheme for the fast retrieval of faces in a
large image database. The studied approach is based on a partitioning of the face space through a clustering
of face images. Two main issues are discussed. How to perform clustering with a non-trivial probabilistic
measure of similarity between faces? How to assign face images to all clusters probabilistically to form a robust
characterization vector? It is shown experimentally on the FERET face database that, with this simple approach,
the cost of a search can be reduced by a factor of 6 or 7 with no significant degradation of the performance.
Keywords: Biometrics, Face Recognition, Indexing, Clustering.
Send correspondence to Professor Jean-Luc Dugelay: E-mail [email protected], Telephone: +33 (0)4.93.00.26.41, Fax: +33 (0)4.93.00.26.27
1. INTRODUCTION
Defining a meaningful measure of similarity between face images for the problem of automatic person identifica-
tion and verification is a very challenging issue. Indeed, faces of different persons share global shape character-
istics, while face images of the same person are subject to considerable variability, which might overwhelm the
inter-person differences. Such variability is due to a long list of factors including facial expressions, illumination
conditions, pose, presence or absence of eyeglasses and facial hair, occlusion and aging. A measure of similarity
between face images should therefore be rich enough to accommodate all these possible variations. Although
using a more complex measure may improve the performance, it will also generally increase the computational
cost. Hence, it is difficult to design a measure which is both accurate and computationally efficient. However,
both properties are required to tackle the very challenging task of automatic retrieval of face images in large
databases. Two main techniques, based on the notion of coarse classification, have been suggested to reduce
the number of comparisons when searching a database.
The first approach makes use of two (or even more) complementary measures of distance and cascades them.
The first distance, which has a low accuracy but requires little computation, is run on the whole dataset and the
N-best candidates are retained. The second distance, which has a high accuracy but requires more computation,
is then applied to this subset of images. Such an approach has already been applied, for instance, to the
problem of multimodal biometric person authentication.1
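As an illustration of this cascading scheme, the following sketch (ours, not code from the cited work; `coarse_distance` and `fine_distance` are placeholder callables) shows the two-stage control flow:

```python
import heapq
from typing import Any, Callable, Sequence

def cascade_search(query: Any,
                   database: Sequence[Any],
                   coarse_distance: Callable[[Any, Any], float],
                   fine_distance: Callable[[Any, Any], float],
                   n_best: int) -> Any:
    """Two-stage cascade: prune with a cheap distance, decide with a costly one."""
    # Stage 1: the cheap, less accurate distance is run on the whole database
    # and only the N best candidates are retained.
    shortlist = heapq.nsmallest(
        n_best, database, key=lambda item: coarse_distance(query, item))
    # Stage 2: the accurate but expensive distance is applied to the shortlist only.
    return min(shortlist, key=lambda item: fine_distance(query, item))
```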
The second approach consists in partitioning the image space, e.g. by clustering the dataset. When a new
target image is added to the database, one computes the distance between this image and all clusters and the
image is associated to its nearest cluster. When a query image is probed, the first step consists in determining
the nearest cluster and the second step involves the computation of the distances between the query image and
the target images assigned to the corresponding cluster. It is interesting to note that the pre-classification
of images is an issue which has received very little attention from the face recognition community. For other
biometrics, such as fingerprints, it has been a very active research topic.2
The quality of a pre-classification scheme can be measured through the penetration rate and the binning error
rate.3 The penetration rate can be defined as the expected proportion of the template data to be searched under
the rule that the search proceeds through the entire partition, regardless of whether a match is found. A binning
error occurs if the template and a subsequent sample from the same user are placed in different partitions and
the binning error rate is the expected proportion of such errors. Both target and query images can be assigned to
more than one cluster. Indeed, if face images of a given person lie close to the “boundary” between two or more
clusters, since large variations may not be fully handled by the distance measure, different images of the same
person may be assigned to different clusters, as depicted in Figure 1. To address this problem, target and query
images can be assigned to their K nearest clusters or to all the clusters whose distance falls below a predefined
threshold. Obviously, the decrease of the binning error rate is obtained at the expense of an increase in the
penetration rate.
Figure 1. Face images of the same person assigned to different clusters because they lie close to a cluster “boundary”.
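To make these two definitions concrete, here is a minimal sketch of how both rates could be estimated, assuming each image has been assigned to a set of cluster indices (multiple assignment included); the data layout is hypothetical:

```python
def penetration_rate(query_bins, target_bins):
    """Expected proportion of the template set searched per query.

    query_bins / target_bins: lists of sets of cluster indices, one per image.
    A template is searched if it shares at least one cluster with the query.
    """
    n_templates = len(target_bins)
    per_query = [sum(1 for t in target_bins if q & t) / n_templates
                 for q in query_bins]
    return sum(per_query) / len(per_query)

def binning_error_rate(same_person_pairs):
    """Proportion of (template_bins, query_bins) pairs from the same person
    that share no cluster and would therefore never be compared."""
    errors = sum(1 for t, q in same_person_pairs if not (t & q))
    return errors / len(same_person_pairs)
```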
The main contribution of this paper is to evaluate the reduction of the amount of computation which can
be achieved when searching a face database using a pre-classification strategy based on clustering. In this work,
the measure of similarity between face images that is considered is the Probabilistic Mapping with Local
Transformations (PMLT) introduced in Ref. 4. This approach consists in estimating the set R of possible transformations
between face images of the same person. The global transformation is approximated with a set of local trans-
formations under the constraint that neighboring transformations must be consistent with each other. Local
transformations and neighboring constraints are embedded within the probabilistic framework of a 2-D HMM.
The states of the HMM are the local transformations, emission probabilities model the cost of a local mapping
and transition probabilities the cost of coherence constraints (cf. Refs. 4, 5 for more details). The measure of similarity
between a template image I_t and a query image I_q is P(I_q|I_t, R), i.e. the likelihood that I_q was generated from
I_t knowing the set R of possible transformations. This approach was shown to be robust to facial expression,
pose and illumination variations.5 Even if its computational complexity is low enough to perform real-time
verification (which is a one-to-one matching) or even identification (which is a one-to-many matching) for a
target set that does not exceed a few hundred images on a modern PC, it is still too high for searching large
face databases.
Therefore, we will first have to consider the issue of clustering face images with this non-trivial probabilistic
measure of similarity. Many clustering algorithms, especially those based on a probabilistic framework, can be
directly interpreted as an application of the Expectation-Maximization (EM) algorithm.6 During the E-step,
the distance between each observation and each cluster centroid is computed and each observation is assigned to
its nearest cluster (or probabilistically to all clusters). During the M-step, the cluster centroid is updated using
the assigned observations. The update step also depends on the chosen distance since the centroid is defined as
the point that minimizes the average distance between the assigned observations and the centroid. When using
simple metrics the update step is greatly simplified. For instance, for the Euclidean distance, the update step
is a simple averaging of the assigned observations. In the case of complex distances, such as PMLT, computing
the centroid is much more challenging.
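The alternating scheme just described can be summarized by the following sketch with a hard E-step, leaving the distance and the centroid computation as abstract callables (for the Euclidean distance `compute_centroid` reduces to averaging; for PMLT it is the difficult part addressed below):

```python
import random
from typing import Any, Callable, List

def em_style_clustering(data: List[Any],
                        n_clusters: int,
                        distance: Callable[[Any, Any], float],
                        compute_centroid: Callable[[List[Any]], Any],
                        n_iterations: int = 20) -> List[Any]:
    """Generic alternating (EM-like) clustering with a pluggable distance."""
    centroids = random.sample(data, n_clusters)
    for _ in range(n_iterations):
        # E-step: assign every observation to its nearest centroid.
        members: List[List[Any]] = [[] for _ in range(n_clusters)]
        for x in data:
            nearest = min(range(n_clusters),
                          key=lambda c: distance(x, centroids[c]))
            members[nearest].append(x)
        # M-step: re-estimate each centroid from its assigned observations,
        # keeping the old centroid if a cluster ended up empty.
        centroids = [compute_centroid(m) if m else centroids[c]
                     for c, m in enumerate(members)]
    return centroids
```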
Then, we will consider the possibility of assigning each face image to all clusters probabilistically instead of
assigning face images in a hard manner. A similar approach, referred to as anchor modeling, has already been
proposed in the field of automatic speaker detection and indexing.7 Another contribution of this paper is to
improve over the original anchor modeling approach.
The remainder of this paper is organized as follows. In section 2, we describe the face image clustering
procedure. In section 3, we briefly review the anchor modeling approach and propose two improvements. In
section 4, we present experimental results before drawing conclusions in section 5.
2. CLUSTERING FACE IMAGES

Let O = {O_1, ..., O_N} denote the set of observations extracted from the N face images to cluster, and let λ_c denote the centroid of cluster C_c. Each observation is modeled as being drawn from a mixture:

$$P(O_n \mid \lambda, \lambda_R) = \sum_{c=1}^{C} w_c \, P(O_n \mid \lambda_c, \lambda_R) \qquad (1)$$

with λ = {w_1, ..., w_C, λ_1, ..., λ_C}. The mixture weights w_c are subject to the following constraint:

$$\sum_{c=1}^{C} w_c = 1 \qquad (2)$$
We also assume that the samples are drawn independently from the above mixture:
$$P(O \mid \lambda) = \prod_{n=1}^{N} P(O_n \mid \lambda) \qquad (3)$$
Our goal is to find the parameters {w1 , ..., wC } and {λ1 , ..., λC } which maximize P (O|λ). This problem cannot
be solved directly and an iterative procedure based on the EM algorithm is generally used. The application of
the EM algorithm to the problem of the estimation of mixture densities is based on the computation (E-step) and
maximization (M-step) of Baum’s auxiliary Q function.6 The hidden variable includes both the state sequence
Q, i.e. the set of local transformations which are “chosen” when measuring the similarity between an image
and a cluster centroid (cf. section 1), and a variable Θ that indicates the mixture component (i.e. the cluster
assignment). Therefore, the Q function takes the following form:
$$Q(\lambda \mid \lambda') = \sum_{Q} \sum_{\Theta} P(Q, \Theta \mid O, \lambda') \log P(O, Q, \Theta \mid \lambda) \qquad (4)$$
where λ′ is the current parameter estimate and λ is the improved set of parameters that we seek to estimate.
If we split log P(O, Q, Θ|λ) into log P(O, Q|Θ, λ) + log P(Θ|λ), the Q function can be written as:
$$Q(\lambda \mid \lambda') = \sum_{c=1}^{C} \sum_{n=1}^{N} \gamma_n^c \log(w_c) + \sum_{c=1}^{C} \sum_{n=1}^{N} \sum_{Q} \gamma_n^c \log P(O_n, Q \mid \lambda_c, \lambda_R) \qquad (5)$$
where the probability γ_n^c for image I_n to be assigned to cluster C_c is given by:

$$\gamma_n^c = P(\lambda'_c \mid O_n, \lambda_R) = \frac{w'_c \, P(O_n \mid \lambda'_c, \lambda_R)}{\sum_{i=1}^{C} w'_i \, P(O_n \mid \lambda'_i, \lambda_R)} \qquad (6)$$
To maximize Q(λ|λ′), we can maximize the two terms independently. To find the optimal estimate ŵ_c of w_c,
we maximize the first term under the constraint (2) and obtain:

$$\hat{w}_c = \frac{1}{N} \sum_{n=1}^{N} \gamma_n^c \qquad (7)$$
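In practice, equations (6) and (7) are best evaluated in the log domain since, as noted in section 4, the likelihoods involved are far too large to manipulate directly. A numpy sketch (assuming a precomputed array `log_lik[n, c]` holding log P(O_n|λ_c, λ_R)):

```python
import numpy as np

def e_step_responsibilities(log_lik: np.ndarray, weights: np.ndarray) -> np.ndarray:
    """Eq. (6): gamma[n, c], computed via log-sum-exp for numerical stability."""
    log_joint = np.log(weights)[None, :] + log_lik        # log(w_c P(O_n|...))
    log_norm = np.logaddexp.reduce(log_joint, axis=1)     # log of the denominator
    return np.exp(log_joint - log_norm[:, None])

def m_step_weights(gamma: np.ndarray) -> np.ndarray:
    """Eq. (7): the new weight of cluster c is the mean responsibility over n."""
    return gamma.mean(axis=0)
```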
The maximization of the second term does not raise technical difficulties. However, as this issue is not the focus
of this paper, the details are not presented here; the interested reader can refer to Ref. 5.
As we want a fast initialization procedure, we do not want to resort to the EM procedure to estimate λ_i. Thus
we make use of the concept of medoid9: one chooses the most likely observation among the set of observations
assigned to C_i. Thus, if λ_{I_m} is the “template representation” of I_m, then:

$$\lambda_i = \arg\max_{m : I_m \in C_i} \; \sum_{n : I_n \in C_i} P(O_n \mid \lambda_{I_m}, \lambda_R) \qquad (9)$$
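The medoid rule (9) transcribes directly, with `likelihood(obs, candidate)` standing in for P(O_n|λ_{I_m}, λ_R) (a placeholder; note that substituting log-likelihoods in the sum would instead select the medoid under a product of likelihoods):

```python
from typing import Any, Callable, List

def medoid(cluster: List[Any], likelihood: Callable[[Any, Any], float]) -> Any:
    """Eq. (9): return the member of the cluster whose template representation
    maximizes the summed likelihood of all observations in the cluster."""
    return max(cluster,
               key=lambda candidate: sum(likelihood(obs, candidate)
                                         for obs in cluster))
```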
Recall that, during the initialization step, the goal is to find the C cluster centroids which maximize
the likelihood of the set of observations. After each merging stage, the likelihood of the set of observations will
decrease. Therefore, our goal is to merge, at each step of the agglomerative clustering, the two clusters that lead
to the smallest decrease of the likelihood. Hence, the distance between two clusters C_i and C_j is defined as the
decrease in likelihood after the merging.
Note that this is similar to the criterion which is often used by Gaussian merging algorithms.10 While at each
step we are guaranteed to obtain the smallest decrease in likelihood, we are not guaranteed that the sequence of
steps leads to the global maximum.
However, we found experimentally that if we apply this procedure directly, the clusters we obtain may be
highly unbalanced, i.e. some clusters may be assigned a large number of data items while others may contain
only a few. This is a problem, as a cluster centroid cannot be robustly estimated from too few data items.
Hence, we should penalize the previous distance in order to take into account the balance between clusters.
Let n_i be the number of data items in cluster C_i and let N be the total number of data items. We also
introduce p_i = n_i/N. Clearly, the entropy11:

$$H = -\sum_{i=1}^{C} p_i \log(p_i) \qquad (11)$$
is a measure of balance: the larger H, the more balanced the set of clusters. Let H be the entropy of the
set of clusters {C_1, ..., C_C}. If we merge clusters C_i and C_j, then the change in entropy will be:

$$\Delta H = p_i \log(p_i) + p_j \log(p_j) - (p_i + p_j) \log(p_i + p_j) \qquad (12)$$

which is a negative quantity. The closer this quantity is to zero, the smaller the reduction of entropy, and thus
the smaller the reduction of the “balance” of our system.
Hence, we use as a measure of distance between two clusters C_i and C_j the likelihood decrease penalized by
the entropy decrease, where ρ is a positive parameter that balances the two possibly competing criteria: the
minimum likelihood decrease versus the maximum entropy decrease.
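Since the penalized distance itself is not reproduced above, the sketch below instantiates one plausible form of one merging step, cost = likelihood decrease − ρ · (entropy change), with the model-dependent likelihood term abstracted away as `likelihood_decrease`:

```python
import math
from itertools import combinations
from typing import Any, Callable, List, Optional, Tuple

def entropy(sizes: List[int]) -> float:
    """Eq. (11): H = -sum_i p_i log(p_i) with p_i = n_i / N."""
    total = sum(sizes)
    return -sum((n / total) * math.log(n / total) for n in sizes if n > 0)

def best_merge(clusters: List[List[Any]],
               likelihood_decrease: Callable[[List[Any], List[Any]], float],
               rho: float) -> Optional[Tuple[int, int]]:
    """One agglomerative step: choose the pair whose merge costs the least,
    where cost = likelihood decrease - rho * (entropy change, eq. (12))."""
    sizes = [len(c) for c in clusters]
    h_before = entropy(sizes)
    best_pair, best_cost = None, float("inf")
    for i, j in combinations(range(len(clusters)), 2):
        merged = [n for k, n in enumerate(sizes) if k not in (i, j)]
        merged.append(sizes[i] + sizes[j])
        delta_h = entropy(merged) - h_before      # negative: merging loses balance
        cost = likelihood_decrease(clusters[i], clusters[j]) - rho * delta_h
        if cost < best_cost:
            best_pair, best_cost = (i, j), cost
    return best_pair
```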
Figure 2. Case where a template image It and a query image Iq are unlikely to belong to the same person but are still
assigned to the same cluster.
3. ANCHOR MODELING

Anchor modeling was proposed in the field of speaker detection and indexing7: a speech utterance s is scored
against a set of models {A_1, ..., A_N} referred to as anchors, and the vector v = [p(s|A_1), ..., p(s|A_N)]^T is used
to characterize the speech utterance. This characterization vector can be understood as a projection of the
utterance into a speaker space. The same idea can be applied to face images: let v_q be the characterization
vector of I_q. Then, at test time, we first compute the distance between v_q and the characterization vectors of
all template images contained in the database. Although there are as many distances to compute as template
images, this is very fast as these vectors are low dimensional. Then I_q is compared only with the template
images I_t whose characterization vectors lie within a given threshold distance of v_q. Note that this approach
can be seen as a special case of the cascading approach. Indeed, characterization vectors are simplified
representations of face images and thus recognition based purely on these vectors has a low accuracy. However,
they are fairly fast to estimate and very fast to compare. An interesting property of such a cascading approach
is that the characterization vector retains the properties of the costly distance, a property that is not discussed
in Ref. 7. Indeed, if the distance is robust to some variations, then the characterization vector should not be
significantly affected by these variations.
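The two-step search can be sketched as follows; `characterize` (one pass of the image over the anchors) and `expensive_similarity` (standing in for the PMLT measure) are placeholders for routines not shown here:

```python
from typing import Any, Callable, List, Sequence, Tuple

def retrieve(query: Any,
             templates: List[Any],
             template_vectors: List[Sequence[float]],
             characterize: Callable[[Any], Sequence[float]],
             vector_distance: Callable[[Sequence[float], Sequence[float]], float],
             expensive_similarity: Callable[[Any, Any], float],
             threshold: float) -> List[Tuple[Any, float]]:
    """Anchor-model search: cheap vector comparisons first, then the costly
    similarity measure only on templates whose vectors are close enough."""
    v_q = characterize(query)
    # Shortlist: templates whose characterization vectors are near the query's.
    shortlist = [t for t, v_t in zip(templates, template_vectors)
                 if vector_distance(v_q, v_t) < threshold]
    # Full (expensive) comparison restricted to the shortlist.
    return sorted(((t, expensive_similarity(query, t)) for t in shortlist),
                  key=lambda pair: pair[1], reverse=True)
```

Two improvements over the original anchor modeling approach are proposed: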
• As the number of anchor models in7 was large (668 in their experiments), methods for reducing the size
of the Euclidean distance comparison were investigated in an effort to increase performance by using only
those anchor models that provide good characterizing information. However, such an approach does not
reduce the cost of computing v, which can also be significant. In the proposed approach, our anchors are
not faces but the centroids which are obtained after clustering a set of face images. The clustering step
should therefore perform a dimension reduction and drastically decrease the cost of computing v and of
comparing it with other vectors.
• Instead of using a characterization vector v based on the likelihood, we propose to use posterior probabilities:
v = [p(C_1|I), ..., p(C_C|I)]^T. Such a vector should be more robust, especially to a mismatch between training
and test conditions, as it normalizes the likelihood.
4. EXPERIMENTAL RESULTS
In this section, we first describe the experimental setup. We then compare the performance of posterior-based
characterization vectors with likelihood-based vectors. Finally, we evaluate the impact of the reduction of the
number of anchors on the efficiency of the retrieval.
Figure 3. Performance of a system with C = 20 clusters which makes use of (a) log-likelihood-based characterization
vectors, (b) posterior-based characterization vectors. Cumulative identification rate versus N-best (as a percentage of
the database). Compared metrics: L1, L2 and cosine in (a); L1, L2, cosine and symmetric divergence in (b).
We compared the L1, L2 and cosine metrics on both types of characterization vectors. As a posterior-based
characterization vector defines a discrete probability distribution, we also tried the symmetric divergence on
this type of vector.
Note that the likelihoods P(O_n|λ_c, λ_R) are extremely large (on the order of 10^{10,000}) and thus they are
difficult to compare directly. Therefore, in the following we did not use likelihood-based characterization vectors
but characterization vectors based on the log-likelihood. In the same manner, the P(O_n|λ_c, λ_R)'s are so large
that the posteriors P(λ_c|O_n, λ_R) are equal to 1 for the most likely centroid and 0 for the other ones. Thus, to
increase the fuzziness of the assignment, we raised the posteriors to the power of a small positive factor β and
then renormalized them so that they sum to unity. In the following experiments we set β = 0.01.
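Numerically, raising the posteriors to the power β and renormalizing is equivalent to a softmax over β-scaled log-likelihoods, which avoids ever forming numbers of that magnitude. A minimal sketch (mixture weights ignored for simplicity, and assuming the symmetric divergence is the symmetrized Kullback-Leibler divergence):

```python
import numpy as np

def flattened_posteriors(log_lik: np.ndarray, beta: float = 0.01) -> np.ndarray:
    """Posterior-based characterization vector with softened assignments.
    log_lik: shape (C,), holding log P(O|lambda_c, lambda_R) for each centroid."""
    scaled = beta * log_lik
    scaled = scaled - scaled.max()      # shift so the exponentials stay finite
    post = np.exp(scaled)
    return post / post.sum()

def symmetric_divergence(p: np.ndarray, q: np.ndarray, eps: float = 1e-12) -> float:
    """Symmetrized KL divergence KL(p||q) + KL(q||p) of discrete distributions."""
    p, q = p + eps, q + eps
    return float(np.sum((p - q) * (np.log(p) - np.log(q))))
```

With a small β such as the 0.01 used in the experiments, the most likely centroid still dominates the vector while the remaining components retain non-negligible mass, which is the intended increase in fuzziness.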
Results are presented for C = 20 clusters in Figure 3. In Figure 3 (a), we compare the performance of
the L1, L2 and cosine metrics for characterization vectors based on the log-likelihood. Clearly, the cosine is
by far the best choice. In Figure 3 (b), we compare the performance of the L1, L2, cosine and symmetric
divergence metrics for posterior-based characterization vectors. Results are much improved for the first three
metrics (especially for L1 and L2) compared to log-likelihood-based vectors. The four measures of distance
exhibit similar performance, but the symmetric divergence seems to outperform the other three by a slight
margin. Hence, in the following experiments, we will use posterior-based characterization vectors and the
similarity of two such vectors will be measured with the symmetric divergence.
Figure 4. Performance of the system with probabilistic cluster assignment for a varying number C of clusters (C = 5,
10, 20): identification rate versus percentage of comparisons.
5. CONCLUSION
In this article, we evaluated the effectiveness of a pre-classification scheme for the fast retrieval of faces in a large
database. We studied an approach based on a partitioning of the face space through a clustering of face images.
We discussed mainly two issues. First, we addressed the problem of clustering face images with a non-trivial
measure of similarity. As the chosen measure is probabilistic, we naturally used a maximum-likelihood (ML) framework based on the
EM principle. Then, we discussed how to form a characterization vector, which could be used for an efficient
indexing, by concatenating the distances between the considered image and all cluster centroids. While this is
similar to anchor modeling, we suggested two significant improvements over the original approach. Experiments
carried out on the FERET face database showed that, with this simple approach, the cost of a search could be
reduced by a factor 6 or 7 with very little degradation of the performance.
Although the exact figures might vary depending on the specific database or measure of similarity, we believe
that they give a reasonable idea of the speed-up which can be expected with a pre-classification approach. While
this is a very significant cost reduction, it is clear that such a scheme would not be sufficient for databases
which contain millions of faces. For such a challenging case, other approaches would have to be considered in
combination with the studied approach. In particular, the use of multiple hardware units or of exogenous data
(such as gender or age)12 would most certainly be necessary.
ACKNOWLEDGMENTS
The authors would like to thank Professor Kenneth Rose from the University of California at Santa Barbara
(UCSB) for drawing their attention to the important clustering issue. The authors would also like to thank
France Telecom Research and Development for partially funding their research activities.
REFERENCES
1. L. Hong and A. Jain, “Integrating faces and fingerprints for person identification,” IEEE Trans. on Pattern
Analysis and Machine Intelligence (PAMI) 20, pp. 1295–1307, Dec 1998.
2. A. Jain and S. Pankanti, Advances in Fingerprint Technology, ch. Automated Fingerprint Identification and
Imaging Systems. CRC Press, 2nd ed., 2001.
3. A. Mansfield and J. Wayman, “Best practices in testing and reporting performance of biometric devices,”
NPL Report CMSC 14/02, National Physical Laboratory, Aug 2002.
4. F. Perronnin, J.-L. Dugelay, and K. Rose, “Deformable face mapping for person identification,” in IEEE
Int. Conf. on Image Processing (ICIP), 1, pp. 661–664, 2003.
5. F. Perronnin, A Probabilistic Model of Face Mapping Applied to Person Recognition. PhD thesis, Institut
Eurécom, 2004.
6. A. Dempster, N. Laird, and D. Rubin, “Maximum likelihood from incomplete data via the EM algorithm,”
Journal of the Royal Statistical Society 39(1), pp. 1–38, 1977.
7. D. Sturim, D. Reynolds, E. Singer, and J. Campbell, “Speaker indexing in large audio databases using
anchor models,” in Proc. of the IEEE Int. Conf. on Acoustics Speech and Signal Processing (ICASSP), 1,
pp. 429–432, 2001.
8. R. Duda, P. Hart, and D. Stork, Pattern Classification, John Wiley & Sons, Inc., 2nd ed., 2000.
9. L. Kaufman and P. Rousseeuw, Finding groups in data: an introduction to cluster analysis, ch. Partitioning
around medoids. John Wiley & Sons, 1990.
10. A. Sankar, “Experiments with a Gaussian merging-splitting algorithm for HMM training for speech recogni-
tion,” in Proc. of the 1997 DARPA Broadcast News Transcription and Understanding Workshop, pp. 99–104,
1998.
11. T. Cover and J. Thomas, Elements of Information Theory, John Wiley & Sons, Inc., 1993.
12. A. Jain, S. Pankanti, L. Hong, A. Ross, and J. Wayman, “Biometrics: a grand challenge,” in Proc. of the
IEEE Int. Conf. on Pattern Recognition (ICPR), 2, pp. 935–942, 2004.