Performances of Mobile-Agents For Interactive Image Retrieval
Performances of Mobile-Agents For Interactive Image Retrieval
Abstract query and presents the relevant results given a specific rel-
evance function to the user. The user annotates the results
In this paper, we present a system for image retrieval in order to improve the relevance function. The system then
over a network of computer based on “ant-like” mobile- presents new results according to this refinement. This user
agents. Image databases are hosted on the network, and the interaction loop is called relevance feedback, and is per-
user wants to find all the images matching a specific con- formed several times. When the refinement is satisfying,
cept (cars, flower, Italy, etc...). Usually, content based im- the best results are presented to the user as final result.
age retrieval systems (CBIR) do not consider the dispertion The best efforts made so far in order to improve the
of the data among the network. We train a SVM classifier learning process of the classifier is “active learning” and
with examples annotated by the user and then launch mobile aims at optimizing the set of training examples [4]. The ac-
agents which explore the network in order to retrieve the tive learning strategy aims at presenting to the user the un-
most relevant images. Several interactive session (launch- labeled document that will enhance the most the relevance
ing of agents then annotation of the results) are made to function [8].
improve the classifier. With the increase of interconnections between networks,
Experiments are made both to see the influence of local- the available images are distributed over thousands of small
ization of the search concept on the quality of the learning, databases. The internet shows us an example of a real con-
and to focus on the quality of the agent based solution com- text for interactive image retrieval. Classical CBIR systems
pared to a centralizing system within a fixed amount of time has not been designed to face this network-based context.
for the interaction. They consider that all the images are available in a huge
single database.
We are now introducing a new system designed for CBIR
1 Introduction. over a network. It is based on mobile agents inspired by
ants [6].These agents explore the network looking for im-
With the growth of image collections coming from per-
ages matching the user’s concept. All the processing of the
sonal or professional digitization, spread on private net-
images are made on the computer where they are hosted,
works, or on the Internet, the incredible amount of available
thus allowing a natural distribution of the computation. We
data makes it impossible to search within by hand. Usually,
have already presented preliminary description of our sys-
users searching for a category of images (let say images
tem [7, 5], we are now testing its performances.
of cars, or images of Europe) mostly use a search engine
This paper if organized as follows: the next part will give
and feed it with semantic keywords supposed to represent
an overview of our system. We will present results on the
the searched concept (as with Google). Yet, these search
quality of the retrieval depending on the distribution of the
engines rely on meta-data (annotations) or the context in
data. And then, we will evaluate the performance of our
which the image is found (the web page for example) and
system in a real-case application, and comment the results
not on the internal features of the image.
obtained in order to draw a conclusion on the advantages of
Tools have been developed, as [10], in order to use low
the distributed approach.
level features of the images (colors, textures, shapes, etc.).
In order to improve the retrieval, a learning processes has 2 Description of our strategy.
been introduced in the context of content-based image re-
trieval (CBIR) [9, 2]. A typical interactive retrieval follows Let us consider a network of computers, some of which
this scheme: first, the user gives an image or a keyword containing image databases. On each of these machines,
to begin the search process. Then, the system analyzes the our system runs a local static agent (Local Image-Database
3 Results.
Effects of thematization of databases. We made a real
and functionnal implementation of this system using the
JADE1 framework. In order to see the influence of the path
reinforcement on the learning process, the system has been Figure 1. Mean Average Precision against the
tested on a simplificated setup. This setup is as follows: our localization of the search concept. The gain
test database is an excerpt of the Corel database which con- in quality when the concept is localized is be-
tains 6,000 images categorized in 50 concepts of 50 to 300 tween 5% and 10%.
images. The database is split into two subsets hosted on
separated computers A and B. Agents were able to move
the existings systems are not designed for a use over a net-
either to A or to B. The localization of the searched concept
work, we have built a very simple system to be compared
varied from 50% on A (equally distributed) to 100% on A
to, which we call “centralizing system”. At the begining
(entirely on A). When the category is equally distributed
of each retrieval session, it gathers all the images available
over the network, the path reinforcement algorithm cannot
on the network into a local database, and then performs the
give the advantage to one of the bases. On the contrary, if
classical interactive CBIR scheme. The user presents an
the category is entirely on A, this algorithm forgets the path
image representing the concept he is looking for. The sys-
to B quickly. Eight mobile-agents were launched, each of
tem analyzes the image by extracting its features and build
them carrying back two images. We made relevance feed-
a classifier based on SVM. 5 images are selected from the
back rounds until 100 images were annotated.
local database given the strategy SV Mactive [8]. The user
The figure 1 shows the Mean Average Precision (M AP )
labels them (1 relevant, -1 irrelevant), and a new SVM clas-
obtained. The M AP indicated an overall quality of the in-
sifier is computed, leading to another round of active learn-
teractive retrieval system, and is computed as follow : Given
ing, exactly as for the distributed approach. When the time
a ranking of the bests images, We call precision P the num-
out is reached, the best results are presented to the user.
ber of relevant images retrieved over the total number of
Let us introduce now the setup used to compare this cen-
retrieved images, and recall R the number of relevant im-
tralizing strategy to our system. We have a network of 5
ages retrieved over the total number of relevant images. The
computers S0 to S4 . The user is working on the machine
Average Precision Ap is the value of P each time R in-
S0 . The other computers are hosting a image database from
creases. The M AP is the mean of all Ap through the final
{A, B, C, D} with the following pairs : {S1 , A}, {S2 , B},
ranking. A perfect retrieval will lead to a M AP of 1. For
{S3 , C}, {S4 , D}. S0 is connected to each of the other
very easy categories, the path learning has little influence
computers. Given the results of previous experiments, the
on the concept learning. The same appears with difficult
concentration of the searched category is set to 90% on A.
categories. For the categories in between, a well localiza-
The query is presented to the system, and then, the retrieval
tion allows us to gain up to 10% of quality, as seen in figure
aims at finding the most relevant images of {A, B, C, D}
1. If the searched category is uniformly distributed over the
within a specific maximum interaction time (M IT ).
network, the reinforcement algorithm has no effect on the
A Gaussian distribution centered on 1s and a σ of 100ms
learning.
is used to model the time taken by the user to annotate an
image. At the end of this session, we will compute the mean
Overall performances. In these experiments, we want to average precision (M AP ). We made the experiment for
compare our system to classical CBIR strategies. But since several values of M IT in order to catch the behavior of the
1 http://jade.tilab.it two systems. Each image in the category was taken as the