Optimization of The Algorithm For Image Retrieval by Color Features
Optimization of The Algorithm For Image Retrieval by Color Features
Abstract - In this paper, we describe our research in content based image retrieval systems, based on
color features. The growing size of the database will result in long search time which may be unacceptable in
many practical situations. We describe our approach to solving this problem and to improve the performance
of the image database management system, based on color features, by using of sorted list structures.
Keywords: Image databases, content based image retrieval, image features, color content
INTRODUCTION
There is a rapid increase in the size of digital image collections together with the fast
growth of the Internet in the recent years. Digital images have found their way into many
application areas, including Geographical Information System, Office Automation, Medical
Imaging, Computer Aided Design, Computer Aided Manufacturing, and Robotics.
For content-based image retrieval (CBIR), i.e. searching in image databases based on
image content, several image retrieval systems have been developed. One of the first
systems was the QBIC system [8]. Other popular research systems are BlobWorld [10],
VIPER/GIFT [11], SIMBA [9], and SIMPLIcity [12]. All these systems compare images
based on specific features in one way or another and therefore a large variety of features
for image retrieval exists. Usually, CBIR systems do not use all known features as this
would involve large amounts of data and increase the necessary computing time. Instead,
a set of features appropriate to the given task is ususally selected, but it is difficult to
judge beforehand which features are appropriate for which tasks. The fundamental idea of
the CBIR approach is to generate automatically image descriptions directly from the image
content by analyzing the content of the images. Such techniques are being developed by
many research groups and commercial companies around the world.
- II.17-1 -
International Conference on Computer Systems and Technologies - CompSysTech’ 2006
- II.17-2 -
International Conference on Computer Systems and Technologies - CompSysTech’ 2006
Image retrieval
1. For both type of queries: Query by Image Example and Query by User
Sketch first step is to calculate and define ColorDescriptorMatrix of the query,
using the same algorithms as for adding new image in the database.
2. When we have the matrix of the query we analyze it consequently block by
block and build temporal query list. This temporal query list contains the IDs from
the corresponding lists in the image database of the dominant colors and the
positions of the blocks in the query. So it contains 256 lists from the database.
3. The iterative IDs will be grouped after sorting of this temporal query list. The
number of IDs in every group is determined and the query list is modified to contain
the IDs and the corresponding number of the coincidence of this ID in the query list.
4. This modified query list is sorted according to the descending number of the
coincidences per each ID.
After these steps the IDs of the most similar images will be in the beginning of
the query list
The similarity between the query and the images from image database is defined as:
BReq
SIM = ∗ 100% - if the query type is Query by Image Example
256
or:
BReq
SIM = ∗ 100% - if the query type is Query by User Sketch,
BRspec
where:
BReq - is the number of the blocks in which the dominant color is the same as this in the
query, this is the number of the coincidence in the temporal query list for every ID;
BR spec - if the query type is Query by User Sketch it is possible that the user is looking
for similar images not as a whole, but only for images that contain a specific color in a
specific position in the image and he does not define the dominant colors in all blocks. If
some of the dominant colors are not defined this means that they are not important for the
query and they will not take part in the results. (Figure 2). In this case the temporal query
list contains only the lists for the dominant colors of the specified blocks and only these
blocks are analyzed.
Figure 2. Query by User Sketch – the user is looking for images that contain purple
and green in the center of the image and the rest part of the image is not important.
- II.17-3 -
International Conference on Computer Systems and Technologies - CompSysTech’ 2006
Sketch, and the user is specified not all blocks in the query. The number and the exact
lists are defined by the query matrix. ColorDescriptorMatrix of the query defines the
exact numbers of the lists using this simple name convention for the files of lists:
o First two numbers in the name of the list are indexes in the matrix (Figure 1) and
they can be from 1 to 16;
o Third number in the name is defined by the number of the dominant color in this
block (table 1 and figure 1) and the possible values are from 0 to 9. This number
is equal to the corresponding value in the matrix in the position of two first
numbers.
Efficiency of this approach for image database organization and retrieval depends on
the selected algorithms and methods for fast file sorting and searching. These algorithms
are well known and widely used in the field of theory and practice of the algorithms and
data structures.
As a conclusion it is useful to say that the proposed approach is very efficient in respect
of similarity calculating, because the similarity coefficients are calculated automatically
through the process of list scanning. The number of the blocks with the same dominant
colors is equal to the number of the repetition of the corresponding image ID in the final
query list. The most similar images are in the beginning of this list.
REFERENCES
1. Boris Rachev, Irena Valova, Silyan Arsov, An Approach for Image Organization and
Retrieval in Realistic Image Databases, 7th EC-GIS Workshop, EG II-Managing the
Mosaic, Potsdam, Germany, 13-15.06.2001
2. Valova, B. Rachev, Image Databases – an Approach to Image Segmentation&Color
Reduction Analysis&Synthesis, International Conference CompSysTech’2003, Sofia,
Bulgaria, 19-20 June 2003
3. Valova, B. Rachev, Retrieval By Color Features In Image Databases, Adbis’04, 22-25
September, 2004, Budapest, Hungary
4. Valova, Irena; Rachev, Boris (Bulgaria): ''An Algorithm for Organization and Retrieval
by Color Features in Image Databases'', International Conference on Cybernetics and
Information Technologies, Systems and Applications: CITSA 2004; July 21 - 25, 2004
in Orlando, Florida, USA, Volume IV;
http://www.infocybernetics.org/citsa2005/pastproceeding/contents.asp?id_volumen=4&
action=1&id_conference=6&vpart=
5. Swain, M. J. and Ballard, D. H. (1991). Color indexing. International Journal of
Computer Vision, 7(1):11–32.
6. Schettini, R., Ciocca, G., and Zuffi, S. (2000). Color in databases: Indexation and
similarity. In Proc. of Int’l Conf. on Color in Graphics and Image Processing, pages
244–249.
7. Schettini, R., Ciocca, G., and Zuffi, S. (2001). Color Imaging Science: Exploiting Digital
Media, Ed. R. Luo and L. MacDonald, chapter A Survey on Methods for Colour Image
Indexing and Retrieval in Image Database. John Wiley
8. C. Faloutsos, R. Barber, M. Flickner, J. Hafner, W. Niblack, D. Petkovic, and W. Equitz.
Efficient and effective querying by image content. Journal of Intelligent Information
Systems, 3(3/4):231–262, July 1994.
9. S. Siggelkow. Feature Histograms for Content-Based Image Retrieval. Ph.D. thesis,
University of Freiburg, Institute for Computer Science, Freiburg, Germany, 2002.
- II.17-4 -
International Conference on Computer Systems and Technologies - CompSysTech’ 2006
- II.17-5 -