"Growcut" - Interactive Multi-Label ND Image Segmentation by Cellular Automata
"Growcut" - Interactive Multi-Label ND Image Segmentation by Cellular Automata
net/publication/246346467
CITATIONS READS
430 1,280
2 authors, including:
Vadim Konushin
Lomonosov Moscow State University
26 PUBLICATIONS 600 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Vadim Konushin on 07 July 2014.
The performance of described photo editing methods was eval- 5. Interactivity - as the segmentation is refined with each itera-
uated in [Rother et al. 2004] (except for the intelligent paint). The tion, user can observe the evolution and refine the segmenta-
authors have clearly shown, that methods based on graph cuts allow tion “on the fly”;
6. The algorithm is simple in both understanding and implemen-
tation;
2 Method details
At iteration t + 1 cell labels l t+1
p and strengths θ pt+1 are updated
2.1 Basic method as follows:
Cellular automata (CA) were introduced by Ulam and von Neu-
Code 1 Automata evolution rule
mann [von Neumann 1966]. Since then they’ve been used to
model wide variety of dynamical systems in various application do- // For each cell...
mains, including image denoising and edge detection [Popovici and for ∀p ∈ P
Popovici 2002], [Hernandez and Herrmann 1996]. A cellular au- // Copy previous state
tomaton is generally an algorithm discrete in both space and time, l t+1
p = lp;
t
As we have already emphasized in the introduction, our hope is It also should not be forgotten that both Random Walker and
to stir up the research community, motivating to search new ideas GrowCut are capable of solving multi-labelling tasks.
in the field of cellular automata and evolutionary computation and Random Walker boasts of creating segments connected to the
applying them to interactive image segmentation. initial seeds always. So as GrowCut does. Graph Cut potentially
We expect that results exceeding our current can be obtained. can (and sometimes does) create segments not connected to ini-
However, our current method can already compete with elegant tial seeds, if the regional term R(A) is stronger than boundary term
achievements of graph theory. In this section we will try to compare B(A). Same applies GrabCut. However, whether this an advantage
current top performing methods with ours and point out advantages or a drawback is probably the question of exact segmentation task.
and disadvantages of our scheme.
We take four methods - Graph Cuts, GrabCut, Random Walker
and GrowCut and compare them by several criteria: segmentation 4.2 Speed
quality, speed and convenience for the user. Accurately speaking,
The method speed is surely important. The reaction time to the
the methods differ seriously by the amount of information that they
user input should be small enough not to produce discomfort and
extract from the image. GrabCut uses most information - it com-
irritation, otherwise the method cannot be called interactive. An
putes the evolving color statistics of foreground and background
important measure is not only how fast the initial segmentation is
and takes into account color difference between neighboring pix-
achieved, but also how fast can the segmentation be recomputed,
els. Graph Cuts differs in using color statistics collected from the
when user adds more seeds to refine the initial segmentation.
user-specified seeds only, computed before the segmentation start.
The fastest are algorithms based on graph cuts. The initial seg-
Random Walker uses only intensity difference between neighboring
mentation computation time for Graph Cuts reported by the authors
pixels. Our current GrowCut variant also does not take advantage of
is ‘less than a second’ for ‘most 2D images (up to 512x512)’ on
object color statistics, however it can be easily extended to maintain
‘333MHz Pentium III’. This looks terrific, but judging from the
regions color statistics and use them in automaton evolution.
Intel processor manufacturer site, the frequency for Pentium III
processor family ranges from 450 MHz to 1.33 GHz. This sug-
4.1 Segmentation quality gests that actually running times are given for 1.33 GHz processor,
which is still a very good result. GrabCut processing time was given
Judging by our own experiments, Graph Cuts, GrabCut (we used for 450x300 image and equals to 0.9sec on 2.5GHz CPU with 512
our own implementation of both methods) and GrowCut are all able Mb RAM. Recomputing segmentation after new user-added seeds
to produce high quality segmentation in natural and medical im- is even faster than that. However, our own implementation of these
ages, so in this case mostly the amount user effort required should methods, based on Vladimir Kolmogorov’s publicly available code
be the measure of performance. However, there are some inter- [Boykov and Kolmogorov 2004] was not able to reach this speed.
esting features in these methods results, mentioned in next para- Random walker cannot boast of such speed, for the computation
graph. Concerning Random Walker, we’ve not been able to test the time for 256x256 image on dual Intel Xeon 2.4GHz processor with
1GB RAM is almost 5 (4.831) seconds. No indications that adding
more seeds will need less time to compute a refined segmentation
were found in the method description. But again, Random Walker
solves a harder task of multi-labelling.
Our current implementation goes somewhere in between. It
takes about 4 seconds on 256x256 image on 2.5GHz processor.
However, this is the total automaton evolution time (till complete
stop). Usually the desired segmentation result is obtained much
earlier (about 1.5-2 seconds) and the later automaton evolution does
not change the segmentation and can be stopped by the user, when
he is satisfied with the result. Another important issue is that our
methods allows user intervention at any iteration step, so adding
additional seeds throughout the computation does not increase the
whole working time. Again - our method solves the task of multi-
labelling.
6 Acknowledgements B OYKOV, Y., V EKSLER , O., AND Z ABIH , R. 2001. Fast approximate
energy minimization via graph cuts. IEEE Trans. Pattern Anal. Mach.
We gratefully acknowledge valuable comments from the Graphicon Intell. 23, 11, 1222–1239.
reviewing committee members, Yuri Boykov and Olga Veksler. We
G RADY, L., AND F UNKA -L EA , G. 2004. Multi-label image segmentation
also would like to thank our lab’s head Yuri Bayakovsky for attract- for medical applications based on graph-theoretic electrical potentials.
ing our attention to the field of cellular automata some years ago. In ECCV Workshops CVAMIA and MMBIA, 230–245.
H EIMANN , T., T HORN , M., K UNERT, T., AND M EINZER , H.-P. 2004.
References New methods for leak detection and contour correction in seeded region
growing segmentation. In 20th ISPRS Congress, Istanbul 2004, Inter-
BARRE , S. Medical image samples. national Archives of Photogrammetry and Remote Sensing, vol. XXXV,
http://www.barre.nom.fr/medical/samples/. 317–322.
B OYKOV, Y., AND J OLLY, M.-P. 2001. Interactive graph cuts for optimal
boundary and region segmentation of objects in n-d images. In Proc. of H ERNANDEZ , G., AND H ERRMANN , H. J. 1996. Cellular automata for
the International Conference on Computer Vision, vol. 1, 105–112. elementary image enhancement. CVGIP: Graphical Model and Image
Processing 58, 1, 82–89.
B OYKOV, Y., AND KOLMOGOROV, V. 2004. An experimental comparison
of min-cut/max-flow algorithms for energy minimization in vision. IEEE M OGA , A., AND G ABBOUJ , M. 1996. A parallel marker based watershed
Trans. Pattern Anal. Mach. Intell. 26, 9, 1124–1137. transformation. In ICIP96, II: 137–140.
M ORTENSEN , E. N.,View
AND BARRETT, W. A. 1998. Interactive segmenta-
publication stats
tion with intelligent scissors. Graphical Models and Image Processing
60, 5, 349–384.
M ORTENSEN , E. N., AND BARRETT, W. A. 1999. Toboggan-based intel-
ligent scissors with a four-parameter edge model. In CVPR, 2452–2458.
P OPOVICI , A., AND P OPOVICI , D. 2002. Cellular automata in image pro-
cessing. In Fifteenth International Symposium on Mathematical Theory
of Networks and Systems.
R EESE , L. 1999. Intelligent Paint: Region-Based Interactive Image Seg-
mentation. Master’s thesis, Department of Computer Science, Brigham
Young University, Provo, UT.
ROTHER , C., KOLMOGOROV, V., AND B LAKE , A. 2004. Grabcut - in-
teractive foreground extraction using iterated graph cuts. Proc. ACM
Siggraph.
V EKSLER , O. 2000. Image segmentation by nested cuts. In Proc. IEEE
Conf. Computer Vision and Pattern Recognition, vol. 1, 339–344.
VON N EUMANN , J. 1966. Theory of Self-Reproducing Automata. Univer-
sity of Illinois Press. Ed. and Completed by A. Burks.