Human Vision: Jitendra Malik U.C. Berkeley
Human Vision: Jitendra Malik U.C. Berkeley
vision
Jitendra
Malik
U.C.
Berkeley
Visual
Areas
Mathema;cal
Abstrac;on
The
photoreceptor
mosaic:
rods
and
cones
are
the
eye’s
pixels
Cones
and
Rods
2 2
• OE = ( I ∗ f odd ) + ( I ∗ f even )
• Can be used to model complex cells, as this is
insensitive to phase
• Multiple scales
Hypercolumns
in
visual
cortex
Macaque
Visual
Areas
Rolls
et
al
(2000)
model
of
ventral
stream
Object
Detec;on
can
be
very
fast
• On
a
task
of
judging
animal
vs
no
animal,
humans
can
make
mostly
correct
saccades
in
150
ms
(Kirchner
&
Thorpe,
2006)
– Comparable
to
synap;c
delay
in
the
re;na,
LGN,
V1,
V2,
V4,
IT
pathway.
– Doesn’t
rule
out
feed
back
but
shows
feed
forward
only
is
very
powerful
• Detec;on
and
categoriza;on
are
prac;cally
simultaneous
(Grill-‐Spector
&
Kanwisher,
2005)
Feed-‐forward
model
of
the
ventral
stream
Intrinsic
&
Extrinsic
Connec;vity
of
the
Ventral
Stream
(Kravitz,
Saleem,
Baker,
Ungerleinder,
Mishkin,
TICS,
2013)
What
can
we
learn?
• Neurons
show
increasing
specificity
higher
in
the
visual
pathway
• V1
simple
and
complex
cells
are
orienta;on-‐tuned
• Convolu;on
with
a
linear
kernel
followed
by
simple
non-‐lineari;es
is
a
good
model
for
computa;on
in
re;na,
LGN
and
V1,
but
beyond
that
we
do
not
have
sa;sfactory
computa;onal
models
• Good
designs
of
visual
systems
are
likely
to
be
hierarchical
and
“mostly”
feedforward
Neuroscience
&
Computer
Vision
Features