
VoxResNet: Deep Voxelwise Residual Networks for

Volumetric Brain Segmentation

Hao Chen, Qi Dou, Lequan Yu and Pheng-Ann Heng


Department of Computer Science and Engineering
The Chinese University of Hong Kong
Hong Kong, China
[email protected]

arXiv:1608.05895v1 [cs.CV] 21 Aug 2016

Abstract

Recently, deep residual learning with residual units for training very deep neural networks advanced the state-of-the-art performance on 2D image recognition
tasks, e.g., object detection and segmentation. However, how to fully leverage
contextual representations for recognition tasks from volumetric data has not been
well studied, especially in the field of medical image computing, where a major-
ity of image modalities are in volumetric format. In this paper we explore the
deep residual learning on the task of volumetric brain segmentation. There are
at least two main contributions in our work. First, we propose a deep voxelwise
residual network, referred as VoxResNet, which borrows the spirit of deep residual
learning in 2D image recognition tasks, and is extended into a 3D variant for han-
dling volumetric data. Second, an auto-context version of VoxResNet is proposed
by seamlessly integrating the low-level image appearance features, implicit shape
information and high-level context together for further improving the volumetric
segmentation performance. Extensive experiments on the challenging benchmark
of brain segmentation from magnetic resonance (MR) images corroborated the ef-
ficacy of our proposed method in dealing with volumetric data. We believe this
work unravels the potential of 3D deep learning to advance the recognition per-
formance on volumetric image segmentation.

1 Introduction

Over the last few years, deep learning especially deep convolutional neural networks (CNNs) have
emerged as one of the most prominent approaches for image recognition problems in various do-
mains including computer vision [15, 31, 18, 33] and medical image computing [25, 26, 3, 30].
Most of these studies focused on 2D object detection and segmentation tasks, where deep learning has shown compelling accuracy compared to previous methods employing hand-crafted features. However,
in the field of medical image computing, volumetric data accounts for a large portion of medical
image modalities, such as Computed Tomography (CT) and Magnetic Resonance Imaging (MRI),
etc. Furthermore, volumetric diagnosis from temporal series usually requires data analysis in an even higher dimension. In clinical practice, the task of volumetric image segmentation plays a signif-
icant role in computer aided diagnosis (CADx), which provides quantitative measurements and aids
surgical treatments. Nevertheless, this task is quite challenging due to the high dimensionality and complexity of volumetric data. To the best of our knowledge, there are two types of CNNs
developed for handling volumetric data. The first type employed modified variants of 2D CNN by
taking aggregated adjacent slices [5] or orthogonal planes (i.e., axial, coronal and sagittal) [25, 27]
as input to make up complementary spatial information. However, these methods cannot make full use of the volumetric contextual information, and hence cannot segment objects from volumetric data accurately. Recently, other methods based on 3D CNNs have been developed to detect or segment objects from volumetric data and demonstrated compelling performance [8, 14, 17]. Nevertheless, these methods may suffer from limited representation capability due to a relatively shallow depth, or may run into the optimization degradation problem when the network depth is simply increased.
Recently, deep residual learning with substantially enlarged depth further advanced the state-of-
the-art performance on 2D image recognition tasks [10, 11]. Instead of simply stacking layers, it
alleviated the optimization degradation issue by approximating the objective with residual functions.
Brain segmentation for quantifying brain structure volumes can be of significant value for diagnosis, progression assessment and treatment of a wide range of neurologic diseases such as Alzheimer's disease [9]. Therefore, many automatic methods have been developed to achieve accurate segmen-
tation performance in the literature. Broadly speaking, they can be categorized into three classes:
1) Machine learning methods with hand-crafted features. These methods usually employed dif-
ferent classifiers with various hand-crafted features, such as support vector machine (SVM) with
spatial and intensity features [37, 22], random forest with 3D Haar like features [38] or appearance
and spatial features [24]. These methods suffer from limited representation capability for accurate
recognition. 2) Deep learning methods with automatically learned features. These methods learn
the features in a data-driven way, such as 3D convolutional neural network [6], parallelized long
short-term memory (LSTM) [32], and 2D fully convolutional networks [23]. These methods can
achieve more accurate results while eliminating the need for designing sophisticated input features.
Nevertheless, more elegant architectures are required to further advance the performance. 3) Multi-
atlas registration based methods [1, 2, 28]. However, these methods are usually computationally expensive, which limits their applicability when fast processing is required.
To overcome the aforementioned challenges and further unleash the capability of deep neural networks, we propose a deep voxelwise residual network, referred to as VoxResNet, which borrows the spirit of deep residual learning to tackle the task of object segmentation from volumetric data. Exten-
sive experiments on the challenging benchmark of brain segmentation from volumetric MR images
demonstrated that our method can achieve superior performance, outperforming other state-of-the-
art methods by a significant margin.

2 Method

2.1 Deep Residual Learning

Deep residual networks with residual units have shown compelling accuracy and nice conver-
gence behaviors on several large-scale image recognition tasks, such as ImageNet [10, 11] and
MS COCO [7] competitions. By using identity mappings as the skip connections and after-addition
activation, residual units can allow signals to be directly propagated from one block to other blocks.
Generally, a residual unit can be expressed as follows:

$$x_{l+1} = x_l + \mathcal{F}(x_l, W_l) \qquad (1)$$

where $\mathcal{F}$ denotes the residual function, $x_l$ is the input feature to the $l$-th residual unit, and $W_l$ is the set of weights associated with that unit. The key idea of deep residual learning is to learn the additive residual function $\mathcal{F}$ with respect to the input feature $x_l$. Hence, by unfolding the above equation recursively, $x_L$ ($L > l \geq 1$) can be derived as

$$x_L = x_l + \sum_{i=l}^{L-1} \mathcal{F}(x_i, W_i) \qquad (2)$$

Therefore, the feature $x_L$ of any deeper unit can be represented as the feature $x_l$ of a shallower unit $l$ plus the summed residual functions $\sum_{i=l}^{L-1} \mathcal{F}(x_i, W_i)$. This derivation implies that residual units allow information to propagate smoothly through the network.
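The derivation above can be checked numerically. The following is a small sketch (not from the paper) that applies Eq. (1) step by step with a toy residual function and verifies that the result matches the unfolded form of Eq. (2):

```python
import numpy as np

rng = np.random.default_rng(0)
weights = [rng.standard_normal((4, 4)) * 0.1 for _ in range(3)]  # toy W_l for three residual units

def residual_fn(x, W):
    """Toy residual function F(x, W); stands in for the BN-ReLU-Conv blocks of a real unit."""
    return np.tanh(W @ x)

x0 = rng.standard_normal(4)  # input feature x_l

# Eq. (1): apply the residual units one after another.
x_step = x0.copy()
for W in weights:
    x_step = x_step + residual_fn(x_step, W)

# Eq. (2): x_L equals x_l plus the sum of residual functions evaluated at the intermediate features.
residual_sum = np.zeros_like(x0)
xi = x0.copy()
for W in weights:
    f = residual_fn(xi, W)
    residual_sum += f
    xi = xi + f
x_unfolded = x0 + residual_sum

print(np.allclose(x_step, x_unfolded))  # True: both forms give the same deep feature
```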

2.2 VoxResNet for Volumetric Image Segmentation

Although 2D deep residual networks have been extensively studied in the domain of computer vision [10, 11, 29, 39], to the best of our knowledge, few studies have explored them in the field of medical image computing, where the majority of datasets consist of volumetric images. In order to leverage

[Figure 1 appears here. Panel (a): the VoxResNet architecture: input; Conv1a and Conv1b (32 filters, 3x3x3); Conv1c, Conv4 and Conv7 (64 filters, 3x3x3, stride 2); VoxRes modules 2-3, 5-6 and 8-9 interleaved with BN-ReLU blocks; classifiers C1-C4 (at scales 1, 2, 4, 8) combined into the softmax output. Panel (b): the VoxRes module, consisting of BN, ReLU, Conv 64 3x3x3, BN, ReLU, Conv 64 3x3x3, with a skip connection from the module input.]

Figure 1: (a) The architecture of proposed VoxResNet for volumetric image segmentation; (b) The illustration of VoxRes module.

the powerful capability of deep residual learning and tackle the object segmentation tasks from high-
dimensional volumetric images efficiently and effectively, we extend the 2D deep residual networks
into a 3D variant and design the architecture following the spirit from [11] with full pre-activation,
i.e., using asymmetric after-addition activation, as shown in Figure 1(b).
The architecture of our proposed VoxResNet for volumetric image segmentation is shown in Fig-
ure 1(a). Specifically, it consists of stacked residual modules (i.e., VoxRes module) with a total of
25 volumetric convolutional layers and 4 deconvolutional layers [18], which is the deepest 3D con-
volutional architecture so far. In each VoxRes module, the input feature xl and transformed feature
F(xl , Wl ) are added together with skip connection as shown in Figure 1(b), hence the information
can be directly propagated in the forward and backward passes. Note that all the operations are im-
plemented in a 3D way to strengthen the volumetric feature representation learning. Following the
principle from VGG network [31] and deep residual networks [11], we employ small convolutional
kernels (i.e., 3 × 3 × 3) in the convolutional layers, which have demonstrated the state-of-the-art
performance on image recognition tasks. Three of the convolutional layers use a stride of 2, which reduces the resolution of the input volume by a factor of 8. This gives the network a large receptive field, enclosing more contextual information and improving its discrimination capability. Batch normalization layers are inserted throughout the architecture to reduce internal covariate shift [12], which accelerates the training process and improves performance. In our network, rectified linear units, i.e., f(x) = max(0, x), are used as the activation function for non-linear transformation [15].
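The paper's implementation is based on Caffe, but to make the structure of a VoxRes module concrete, here is a minimal PyTorch sketch of our reading of Figure 1(b) (a pre-activation BN-ReLU-Conv pair with an identity skip); the class and parameter names are ours, not the authors':

```python
import torch
import torch.nn as nn

class VoxResModule(nn.Module):
    """Sketch of a VoxRes module: BN -> ReLU -> 3x3x3 Conv, twice, plus an identity skip (Eq. 1)."""
    def __init__(self, channels=64):
        super().__init__()
        self.bn1 = nn.BatchNorm3d(channels)
        self.conv1 = nn.Conv3d(channels, channels, kernel_size=3, padding=1)
        self.bn2 = nn.BatchNorm3d(channels)
        self.conv2 = nn.Conv3d(channels, channels, kernel_size=3, padding=1)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        h = self.conv1(self.relu(self.bn1(x)))
        h = self.conv2(self.relu(self.bn2(h)))
        return x + h  # identity skip connection

# A 64-channel feature volume passes through with its shape unchanged.
x = torch.randn(1, 64, 20, 20, 20)
print(VoxResModule()(x).shape)  # torch.Size([1, 64, 20, 20, 20])
```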
3D anatomical structures vary greatly in shape, which demands different receptive field sizes for good recognition performance. To handle this large variation in shape and size, multi-level contextual information (i.e., 4 auxiliary classifiers C1-C4 in Figure 1(a)) with deep supervision [16, 4] is fused in our framework. The whole network is trained end-to-end by minimizing the following objective function with standard back-propagation:

$$\mathcal{L}(x, y; \theta) = \lambda \psi(\theta) - \sum_{\alpha} \sum_{x \in \mathcal{V}} \sum_{c} w_{\alpha}\, y_{cx} \log p_c^{\alpha}(x; \theta) - \sum_{x \in \mathcal{V}} \sum_{c} y_{cx} \log p_c(x; \theta) \qquad (3)$$

where the first term is the regularization term (L2 norm in our experiments) and the latter terms form the fidelity term, consisting of the auxiliary classifiers and the final target classifier. The tradeoff between these terms is controlled by the hyperparameter $\lambda$. $w_{\alpha}$ (where $\alpha$ indexes the auxiliary classifiers) denotes the weight of each auxiliary classifier; these weights were set to 1 initially and decreased to marginal values (i.e., $10^{-3}$) during our experiments. The weights of the network are denoted as $\theta = \{W\}$; $p_c(x; \theta)$ or $p_c^{\alpha}(x; \theta)$ denotes the predicted probability of the $c$-th class after the softmax classification layer for voxel $x$ in the volume space $\mathcal{V}$; and $y_{cx} \in \{0, 1\}$ is the corresponding ground truth, i.e., $y_{cx} = 1$ if voxel $x$ belongs to the $c$-th class, and 0 otherwise.
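To illustrate how the fidelity part of Eq. (3) could be assembled, the sketch below sums a cross-entropy loss over the final classifier and weighted auxiliary classifiers. It assumes each classifier outputs per-voxel class scores and leaves the regularization term to the optimizer's weight decay, so it is an illustration rather than the authors' exact implementation:

```python
import torch
import torch.nn.functional as F

def deep_supervision_loss(main_logits, aux_logits_list, target, aux_weights):
    """Fidelity term of Eq. (3): final classifier plus weighted auxiliary classifiers.
    Logits: (N, C, D, H, W); target: (N, D, H, W) with integer class labels."""
    loss = F.cross_entropy(main_logits, target)
    for w, aux_logits in zip(aux_weights, aux_logits_list):
        loss = loss + w * F.cross_entropy(aux_logits, target)
    return loss

# Example with 4 auxiliary classifiers (C1-C4); their weights start at 1 and are
# decayed toward ~1e-3 over the course of training, as described above.
N, C, D, H, W = 1, 4, 16, 16, 16
target = torch.randint(0, C, (N, D, H, W))
main = torch.randn(N, C, D, H, W)
aux = [torch.randn(N, C, D, H, W) for _ in range(4)]
print(deep_supervision_loss(main, aux, target, aux_weights=[1.0] * 4))
```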

[Figure 2 appears here: the multi-modality images (T1, T1-IR, T2-FLAIR) feed VoxResNet; its output, together with the images, feeds the Auto-context VoxResNet, whose result is compared against the ground truth.]

Figure 2: An overview of our proposed framework for fusing auto-context with multi-modality information.

2.3 Multi-modality and Auto-context Information Fusion

In medical image computing, the volumetric data are usually acquired with multiple imaging modal-
ities for robustly examining different tissue structures. For example, three imaging modalities in-
cluding T1, T1-weighted inversion recovery (T1-IR) and T2-FLAIR are available in brain struc-
ture segmentation task [20] and four imaging modalities are used in brain tumor (T1, T1 contrast-
enhanced, T2, and T2-FLAIR MRI) [21] and lesion studies (T1-weighted, T2-weighted, DWI and
FLAIR MRI) [19]. The main reason for acquiring multi-modality images is that the information from different modalities can be complementary, which yields more robust diagnosis results. Thus, we concatenate the multi-modality data as input, and the complementary information is jointly fused implicitly during network training, which demonstrated consistent improvement compared to any single modality.
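Concretely, the multi-modality fusion amounts to stacking the co-registered volumes along a channel axis before feeding them to the network; the following sketch uses made-up array shapes for illustration:

```python
import numpy as np

# Co-registered volumes from three MR sequences, all with the same spatial size (D, H, W).
t1 = np.zeros((48, 240, 240), dtype=np.float32)
t1_ir = np.zeros((48, 240, 240), dtype=np.float32)
t2_flair = np.zeros((48, 240, 240), dtype=np.float32)

# Stack along a new channel axis: shape (C, D, H, W); the network sees all modalities jointly.
multi_modal_input = np.stack([t1, t1_ir, t2_flair], axis=0)
print(multi_modal_input.shape)  # (3, 48, 240, 240)
```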
Furthermore, in order to harness the integration of high-level context information, implicit shape
information and original low-level image appearance for improving recognition performance, we
formulate the learning process as an auto-context algorithm [35]. Compared with the recognition
tasks in computer vision, the role of auto-context information can be more important in the medical
domain as the anatomical structures are roughly positioned and constrained [36]. Different from [36]
utilizing the probabilistic boosting tree as the classifier, we employ the powerful deep neural net-
works as the classifier. Specifically, given the training images, we first train a VoxResNet classifier
on the original training sub-volumes. The discriminative probability maps generated by VoxResNet are then used as context information, together with the original volumes (i.e., appearance information), to train a new classifier, Auto-context VoxResNet, which further refines the semantic segmentation and removes outliers. Unlike the original auto-context algorithm, which is applied iteratively [36], our empirical study showed that further iterative refinements gave only marginal improvements. Therefore, we chose the output of Auto-context VoxResNet as the final result.
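Schematically, the two-stage procedure looks as follows; `predict_probabilities` is a hypothetical stand-in for a trained first-stage VoxResNet, and the shapes are illustrative only:

```python
import numpy as np

def predict_probabilities(model, volume):
    """Hypothetical stand-in for a trained VoxResNet forward pass that returns
    per-class probability maps; here it simply returns uniform probabilities."""
    num_classes = 4  # background, CSF, GM, WM
    d, h, w = volume.shape[-3:]
    return np.full((num_classes, d, h, w), 1.0 / num_classes, dtype=np.float32)

# Stage 1: a VoxResNet is trained on the multi-modality appearance volumes (C, D, H, W).
appearance = np.zeros((6, 48, 240, 240), dtype=np.float32)
stage1_model = None  # placeholder for the trained first-stage network
context = predict_probabilities(stage1_model, appearance)  # (4, D, H, W) probability maps

# Stage 2: the Auto-context VoxResNet takes appearance + context maps as extra input channels.
auto_context_input = np.concatenate([appearance, context], axis=0)
print(auto_context_input.shape)  # (10, 48, 240, 240)
```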

3 Experiments
3.1 Dataset and Pre-processing

We validated our method on the MICCAI MRBrainS challenge data, an ongoing benchmark for evaluating algorithms for brain segmentation. The task of the MRBrainS challenge is to segment the


Figure 3: The example results of validation data (yellow, green, and red colors represent the WM,
GM, and CSF, respectively): (a) original MR images, (b) results of VoxResNet, (c) results of Auto-
context VoxResNet, (d) ground truth labels.

brain into four-class structures, i.e., background, cerebrospinal fluid (CSF), gray matter (GM) and
white matter (WM). The datasets were acquired at the UMC Utrecht from patients with diabetes and
matched controls with varying degrees of atrophy and white matter lesions [20]. Multi-sequence 3T
MRI brain scans, including T1-weighted, T1-IR and T2-FLAIR, are provided for each subject. The
training dataset consists of five subjects with manual segmentations provided. The test data includes
15 subjects with ground truth held out by the organizers for independent evaluation.
In the pre-processing step, following [32], we subtracted the Gaussian-smoothed image and applied Contrast-Limited Adaptive Histogram Equalization (CLAHE) to enhance local contrast. Six input volumes, including the original images and the pre-processed ones, were then used as input data in our experiments. We normalized the intensities of each slice to zero mean and unit variance before feeding it into the network.
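One plausible reading of this pre-processing pipeline is sketched below using SciPy and scikit-image; the Gaussian sigma, the CLAHE clip limit, and the exact ordering of the steps are our assumptions, not values reported in the paper:

```python
import numpy as np
from scipy.ndimage import gaussian_filter
from skimage import exposure

def preprocess_slice(slice_2d, sigma=10.0, clip_limit=0.01):
    """Sketch: subtract a Gaussian-smoothed version of the slice, apply CLAHE for local
    contrast enhancement, and normalize each channel to zero mean and unit variance."""
    highpass = slice_2d - gaussian_filter(slice_2d, sigma=sigma)
    rescaled = exposure.rescale_intensity(highpass, out_range=(0.0, 1.0))
    enhanced = exposure.equalize_adapthist(rescaled, clip_limit=clip_limit)

    def normalize(img):  # per-slice zero mean, unit variance
        return (img - img.mean()) / (img.std() + 1e-8)

    # The original and the enhanced slice are kept as separate input channels
    # (2 per modality, i.e., 6 channels for T1, T1-IR and T2-FLAIR).
    return normalize(slice_2d), normalize(enhanced)

original_channel, enhanced_channel = preprocess_slice(np.random.rand(240, 240))
print(original_channel.shape, enhanced_channel.shape)  # (240, 240) (240, 240)
```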

3.2 Evaluation and Comparison

The evaluation metrics of MRBrainS challenge include the Dice coefficient (DC), the 95th-percentile
of the Hausdorff distance (HD) and the absolute volume difference (AVD), which are calculated for
each tissue type (i.e., GM, WM and CSF) [20]. Details of the evaluation can be found on the challenge website1.
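For reference, the DC and AVD metrics for a single tissue class can be computed from binary masks as in the sketch below (standard definitions, not the challenge's evaluation code; the 95th-percentile HD requires surface distances and is omitted here):

```python
import numpy as np

def dice_coefficient(pred, truth):
    """DC = 2 |P intersect T| / (|P| + |T|) for binary masks of one tissue class."""
    pred, truth = pred.astype(bool), truth.astype(bool)
    intersection = np.logical_and(pred, truth).sum()
    return 2.0 * intersection / (pred.sum() + truth.sum() + 1e-8)

def absolute_volume_difference(pred, truth):
    """AVD (%) = |V_pred - V_truth| / V_truth * 100, with voxel counts as volumes."""
    return abs(int(pred.sum()) - int(truth.sum())) / float(truth.sum()) * 100.0

pred = np.zeros((4, 4), dtype=bool); pred[:2] = True    # 8 voxels predicted
truth = np.zeros((4, 4), dtype=bool); truth[:3] = True  # 12 voxels in ground truth
print(dice_coefficient(pred, truth), absolute_volume_difference(pred, truth))  # ~0.8  ~33.3
```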
To investigate the efficacy of multi-modality and auto-context information, we performed extensive
ablation studies on the validation data (leave one out cross-validation). The detailed results of cross-
validation are reported in Table 1. We can see that combining the multi-modality information dramatically improves the segmentation performance compared with any single image modality, especially on the DC metric, which demonstrates the complementary characteristics of the different imaging modalities. Moreover, by integrating the auto-context information, the DC performance can be further improved. The qualitative results of brain segmentation are shown in Figure 3, where combining multi-modality and auto-context information gives visually more accurate results than using multi-modality information alone.
Regarding the evaluation of testing data, we compared our method with several state-of-the-art meth-
ods, including MDGRU, 3D U-net [6] and PyraMiD-LSTM [32]. The MDGRU applied a neural net-
work with the main components being multi-dimensional gated recurrent units and achieved quite
1 MICCAI MRBrainS Challenge: http://mrbrains13.isi.uu.nl/details.php

Table 1: Cross-validation Results of MR Brain Segmentation using Different Modalities (DC: %,
HD: mm, AVD: %).
Modality            GM                    WM                    CSF
                    DC     HD    AVD      DC     HD    AVD      DC     HD    AVD
T1 86.96 1.36 4.67 89.70 1.92 6.85 79.58 2.71 17.55
T1-IR 80.61 1.92 8.45 85.89 2.87 7.42 76.44 3.00 12.87
T2-FLAIR 81.13 1.92 9.15 83.21 3.00 4.99 75.34 3.03 3.77
All 86.86 1.36 7.13 90.22 1.36 5.12 81.97 2.14 9.87
All+auto-context 87.83 1.36 6.22 90.63 1.36 2.22 82.76 2.14 5.50

good performance. The 3D U-net extended the previous 2D version [26] into a 3D variant and highlighted the necessity of volumetric feature representations when applied to 3D recognition tasks.
The PyraMiD-LSTM parallelised the multi-dimensional recurrent neural networks in a pyramidal
fashion and achieved compelling performance. The detailed results of testing data from different
methods on brain segmentation from MR images can be seen in Table 2. We can see that deep learn-
ing based methods can achieve much better performance than hand-crafted feature based methods.
The results of VoxResNet (see CU DL in Table 2), which fuses multi-modality information, achieved better performance than the other deep learning based methods, demonstrating the efficacy of our proposed framework. By incorporating the auto-context information (see CU DL2 in Table 2), the DC performance can be further improved. Overall, our methods achieved the top places on the challenge leaderboard out of 37 competing teams.

Table 2: Results of MICCAI MRBrainS Challenge of different methods (DC: %, HD: mm, AVD:
%. only top 10 teams are shown here).
Method                     GM                    WM                    CSF               Score*
                           DC     HD    AVD      DC     HD    AVD      DC     HD    AVD
CU DL (ours) 86.12 1.47 6.42 89.39 1.94 5.84 83.96 2.28 7.44 39
CU DL2 (ours) 86.15 1.45 6.60 89.46 1.94 6.05 84.25 2.19 7.69 39
MDGRU 85.40 1.55 6.09 88.98 2.02 7.69 84.13 2.17 7.44 57
PyraMiD-LSTM2 84.89 1.67 6.35 88.53 2.07 5.93 83.05 2.30 7.17 59
FBI/LMB Freiburg [6] 85.44 1.58 6.60 88.86 1.95 6.47 83.47 2.22 8.63 61
IDSIA [32] 84.82 1.70 6.77 88.33 2.08 7.05 83.72 2.14 7.09 77
STH 84.77 1.71 6.02 88.45 2.34 7.67 82.77 2.31 6.73 86
ISI-Neonatology [22] 85.77 1.62 6.62 88.66 2.07 6.96 81.08 2.65 9.77 87
UNC-IDEA [38] 84.36 1.62 7.04 88.68 2.06 6.46 82.81 2.35 10.5 90
MNAB2 [24] 84.50 1.70 7.10 88.04 2.12 7.74 82.30 2.27 8.73 109
*Score = Rank DC + Rank HD + Rank AVD; a smaller score means better performance.

3.3 Implementation Details

Our method was implemented using Matlab and C++ based on the Caffe library [13, 34]. It took about one day to train the network, while processing each test volume (size 240 × 240 × 48) took less than 2 minutes using a standard workstation with one NVIDIA TITAN X GPU. Due to the limited capacity of GPU memory, we cropped volumetric regions (size 80 × 80 × 80 × m, where m is the number of input image volumes and is set to 6 in our experiments) as input to the network. This was implemented on-the-fly during training by randomly sampling training sub-volumes from the whole input volumes. In the test phase, the probability map of the whole volume was generated with an overlap-tiling strategy for stitching the sub-volume results2.
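A simplified sketch of such an overlap-tiling inference is given below: overlapping sub-volumes are predicted independently and their probability maps are averaged where tiles overlap. The window size, stride, and `predict_fn` placeholder are illustrative, not the paper's settings:

```python
import numpy as np

def tile_and_stitch(volume, predict_fn, num_classes=4, window=80, stride=40):
    """Predict on overlapping sub-volumes of `volume` (C, D, H, W) and average the
    per-class probabilities in overlapping regions to form the full-volume map."""
    _, D, H, W = volume.shape
    prob_sum = np.zeros((num_classes, D, H, W), dtype=np.float32)
    counts = np.zeros((D, H, W), dtype=np.float32)

    def starts(size):
        # Window start positions covering the axis; the last window is flush with the edge.
        last = max(size - window, 0)
        s = list(range(0, last + 1, stride))
        if s[-1] != last:
            s.append(last)
        return s

    for z in starts(D):
        for y in starts(H):
            for x in starts(W):
                sub = volume[:, z:z + window, y:y + window, x:x + window]
                prob = predict_fn(sub)  # per-class probabilities with sub's spatial shape
                prob_sum[:, z:z + window, y:y + window, x:x + window] += prob
                counts[z:z + window, y:y + window, x:x + window] += 1.0
    return prob_sum / counts[None]

# Toy usage with a dummy predictor returning uniform probabilities.
volume = np.zeros((6, 48, 240, 240), dtype=np.float32)
probs = tile_and_stitch(volume, lambda sub: np.full((4,) + sub.shape[1:], 0.25, dtype=np.float32))
print(probs.shape)  # (4, 48, 240, 240)
```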

4 Conclusions
In this paper, we analyzed the capabilities of VoxResNet in the field of medical image computing and
demonstrated its potential to advance the performance of biomedical volumetric image segmenta-
tion problems. The proposed method extends deep residual learning to a 3D variant for handling
2 Project page: http://www.cse.cuhk.edu.hk/˜hchen/research/seg_brain.html
volumetric data. Furthermore, an auto-context version of VoxResNet is proposed to further boost the
performance under an integration of low-level appearance information, implicit shape information
and high-level context. Extensive experiments on the challenging segmentation benchmark corrobo-
rated the efficacy of our method when applied to volumetric data. Moreover, the proposed algorithm goes beyond brain segmentation and can be applied to other volumetric image
segmentation problems. In the future, we will investigate the performance of our method on more
object detection and segmentation tasks from volumetric data.

References
[1] P. Aljabar, R. A. Heckemann, A. Hammers, J. V. Hajnal, and D. Rueckert. Multi-atlas
based segmentation of brain images: atlas selection and its effect on accuracy. Neuroimage,
46(3):726–738, 2009.
[2] X. Artaechevarria, A. Munoz-Barrutia, and C. Ortiz-de Solórzano. Combination strategies in
multi-atlas image segmentation: application to brain mr data. IEEE transactions on medical
imaging, 28(8):1266–1277, 2009.
[3] H. Chen, D. Ni, J. Qin, S. Li, X. Yang, T. Wang, and P. A. Heng. Standard plane localiza-
tion in fetal ultrasound via domain transferred deep neural networks. Biomedical and Health
Informatics, IEEE Journal of, 19(5):1627–1636, 2015.
[4] H. Chen, X. J. Qi, J. Z. Cheng, and P. A. Heng. Deep contextual networks for neuronal structure
segmentation. In Thirtieth AAAI Conference on Artificial Intelligence, 2016.
[5] H. Chen, L. Yu, Q. Dou, L. Shi, V. C. Mok, and P. A. Heng. Automatic detection of cerebral mi-
crobleeds via deep learning based 3d feature representation. In 2015 IEEE 12th International
Symposium on Biomedical Imaging (ISBI), pages 764–767. IEEE, 2015.
[6] Ö. Çiçek, A. Abdulkadir, S. S. Lienkamp, T. Brox, and O. Ronneberger. 3d u-net: Learn-
ing dense volumetric segmentation from sparse annotation. arXiv preprint arXiv:1606.06650,
2016.
[7] J. Dai, K. He, and J. Sun. Instance-aware semantic segmentation via multi-task network cas-
cades. arXiv preprint arXiv:1512.04412, 2015.
[8] Q. Dou, H. Chen, L. Yu, L. Zhao, J. Qin, D. Wang, V. C. Mok, L. Shi, and P.-A. Heng. Auto-
matic detection of cerebral microbleeds from mr images via 3d convolutional neural networks.
IEEE transactions on medical imaging, 35(5):1182–1195, 2016.
[9] A. Giorgio and N. De Stefano. Clinical use of brain volumetry. Journal of Magnetic Resonance
Imaging, 37(1):1–14, 2013.
[10] K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. arXiv
preprint arXiv:1512.03385, 2015.
[11] K. He, X. Zhang, S. Ren, and J. Sun. Identity mappings in deep residual networks. arXiv
preprint arXiv:1603.05027, 2016.
[12] S. Ioffe and C. Szegedy. Batch normalization: Accelerating deep network training by reducing
internal covariate shift. arXiv preprint arXiv:1502.03167, 2015.
[13] Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and
T. Darrell. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint
arXiv:1408.5093, 2014.
[14] K. Kamnitsas, C. Ledig, V. F. Newcombe, J. P. Simpson, A. D. Kane, D. K. Menon, D. Rueck-
ert, and B. Glocker. Efficient multi-scale 3d cnn with fully connected crf for accurate brain
lesion segmentation. arXiv preprint arXiv:1603.05959, 2016.
[15] A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional
neural networks. In Advances in neural information processing systems, pages 1097–1105,
2012.
[16] C.-Y. Lee, S. Xie, P. Gallagher, Z. Zhang, and Z. Tu. Deeply-supervised nets. In AISTATS,
volume 2, page 6, 2015.
[17] R. Li, W. Zhang, H.-I. Suk, L. Wang, J. Li, D. Shen, and S. Ji. Deep learning based imaging
data completion for improved brain disease diagnosis. In International Conference on Medical
Image Computing and Computer-Assisted Intervention, pages 305–312. Springer, 2014.
[18] J. Long, E. Shelhamer, and T. Darrell. Fully convolutional networks for semantic segmentation.
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages
3431–3440, 2015.

[19] O. Maier, B. H. Menze, J. von der Gablentz, L. Häni, M. P. Heinrich, M. Liebrand, S. Winzeck,
A. Basit, P. Bentley, L. Chen, et al. Isles 2015-a public evaluation benchmark for ischemic
stroke lesion segmentation from multispectral mri. Medical Image Analysis, 35:250–269,
2017.
[20] A. M. Mendrik, K. L. Vincken, H. J. Kuijf, M. Breeuwer, W. H. Bouvy, J. De Bresser,
A. Alansary, M. De Bruijne, A. Carass, A. El-Baz, et al. Mrbrains challenge: Online eval-
uation framework for brain image segmentation in 3t mri scans. Computational intelligence
and neuroscience, 2015.
[21] B. H. Menze, A. Jakab, S. Bauer, J. Kalpathy-Cramer, K. Farahani, J. Kirby, Y. Burren, N. Porz,
J. Slotboom, R. Wiest, et al. The multimodal brain tumor image segmentation benchmark
(brats). IEEE Transactions on Medical Imaging, 34(10):1993–2024, 2015.
[22] P. Moeskops, M. A. Viergever, M. J. Benders, and I. Išgum. Evaluation of an automatic brain
segmentation method developed for neonates on adult mr brain images. In SPIE Medical
Imaging, pages 941315–941315. International Society for Optics and Photonics, 2015.
[23] D. Nie, L. Wang, Y. Gao, and D. Shen. Fully convolutional networks for multi-modality
isointense infant brain image segmentation. In 2016 IEEE 13th International Symposium on
Biomedical Imaging (ISBI), pages 1342–1345. IEEE, 2016.
[24] S. Pereira, A. Pinto, J. Oliveira, A. M. Mendrik, J. H. Correia, and C. A. Silva. Automatic
brain tissue segmentation in mr images using random forests and conditional random fields.
Journal of Neuroscience Methods, 270:111–123, 2016.
[25] A. Prasoon, K. Petersen, C. Igel, F. Lauze, E. Dam, and M. Nielsen. Deep feature learning for
knee cartilage segmentation using a triplanar convolutional neural network. In International
Conference on Medical Image Computing and Computer-Assisted Intervention, pages 246–
253. Springer, 2013.
[26] O. Ronneberger, P. Fischer, and T. Brox. U-net: Convolutional networks for biomedical im-
age segmentation. In International Conference on Medical Image Computing and Computer-
Assisted Intervention, pages 234–241. Springer, 2015.
[27] H. R. Roth, L. Lu, A. Seff, K. M. Cherry, J. Hoffman, S. Wang, J. Liu, E. Turkbey, and R. M.
Summers. A new 2.5 d representation for lymph node detection using random sets of deep
convolutional neural network observations. In International Conference on Medical Image
Computing and Computer-Assisted Intervention, pages 520–527. Springer, 2014.
[28] D. Sarikaya, L. Zhao, and J. J. Corso. Multi-atlas brain mri segmentation with multiway cut.
In Proceedings of the MICCAI Workshops: The MICCAI Grand Challenge on MR Brain Image
Segmentation (MRBrainS13), 2013.
[29] F. Shen and G. Zeng. Weighted residuals for very deep networks. arXiv preprint
arXiv:1605.08831, 2016.
[30] H.-C. Shin, H. R. Roth, M. Gao, L. Lu, Z. Xu, I. Nogues, J. Yao, D. Mollura, and R. M.
Summers. Deep convolutional neural networks for computer-aided detection: Cnn architec-
tures, dataset characteristics and transfer learning. IEEE transactions on medical imaging,
35(5):1285–1298, 2016.
[31] K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recog-
nition. arXiv preprint arXiv:1409.1556, 2014.
[32] M. F. Stollenga, W. Byeon, M. Liwicki, and J. Schmidhuber. Parallel multi-dimensional lstm,
with application to fast biomedical volumetric image segmentation. In Advances in Neural
Information Processing Systems, pages 2998–3006, 2015.
[33] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and
A. Rabinovich. Going deeper with convolutions. In Proceedings of the IEEE Conference on
Computer Vision and Pattern Recognition, pages 1–9, 2015.
[34] D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri. Deep end2end voxel2voxel
prediction. arXiv preprint arXiv:1511.06681, 2015.
[35] Z. Tu. Auto-context and its application to high-level vision tasks. In Computer Vision and
Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pages 1–8. IEEE, 2008.
[36] Z. Tu and X. Bai. Auto-context and its application to high-level vision tasks and 3d brain image
segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(10):1744–
1757, 2010.
[37] A. van Opbroek, F. van der Lijn, and M. de Bruijne. Automated brain-tissue segmentation by
multi-feature svm classification. 2013.

[38] L. Wang, Y. Gao, F. Shi, G. Li, J. H. Gilmore, W. Lin, and D. Shen. Links: Learning-based
multi-source integration framework for segmentation of infant brain images. NeuroImage,
108:160–172, 2015.
[39] S. Zagoruyko and N. Komodakis. Wide residual networks. arXiv preprint arXiv:1605.07146,
2016.
