ReSkipNetSkipConnectedConvolutionalAutoencoderforOriginalDocumentDenoising
ReSkipNetSkipConnectedConvolutionalAutoencoderforOriginalDocumentDenoising
net/publication/378528732
CITATIONS READS
0 52
7 authors, including:
All content following this page was uploaded by Ehsanur Rahman Rhythm on 29 February 2024.
Abstract—Data pre-processing, data analysis, and Optical picture noise and blur. A typical degraded document from our
Character Recognition need a huge amount of clean data, and original dataset is shown in Fig. 1.
document images are usually a good source for this. However, The field of image restoration techniques has garnered
document images frequently exhibit blurring and various other
forms of noise, which can pose challenges in their manipulation increasing attention in recent decades. The objective is to
and analysis. To denoise and deblur such document images, generate a new image that exhibits reduced levels of noise
autoencoders have been used for a long time. For this task, and blur, while also closely resembling the original image.
we propose a novel Convolutional Autoencoder Network which Similar to prior studies, the distorted image of poor quality
is composed of multiple skip-connected residual blocks and can be expressed as:
other layers for supporting the encoder and decoder parts.
This model not only uses less computational power to denoise y = D(x) + n (1)
existing document image datasets but also performs well. While
prior research primarily concentrates on optimizing evaluation where y represents the degraded image, x represents the
metrics, our approach additionally prioritizes larger resolution original image of good quality, D denotes the degradation
input sizes. This characteristic of using larger image sizes function, and n represents the noise as denoted in equation
enhances its practicality and usability as real-world documents (1). The process of image restoration is sometimes referred
are typically characterized by a higher word density. Moreover,
in order to further advance the development of our model, we to as an inverse issue, that is, the estimation of variable x
produced an original dataset and proceeded to train our model based on the observation of variable y.
on this dataset, resulting in satisfactory outcomes.
Index Terms—Convolutional Autoencoder, Residual Block,
Skip Connections, Denoising, Documents, Image Processing,
Original Dataset
I. I NTRODUCTION
M AXI2
P SN R = 10 · log10 (2)
M SE
M AXI
P SN R = 20 · log10 √ (3)
M SE
P SN R = 20 · log10 (M AXI ) − 10 · log10 (M SE) (4)
where I is the given m x n monochrome image and K is its Fig. 7. Noisy vs Predicted Clean Images from Original Dataset
noisy approximation.