ICACE 2020 Presentation
ICACE 2020 Presentation
T ime Period
January 2013 – December 2018
5
Dataset Summary
Boxes showing median, quartiles and range. Whiskers are showing the range for all variables 6
N o. of Indoor Patients
100
120
0
20
40
60
80
1/1/2013
4/1/2013
7/1/2013
10/1/2013
1/1/2014
4/1/2014
7/1/2014
10/1/2014
1/1/2015
4/1/2015
7/1/2015
10/1/2015
1/1/2016
Date
4/1/2016
7/1/2016
10/1/2016
1/1/2017
4/1/2017
Output/Response Variable
Daily Indoor Patient Data, N IDC H (2013-2018)
7/1/2017
10/1/2017
1/1/2018
4/1/2018
7/1/2018
10/1/2018
7
Data Prepocessing: Missing Value Imputation
SO2, NO2, CO, O3, PM2.5, PM10, Solar Radiation,
Variables Containing Missing Values
No. of indoor Patients
S ummary
Imputation T echnique Unprocessed Data (%) After Imputation (%)
CAMS CAMS CAMS CAMS CAMS CAMS
• R eplaced a missing value of a specific 1 2 3 1 2 3
chronology in a year with the SO2 78.5 21.2 12.1 7.3 4.1 4.5
average value of the previous and NO2 84.1 42.0 12.8 8.4 2.6 1.6
next year’s data of the same CO 54.7 14.6 24.6 10.2 3.1 0.2
8
Data Prepocessing: Data Cleaning and Data Scaling
9
Multiple Linear Artificial Neural Network
Regression (ML P) Arc hitec ure
Implementation Framework: PyTorch (version 0.23.1), on
Implementation Framework: Python
Scikit-learn (version 0.23.1), on
Python
11
𝑦𝑦 = 𝛼𝛼 + � 𝛽𝛽𝑖𝑖 𝑋𝑋𝑖𝑖
𝑖𝑖=1
H ere,
𝑦𝑦 = value of response variable (daily
no. of indoor patients),
𝛼𝛼 = unknown regression bias,
𝛽𝛽𝑖𝑖 = unknown regression coefficients,
𝑥𝑥𝑖𝑖 = values of independent variables.
Contact:
Rizvan Ahmed Rafsan. Email: [email protected]