Predictive Machine Learning Algorithms for Metro R
Predictive Machine Learning Algorithms for Metro R
Article
Predictive Machine Learning Algorithms for Metro Ridership
Based on Urban Land Use Policies in Support of
Transit-Oriented Development
Aya Hasan AlKhereibi 1 , Tadesse G. Wakjira 2 , Murat Kucukvar 1 and Nuri C. Onat 3, *
1 Industrial and Systems Engineering, College of Engineering, Qatar University, Doha P.O. Box 2713, Qatar
2 Civil and Architectural Engineering, College of Engineering, Qatar University, Doha P.O. Box 2713, Qatar
3 Qatar Transportation and Traffic Safety Center, College of Engineering, Qatar University,
Doha P.O. Box 2713, Qatar
* Correspondence: [email protected]
Abstract: The endeavors toward sustainable transportation systems are a key concern for planners
and decision-makers where increasing public transport attractiveness is essential. In this paper, a
machine-learning-based predictive modeling approach is proposed for metro ridership prediction,
considering the built environment around the stations; it is in the best interest of sustainable transport
planning to ultimately contribute to the achievement of Sustainable Development Goals (UN-SDGs).
A total of twelve parameters are considered as input features including time of day, day of the
week, station, and nine types of land use density. Hence, a time-series database is used for model
development and testing. Several machine learning (ML) models were evaluated for their predic-
tive performance: ridge regression, lasso regression, elastic net, k-nearest neighbor, support vector
regression, decision tree, random forest, extremely randomized trees, adaptive boosting, gradient
boosting, extreme gradient boosting, and stacking ensemble learner. Bayesian optimization and grid
search are combined with 10-fold cross-validation to tune the hyperparameters of each model. The
performance of the developed models was validated based on the test dataset using five quantita-
tive performance measures. The results demonstrated that, among the base learners, the decision
Citation: AlKhereibi, A.H.; Wakjira,
tree showed the highest performance with an R2 of 87.4% on the test dataset. KNN and SVR were
T.G.; Kucukvar, M.; Onat, N.C.
the second and third-best models among the base learners. Furthermore, the feature importance
Predictive Machine Learning
investigation explains the relative contribution of each type of land use density to the prediction of the
Algorithms for Metro Ridership
metro ridership. The results showed that governmental land use density, educational facilities land
Based on Urban Land Use Policies in
Support of Transit-Oriented
use density, and mixed-use density are the three factors that play the most critical role in determining
Development. Sustainability 2023, 15, total ridership. The outcomes of this research could be of great help to the decision-making process
1718. https://doi.org/10.3390/ for the best achievement of sustainable development goals in relation to sustainable transport and
su15021718 land use.
1. Introduction
Sustainable transport includes the application of the sustainable development con-
Copyright: © 2023 by the authors.
cept in the process of planning and development of transport infrastructure. Refs. [1–3]
Licensee MDPI, Basel, Switzerland. suggested that sustainable transportation systems evolve the application of sustainable
This article is an open access article development planning strategies; for instance, transportation sustainability is defined as
distributed under the terms and “meeting the current mobility and transport needs nevertheless not compromising the
conditions of the Creative Commons future generations to meet those needs”. According to OECD, Taiwan Transport Institute,
Attribution (CC BY) license (https:// and ref. [4], sustainable transport must be able to meet long-term and simultaneous envi-
creativecommons.org/licenses/by/ ronmental, social, and economic needs and impacts. There are three aspects of sustainable
4.0/). transport: environmental, social, and economic. Environmental aspect requires taking
into account the external effects of the transport process; the social aspect also requires
considering the interests of residents at different socio-economic levels while improving
transport; economic aspects require efficient use and conservation of resources [1].
In this respect, to sustain the public transportation resources, planners should address
the transportation systems operations with respect to the built environment around sta-
tions; in other words, they are expected to assure the highest efficiency of the transport
service by increasing transport accessibility to as many residents as possible [4]. The trade-
offs between transportation operation needs and built environment considerations could
easily be solved by having high-density urban areas centered on transit service, yet an
acceptable economic aspect should be maintained [5]. While the socio-economic attribute
of sustainable development can be improved through transport infrastructure through
logistics and multimodal capacity planning, intra-regional accessibility of the transport
infrastructure promotes economic dimensions and other social interactions. Accessibility
plays an undeniable role in urban areas and also ensures infrastructure efficiency combined
with the multimodality of public transport [2].
With the rapid development of urbanization, the metro has become one of the main
drivers of public transport due to its various advantages, such as high efficiency, high capac-
ity, convenience, etc. [1]. During the planning phase of urban transport and construction,
it is important to examine many urban indicators from a systemic point of view to cover
urban-transport phenomena, such as density, ridership, and accessibility. Metro ridership
at the station level is an important factor in determining the size of stations and access
to facilities. Various components of the urban system (e.g., land use and socioeconomic
aspects) require the kind of modeling presented in this paper, which could be of help to
accurately estimate and predict the number of passengers, as well as analyze influencing
factors. Recently, with the development of artificial intelligence and computational capa-
bilities, machine learning (ML) techniques have gained considerable attention owing to
their ability to effectively determine the relationship between the response variable and
its predictors in a complex system [6]. In spite of their great efficacy, the literature lacks
the application of machine learning models to the metro transportation system ridership,
considering the built environment.
To this end, this study mainly focuses on metro transportation system ridership on
a station level, and the land use density component of the urban system. The research
contributes to both theory and practice as follows. The proposed methods examined
in this research enable a time series prediction of metro ridership, considering the built
environment and transportation sustainability. On the practical aspect, the examined model
outcomes are applied to the case of Doha Metro and could be similarly applied to similar
regions and cases. Furthermore, the outcomes of this research have numerous implications
for both transportation operation and urban planning policymakers, which significantly
bridge the gaps between theory and practical urban-transport models.
changes from a vehicle-oriented society to a TOD [13]. Moreover, ref. [14] in their study
examined the spatiotemporal heterogeneity in the nonlinear influence of transit-oriented
development (TOD) on metro ridership. The findings have led to a better understanding of
the spatiotemporal heterogeneity in the nonlinear influence of TOD on metro ridership.
In addition to the current macro-level literature, a number of survey-based studies
have provided overlapping results, as they have recognized anomalies in the existing
paradigm. Ref. [15] pointed out that the influences of travel time and financial costs on
modality alternatives are independent of land use influences. In addition to the infrastruc-
ture of the city, demographic information, and the choice of transportation mode could be
a matter of higher cognitive processing on the part of citizens, and this decision is plagued
by psychological and emotional patterns. Ref. [2] accepts this as true, with ref. [3] thereon
stating that urban fabric and transportation interactions may be accomplished by observing
past tendencies in the urban fabric and a stripped-down upsurge in the ridership of public
transportation. Despite the inflated federal investments in public transportation in several
societies, such as the state of Qatar, the shared idea, implicit assumptions, and perceptions
push right up against public transportation participation.
• Degradation of the urban landscape due to new road construction and new vehicles on
the roads, the historic buildings demolition, the reduction of open spaces, and urban
sprawl [7].
• Space use by a vehicle enables the movement of car drivers but decreases other
accessibility, as traffic routes turned out to be obstacles due to the fact that parked cars
create difficulties for pedestrians, cyclists, and people who have disabilities, which
leads to a predominance of traffic in urban environments.
• Global warming is the result of the usage of fossil fuels. Currently, transportation
systems account for 25% of CO2 emissions, and this level is increasing in both com-
parative and absolute terms. Moreover, transportation is entirely dependent on fuel,
which is a non-renewable source of energy.
• Furthermore, transportation contributes to change in the urban fabric of the city, thus,
development and land use factors should be added to the list above.
• Vehicle use has contributed to the city’s decentralization, combined with efficient
public transport. This significantly increases commuting time and the development of
more dispersed movement patterns with a center in the city center, thereby increasing
dependence on cars and reducing opportunities for developing an efficient public
transport community; transport has become a driving force of change.
• Industry displacement and globalization (for instance, the information economy)
have led to newfangled patterns of distribution and an increase in freight traffic
at the global, regional, and local levels. Technologies and solutions that could be
of help in promoting the relatively most efficient use of space and eliminating the
total additional land allocated for the developments are highly needed. There is
a sort of common agreement on these issues, and largely the variety of strategies
existing is known, but progress has been made, and it’s slow to integrate sustainability
into everyday solutions.
• With the intensive transformation of the built environment in the state of Qatar from a
traditional mixed-use, high-density fabric to inaccessible, car-oriented superblocks of
the newly developed cities and closed communities on the periphery, urban mobility
is noticeably affected. Along with the use of retail/commercial and office space, they
benefit greatly from the country’s sustainability [25]. Undoubtedly, based on the
Transit Oriented Development (TOD) consensus, metro transport in the state of Qatar
promises rapid and ambitious growth and is expected to become one of the main
methods of connecting different districts. TOD is an auspicious tool that enhances
greener mobility by switching from cars to the metro, encouraging short-distance
walking, and using local public transport for long distances [25], and it can slow
sprawl through the compact development of urban areas [26,27]. With the motivation
to promote rail transport systems, studies looking at the factors influencing the number
of passengers in transit have become interesting, but the impact of local communities
on the number of passengers in transit is surprisingly different from the various
contexts of the city.
a. Machine Learning-Based Ridership Prediction for Public Transportation
In general, optimization models are used for the planning of the transit network, as
well as the optimum transfer, the orientation of the route, and, therefore, the ridership [28].
Ref. [29] outlined an empirically supported genetic algorithm to enhance the performance
of the present networks by plummeting the vehicles while not penalizing the typical travel
time. In this sense, it’s been shown that heuristic algorithms are helpful in resolving
numerous transport problems; for example, the matter of the road trafficker, the routing
of multi-depot vehicles, or the shuttle network design [30–32], to simply call a couple of
examples. Ref. [33] planned a computational tool for optimizing massive routes of transit
networks that minimize transfers and optimize route openness through improved coverage
of service. They tested the tool and functioned it to a significant and accurate network
optimization drawback to the city of Miami. Ref. [34] investigated a three-tier hierarchic
optimization method for resolving the mass transit network development problems that are
Sustainability 2023, 15, 1718 5 of 20
typical of versatile large-scale mass transit choices and the exploitation of new technologies
for passenger-vehicle communication. Ref. [35] deliberated on accessibility, which is a vital
component of service delivery. Access sometimes has an approximate service associated
with the cost, while access is expounded to the adequacy of the transit system to induce
individuals from wherever they approach to where they are in an inexpensive period. In
Australia, ref. [36] established a hybrid coverage model to concurrently expand access to
services and, moreover, increase public transportation accessibility. The scholar suggested
that the operation aspect of transport planning is the spatial effects of the service. Ref. [37]
estimated rural mass transit traveler models that reply to public demand employing types
Sustainability 2023, 15, x FOR PEER REVIEW 4 of 22
of variables and derived helpful data for correct planning transport services that respond
to local demand.
2. Methods
2.1. Data Collection
The State of Qatar is located on the east coast of the Persian Arabian Gulf. It is
connected to the Arabian Peninsula and borders Saudi Arabia to the south. Qatar has
been shaped by rapid globalization and urbanization. For the past five decades, notable
economic change from a fishing and pearl collecting-based economy to a thriving and
varied economy based on the production and export of natural gas/oil has been observed.
Today, Qatar has almost 13% of the total international supply of natural gas reserves [43].
The country’s growth and hosting of several mega-events, such as the 2006 Asian Games
and the 2022 World Cup, led to intensive urban growth and development. The country
witnessed a significant increase in population over the past two decades, from a projected
urban population of 492 hundred thousand in 2000 to more than 2.4 million people in
2020. Consequently, the government confronted major challenges in the management of
the growth, transport, infrastructure, accommodations, and preservation of the ecosystems.
The changes included the construction of a sophisticated road network, including
road extensions and a ring road/expressway system. The traditional low-rise housing
was almost abandoned for modern residential villas, typically three-story and high-rise
apartments [44–46]. In parallel with its rapid urbanism growth. Sustainable development
of transport systems is a core diminution in Qatar National Vision (QNV) 2030, which
includes plans to develop a 300 km metro system with four lines and 98 stations connecting
the international airport, stadiums, and urban areas [47]. In 2019, three lines and 38 stations
are in operation.
The input features comprised a total of twelve parameters including time of day, day
Sustainability 2023, 15, 1718 of the week, station, and land use density for open space and recreation facilities (indoor-
8 of 20
outdoor), religious facilities, education facilities, retail/commercial, residential, special
use, transportation, and government facilities. The distribution of the ridership for the
complete dataset is shown in Figure 4 regardless of the land-use density. During the pre-
dataset is shown in Figure 4 regardless of the land-use density. During the pre-processing
processing stage, the database shown in Figure 4 is normalized based on Equation (1) be-
stage, the database shown in Figure 4 is normalized based on Equation (1) below:
low:
j j
j Xi − Xmin
Xi, n = ,j = j
= 1,
, j ,= 1, 2, 2,. .…. ,, M;; i =
=1,1,2,2,…., . . , N (1)
(1)
Xmax − Xmin
where is the ith observation of jth input features and , is its corresponding nor-
j j
where X is
malizedi value,the ith observation
and of jth
areinput features and
the maximum Xi,minimum
and n is its corresponding
values of thenormalized
th input
j j
features,
value, Xmax and represents thethe
Xmin are total observation
maximum andnumbers,
minimum andvaluesisof the number
the of input
jth input fea-
features,
N represents the total observation numbers, and M is the number of input features.
tures.
Figure 4. Ridership
Figure 4. Ridership data
data was
was used
used for
for model
model development.
development.
Figure
Figure 55 represents
represents the the diversity
diversity of
of land
land use
use within
within the
the 800
800 mm catchment
catchment area
area at
at the
the
38
38 metro
metro stations.
stations. ItItisisclear
clearfrom
fromthis
thisfigure
figurethat
thatthe
themajority
majorityofof
the land
the use
land in in
use this area
this areais
High-Density Residential land use with a wide mix of offices, commercial, and recreation
zones. The developed model utilized the data within the catchment areas at each station to
predict the ridership at each metro station.
is High-Density Residential land use with a wide mix of offices, commercial, and recrea-
oftion
each parameter
zones. of the models.
The developed The penalty
model utilized function
the data within thein the elastic
catchment netatiseach
areas given
sta- by:
ε(tion
λ, βto
)=predict β2i +
λ1 ∑ithe λ2 ∑i | β iat
ridership |. each metro station.
2.3. Predictive
2.3.2. K-Nearest Models
Neighbor
2.3.1. Regularization
The K-nearest neighbor (KNN for short) is among the nonparametric supervised ML
algorithms that can beis used
A regularization to solveform
an extended classification as well as
of linear regression regression
with an aim toproblems.
enhance theThe
predicted outputability
generalization in KNN is evaluated
of the model. Inas the mean
contrast of the regression,
to linear predictionswhich from the K data points
computes the
parameters
nearest to theestimates by minimizing
query point. Provided athe RMSE data
training of theset {( xi , yi )}aiN=regularization
predictions, 1 and a query param-
point Xq ,
eter
the that punishes
K-nearest the models
neighbor entails with multiple modelofparameters
the determination the pointsisinintroduced
the trainingin dataset
regulari-that
zation,
are as illustrated
nearest below:
to Xq [2]. The prediction of Xq is then computed as the weighted average of the
predictions from the K observations nearest to+Xq(. , )
min (2)
2.3.3.
whereSupport Vector
controls the Regression
trade-off between bias-variance.
Support vector regression
Ridge regression (SVR)
(RR), elastic netutilizes
(EN), andkernel functions
lasso (short fortoleast
mapabsolute
the datashrinkage
into a wide-
and selectionspace,
dimensional operator)
where regression are the foremost
linear separation usuallyDProvided
is conceivable. used regularization algo- set
aEtraining data
{( xi , yi )} i=1 e R × R, with N data points, where xi e X = xi , . . . , xi ⊆ R arethe
rithms. InNridge regression,
Q the regularization function is taken into
1 account
Q because
Q input
total of the sq. of the constants, ( , ) = ∑ , whereas the absolute values of the coef-
parameters Q and yi is the response, SVR the function f(x) in Equation (3).
ficient are employed in lasso regression, ( , ) = ∑ | |. The penalty in lasso regression
tends to set the model parameters to 0zero,
f ( x ) = ∑ αi − αi K ( xwhile RR tends to scale back0 the absolute values
i , x ) + b subject to αi , αi e [0, C ] (3)
of each parameter of the models.
ie SV The penalty function in the elastic net is given by:
( , )= ∑ + ∑ | |.
where C denotes the regularization parameter, K ( xi , x ) is the kernel function, SV represents
support vectors, b is the bias, and αi and αi0 are multipliers of Lagrangians of the inferior
and superior SVs.
Sustainability 2023, 15, 1718 10 of 20
yi − yi (t) ( xi )
(t)
Li = , i = 1, . . . , N (4)
max yi − yi (t) ( xi )
Sustainability 2023, 15, 1718 11 of 20
2
yi − yi (t) ( xi )
(t)
Li = , i = 1, . . . , N (5)
max yi − yi ( p) ( xi )
|yi −yi (t) ( xi )|
( )
(t) max |yi −yi ( p) ( xi )|
Li = 1−e , i = 1, . . . , N (6)
(d) Determine the mean of the loss:
n
L(t) = ∑ t =1 L i ( t ) wi ( t ) , Lt e [0, 1] (7)
L(t)
βt = (9)
1 − L(t)
The final prediction of the response is computed as the final cumulative predictions
using T trees.
Gradient boosted trees as a more general form of boosting ensemble allows the usage of
arbitrary loss functions, unlike AdaBoost. Provided a training set {( xi , yi ), i = 1, 2, . . . , N },
GBT performs the following:
(a) Start model with constant value: Fo ( x ) = arg min ∑iN=1 L(yi , γ) in which L, yi and γ
γ
represent loss function, observed response value, and predicted response value.
(b) For t = 1 to T (where T denotes the total number of estimators):
h i
∂L(yi ,F ( xi ) )
i. Determine git = − ∂F ( x )
, i = 1, . . . , N which is the negative
i F ( x )= Ft−1 ( x )
gradient descent.
Train CART ht ( x ) using xi , git iN=1 as training examples.
ii.
iii. Compute the value of γt by solving γt = arg min ∑i L(yi , Ft−1 ( xi ) + γht ( xi )).
γ
iv. Update the model: Ft ( x ) = Ft−1 ( x ) + γt ht ( x ).
(c) Finally, output FT ( x ).
Another variation of boosting ensemble, extreme gradient boosting (xgBoost) is intro-
duced in 2016 by [52] (Chen & Guestrin, 2016) as an effective implementation of gradient
boosting. A regularization term is introduced in the objective function of xgBoost for
the purpose of minimizing the complexity of the model and overcoming the problem of
overfitting, as given in Equation (10) [52] (Chen & Guestrin, 2016).
N t
∑L yi , ŷi (t) + ∑ Ω( f i ) (10)
i =1 i =1
1 K
Ω( f ) = γD + λ ∑ w2j (11)
2 j =1
The first term in Equation (10) denotes the loss function, while the second term denotes
the complexity of the model. Moreover, λ and j represent the regularization parameters,
D is the number of leaves, and γ and w j denote the complexity and weight of each leaf.
(e) the index of agreement (IA), and (f) the Kling-Gupta efficiency (KGE), as given by
Equation (12) through Equation (16). Low error (RMSE and MAE), and high IA, R2 , and
KGE demonstrate the best predictive model. The KGE as an improvement of Nash-Sutcliffe
efficiency considers three important measures: namely, correlation, bias, and variability
in a more balanced way, as given in Equation (16). KGE = 1 shows a perfect agreement
between the actual and predicted ridership. The same is true for R2 and IA.
1 N
N ∑ i =1 i
MAE = |y − ŷi | (12)
v
u1 N
u
RMSE = t ∑ (yi − ŷi )2 (13)
N i =1
2
∑iN=1 (yi − ŷi )
R2 = 1 − 2
(14)
∑iN=1 (yi − y)
2
∑in=1 (yi − ŷi )
IA = 1 − 2
, 0 < IA ≤ 1 (15)
∑in=1 (|yi − y| + |ŷi − y|)
q
KGE = 1 − (r − 1)2 + (α − 1)2 + ( β − 1)2 (16)
where yi is the ith observed value of output variable, ŷi is the ith the estimated value
of the output variable, y is the average value of the output variable, r is the linear cor-
relation between the actual and predicted response variable, α = σy /σy is a measure of
variability, β = µy /µy , is a bias term, σy and σy are the standard deviation of the pre-
dicted and actual response variable, and µy and µy are the mean of the predicted and
actual response variable.
Models Parameters
RR alpha = 0.1
LR alpha = 0.001
EN alpha = 0.001
KNN Number of neighbors = 2
SVR Kernel = RBF, C = 95, ε = 0.01, gamma = ‘auto’
CART Maximum depth = 10, maximum features = 7
Number of estimators = 15, maximum features = 11, maximum
RF
depth = 13, minimum sample leaf = 1, minimum sample split = 2
Number of estimators = 20, maximum features = 12, maximum
ET
depth = 13, minimum sample leaf = 1, minimum sample split = 2
Base learner = CART, number of estimators = 30, learning rate =
ADB 0.25, maximum depth of tree for base learner = 12, maximum
features for base learner = 11
Number of estimators = 150, maximum depth = 8, learning rate =
GBT
0.25, subsample = 0.8, maximum features = 12
Number of estimators = 145, learning rate = 0.15, subsample = 0.5,
xgBoost maximum depth = 11, colsample by node = 1.0, colsample by
level = 1.0, colsample by tree = 1
Sustainability 2023, 15, 1718 13 of 20
Figure 7.
Figure 7. Performance
Performance measures
measures for
for single
single models.
models.
The performance of the CART model can be improved with the use of ensemble
learners, as can be observed in Figure 8, which shows the scatter plots of the actual and
predicted ridership based on the ensemble models. As noted in the figure, the ensemble
learners showed good prediction performance with R2 ≥ 95.4% for both the test and train
datasets. Among the five bagging and boosting algorithm-based ensemble models (i.e.,
RF, ET, ADB, GBT, and xgBoost), xgBoost exhibited the best predictive performance in
terms of all the performance measures, as can be seen in Figure 9, which compares the
ensemble models in terms of the five performance metrics on the train as well as test sets.
Moreover, all ensemble models showed a better predictive performance compared with
that of the base learners (see Figures 6 and 8 and Table 2). The performance of xgBoost was
improved with the use of a stacking ensemble (SE) in which the best three models (xgBoost,
GBT, and ADB) were stacked using linear SVR as a meta-model. Overall, via comparing
the five quantitative performance measures, the proposed stacking ensemble was the best
predictive
Sustainability 2023,model
15, x FOR PEER the largest R2 , KGE, and IA and lowest RMSE AND MAE relative
withREVIEW 16 of to
22
Figure
Figure 10.10. Featureimportance
Feature importance analysis
analysisbased
basedonon
gradient treetree
gradient boosting.
boosting.
4. Model Implications
In contrast, on Policymaking land use density has the least influence on the
the retail/commercial
Thisofsection
prediction illustrates the
total ridership. relationship
These findings between the implications
demonstrate the nature of of the
the occupiers
prediction of the
urban-transport model investigated in this research and each related
three most important land uses that tend to use metro transit, such as the governmental Sustainable Devel-
opment Goal and
and educational sectors. pursues to make proposals for converting the transportation sector to a
more “sustainable” one. Several SDGs rely on transportation to achieve their goals. Trans-
portation
4. Model might not show
Implications on aPolicymaking
large role in a destination, yet for the achievement of the des-
tination and its objectives, transportation is essential and performs as an important “ena-
ThisHence,
bler”. section illustrates
it has been shown the that
relationship between
if one ignores the implications
sustainable transportation,ofit the prediction
becomes
urban-transport
much more problematic to meet most of the proposed SDGs. Figure 11 presents the urban-Devel-
model investigated in this research and each related Sustainable
opment Goal and pursues
transport-related SDGs, andtothe make proposals
following sectionfor converting
suggests policythe transportation
implications sector to
that could
a more “sustainable”
be in helping to achieveone.theseSeveral SDGs
goals, from thisrely on transportation
research study perspective: to achieve their goals.
Transportation might
SDG3: Ensure not show
healthy a large
lives and role
promote in a destination,
well-being for all of allyet
agesfor the achievement of the
The integrated and significantly planned transport networks have the probability to
enhance road safety using human-centered planning, transit-oriented development, or
traffic improvement actions, such as speed maps. In developing countries, measures to
build dimensions in road construction and transport policy and their execution are crucial
elements in attaining this goal. To conclude the well-planned road network and the insur-
Sustainability 2023, 15, 1718 16 of 20
Author Contributions: Conceptualization, A.H.A., T.G.W., M.K. and N.C.O.; Methodology, A.H.A.,
T.G.W., M.K. and N.C.O.; Validation, N.C.O.; Formal analysis, A.H.A. and T.G.W.; Investigation,
A.H.A. and N.C.O.; Data curation, A.H.A., T.G.W. and M.K.; Supervision, M.K. and N.C.O. All
authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
Sustainability 2023, 15, 1718 19 of 20
References
1. Chan, W.C.; Ibrahim, W.H.W.; Lo, M.C.; Suaidi, M.K.; Ha, S.T. Sustainability of public transportation: An examination of user
behavior to real-time gps tracking application. Sustainability 2020, 12, 9541. [CrossRef]
2. Castanho, R.A.; Behradfar, A.; Vulević, A.; Naranjo Gómez, J.M.N. Analyzing Transportation Sustainability in the Canary Islands
Archipelago. Infrastructures 2020, 5, 58. [CrossRef]
3. Ercan, T.; Onat, N.C.; Tatari, O.; Mathias, J.D. Public transportation adoption requires a paradigm shift in urban development
structure. J. Clean. Prod. 2017, 142, 1789–1799. [CrossRef]
4. Reisi, M.; Aye, L.; Rajabifard, A.; Ngo, T. Transport sustainability index: Melbourne case study. Ecol. Indic. 2014, 43, 288–296.
[CrossRef]
5. Wey, W.M.; Huang, J.Y. Urban sustainable transportation planning strategies for livable City’s quality of life. Habitat Int. 2018,
82, 9–27. [CrossRef]
6. Mangalathu, S.; Hwang, S.H.; Choi, E.; Jeon, J.S. Rapid seismic damage evaluation of bridge portfolios using machine learning
techniques. Eng. Struct. 2019, 201, 109785. [CrossRef]
7. Banister, D. Sustainable urban development and transport —A eurovision for 2020. Transp. Rev. 2000, 20, 113–130. [CrossRef]
8. May, A.D. Urban Transport and Sustainability: The Key Challenges. Int. J. Sustain. Transp. 2013, 7, 170–185. [CrossRef]
9. Politis, I.; Georgiadis, G.; Nikolaidou, A.; Kopsacheilis, A.; Fyrogenis, I.; Sdoukopoulos, A.; Verani, E.; Papadopoulos, E. Mapping
travel behavior changes during the COVID-19 lock-down: A socioeconomic analysis in Greece. Eur. Transp. Res. Rev. 2021, 13, 21.
[CrossRef]
10. Karjalainen, L.E.; Juhola, S. Urban transportation sustainability assessments: A systematic review of literature. Transp. Rev. 2021,
41, 659–684. [CrossRef]
11. Litman, T.A. Comprehensive Transport Planning Framework—Best Practices for Evaluating All Options and Impacts; Victoria Transport
Policy Institute: Victoria, BC, Canada, 2012; pp. 1–59. Available online: https://[email protected] (accessed on 7
November 2022).
12. Litman, T.; Burwell, D. Issues in sustainable transportation. Int. J. Glob. Environ. Issues 2006, 6, 331–347. [CrossRef]
13. AlKhereibi, A.; AlSuwaidi, M.; Al-Mohammed, R.; Pokharel, S.; Ayari, M. An integrated urban-transport smart growth model
around metro stations: A case of Qatar. Transp. Res. Interdiscip. Perspect. 2021, 10, 100392. [CrossRef]
14. Su, S.; Wang, Z.; Li, B.; Kang, M. Deciphering the influence of TOD on metro ridership: An integrated approach of extended
node-place model and interpretable machine learning with planning implications. J. Transp. Geogr. 2022, 104, 103455. [CrossRef]
15. Zhang, Y.; Haghani, A. A gradient boosting method to improve travel time prediction. Transp. Res. Part C Emerg. Technol. 2015,
58, 308–324. [CrossRef]
16. Singh, Y.J.; Lukman, A.; Flacke, J.; Zuidgeest, M.; van Maarseveen, M.F.A.M. Measuring TOD around transit nodes—Towards
TOD policy. Transp. Policy 2018, 56, 96–111. [CrossRef]
17. Shirzadi Babakan, A.; Taleai, M. Impacts of transport development on residence choice of renter households: An agent-based
evaluation. Habitat Int. 2015, 49, 275–285. [CrossRef]
18. Motieyan, H.; Mesgari, M.S. Towards sustainable urban planning through transit-oriented development (A case study: Tehran).
ISPRS Int. J. Geo-Information 2017, 6, 402. [CrossRef]
19. Singh, Y.J.; Flacke, J.; van Maarseveen, M. Planning for Transit Oriented. In GIS in Sustainable Urban Planning and Management;
CRC Press: Boca Raton, FL, USA, 2018; Volume 2002, pp. 267–282.
20. Jain, A.; Ong, S.P.; Hautier, G.; Chen, W.; Richards, W.D.; Dacek, S.; Cholia, S.; Gunter, D.; Skinner, D.; Ceder, G.; et al. Commentary:
The materials project: A materials genome approach to accelerating materials innovation. APL Mater. 2013, 1, 011002. [CrossRef]
21. Türe, C. A methodology to analyse the relations of ecological footprint corresponding with human development index: Eco-
sustainable human development index. Int. J. Sustain. Dev. World Ecol. 2013, 20, 9–19. [CrossRef]
22. RCervero, R.; Dai, D. BRT TOD: Leveraging transit oriented development with bus rapid transit investments. Transp. Policy 2014,
36, 127–138. [CrossRef]
23. Archer, J.; Fotheringham, M.; Symmons, N.; Corben, B.J. The Impact of Lowered Speed Limits in Urban/Metropolitan Areas; Transport
Accident Commission: Geelong, Australia, 2008.
24. Wrótny, M.; Bohatkiewicz, J. Traffic noise and inhabitant health—A comparison of road and rail noise. Sustainability 2021, 13, 7340.
[CrossRef]
25. An, D.; Tong, X.; Liu, K.; Chan, E.H.W. Understanding the impact of built environment on metro ridership using open source in
Shanghai. Cities 2019, 93, 177–187. [CrossRef]
26. Cervero, R. Linking urban transport and land use in developing countries. J. Transp. Land Use 2013, 6, 7–24. [CrossRef]
27. Khan, J.; Hrelja, R.; Pettersson-Löfstedt, F. Increasing public transport patronage—An analysis of planning principles and public
transport governance in Swedish regions with the highest growth in ridership. Case Stud. Transp. Policy 2021, 9, 260–270.
[CrossRef]
28. Giuffrida, N.; le Pira, M.; Inturri, G.; Ignaccolo, M. Addressing the public transport ridership/coverage dilemma in small cities: A
spatial approach. Case. Stud. Transp. Policy 2021, 9, 12–21. [CrossRef]
29. Bielli, M.; Caramia, M.; Carotenuto, P. Genetic algorithms in bus network optimization. Transp. Res. Part C Emerg. Technol. 2002,
10, 19–34. [CrossRef]
Sustainability 2023, 15, 1718 20 of 20
30. Roy, A.; Manna, A.; Maity, S. A novel memetic genetic algorithm for solving traveling salesman problem based on multi-parent
crossover technique. Decis. Mak. Appl. Manag. Eng. 2019, 2, 100–111. [CrossRef]
31. Barma, P.S.; Dutta, J.; Mukherjee, A. A 2-opt guided discrete antlion optimizationalgorithm for multi-depotvehicle routing
problem. Decis. Mak. Appl. Manag. Eng. 2019, 2, 112–125. [CrossRef]
32. Calabrò, G.; Inturri, G.; le Pira, M.; Pluchino, A.; Ignaccolo, M. Bridging the gap between weak-demand areas and public transport
using an ant-colony simulation-based optimization. Transp. Res. Procedia 2019, 45, 234–241. [CrossRef]
33. Zhao, Y.; Huang, P.; Long, G.; Yuan, Y.; Sun, Y. Influence of Fluid Viscous Damper on the Dynamic Response of Suspension Bridge
under Random Traffic Load. Adv. Civ. Eng. 2020, 2020, 1857378. [CrossRef]
34. Pagès, L.; Jayakrishnan, R.; Cortés, C.E. Real-time mass passenger transport network optimization problems. Transp. Res. Rec. J.
Transp. Res. Board 2006, 1964, 229–237. [CrossRef]
35. Murray, A.T.; Davis, R.; Stimson, R.J.; Ferreira, L. Public transportation access. Transp. Res. D. Transp. Environ. 1998, 3, 319–328.
[CrossRef]
36. Murray, A.T. A Coverage Model for Improving Public Transit System Accessibility and Expanding Access. Ann. Oper. Res. 2003,
123, 143–156. [CrossRef]
37. Mattson, J. Estimating Ridership of Rural Demand–Response Transit Services for the General Public. Transp. Res. Rec. J. Transp.
Res. Board 2017, 2647, 127–133. [CrossRef]
38. Walker, J. Purpose-driven public transport: Creating a clear conversation about public transport goals. J. Transp. Geogr. 2008,
16, 436–442. [CrossRef]
39. Le Pira, M.; Ignaccolo, M.; Inturri, G.; Pluchino, A.; Rapisarda, A. Modelling stakeholder participation in transport planning. Case
Stud. Transp. Policy 2016, 4, 230–238. [CrossRef]
40. Gonzalez-Urango, H.; Inturri, G.; le Pira, M.; García-Melón, M. Planning for Pedestrians with a Participatory Multicriteria
Approach. J. Urban Plan. Dev. 2020, 146, 05020007. [CrossRef]
41. Inturri, G.; Giuffrida, N.; le Pira, M.; Fazio, M.; Ignaccolo, M. Linking Public Transport User Satisfaction with Service Accessibility
for Sustainable Mobility Planning. SPRS Int. J. Geo-Inf. 2021, 10, 235. [CrossRef]
42. Inturri, G.; Fiore, S.; Ignaccolo, M.; Caprì, S.; le Pira, M. “You study, you travel free”: When mobility management strategies meet
social objectives. Transp. Res. Procedia 2020, 45, 193–200. [CrossRef]
43. Major, M.D.; Tannous, H.O.; Al-Thani, S.; Hasan, M.; Khan, A.; Salaheldin, A. Macro and micro scale modelling of multi-modal
transportation spatial networks in the city-state of Doha, Qatar. In Proceedings of the 56th ISOCARP Virtual World Planning
Congress, Online, 8 November 2020–4 February 2021. [CrossRef]
44. Furlan, R.; Almohannadi, M. Light rail transit and land use in Qatar: An integrated planning strategy for Al-Qassar’s TOD.
Archnet-IJAR Int. J. Arch. Res. 2016, 10, 170–192. [CrossRef]
45. Salama, A.M.; Wiedmann, F. Demystifying. Doha: On Architecture and Urbanism in an Emerging City; Qatar University: Doha, Qatar, 2016.
46. Furlan, R.; Petruccioli, A.; Major, M.D.; Zaina, S.; Zaina, S.; Al Saeed, M.; Saleh, D. The urban regeneration of west-bay, business
district of Doha (State of Qatar): A transit-oriented development enhancing livability. J. Urban Manag. 2019, 8, 126–144. [CrossRef]
47. Furlan, R.; Al-Mohannadi, A. An Urban Regeneration Planning Scheme for the Souq Waqif Heritage Site of Doha. Sustainability
2020, 12, 7927. [CrossRef]
48. Breiman, L.; Ihaka, R. Nonlinear discriminant analysis via scaling and ACE. In Davis One Shields Avenue Davis; Department of
Statistics, University of California: Berkeley, CA, USA, 1984.
49. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [CrossRef]
50. Geurts, P.; Ernst, D.; Wehenkel, L. Extremely randomized trees. Mach. Learn. 2006, 63, 3–42. [CrossRef]
51. Drucker, P.F. The future that has already happened. Harv. Bus. Rev. 1997, 75, 20–22.
52. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd Acm Sigkdd International Conference
on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.