ITS Architect
ITS Architect
net/publication/334410680
CITATIONS READS
115 2,023
9 authors, including:
Some of the authors of this publication are also working on these related projects:
Communication Connect: Improving long term communication and mental health outcomes following stroke and brain injury View project
All content following this page was uploaded by Tharindu Bandaragoda on 16 July 2019.
Abstract— The technological landscape of intelligent transport Things (IoT), sensor networks and social media has surpassed
systems (ITS) has been radically transformed by the emergence traditional means of collecting data, by creating voluminous
of the big data streams generated by the Internet of Things (IoT), and continuous streams of real-time data. Leveraging such big
smart sensors, surveillance feeds, social media, as well as growing
infrastructure needs. It is timely and pertinent that ITS harness data environments is a formidable issue, due to the intense vol-
the potential of an artificial intelligence (AI) to develop the ume and velocity at which data is generated by transportation
big data-driven smart traffic management solutions for effective and mobility systems [1]. Furthermore, the dynamic nature of
decision-making. The existing AI techniques that function in these environments makes the data generation volatile, which
isolation exhibit clear limitations in developing a comprehensive impedes the effectiveness of decision-making in ITS.
platform due to the dynamicity of big data streams, high-
frequency unlabeled data generation from the heterogeneous data The dynamicity of data generated by transportation systems
sources, and volatility of traffic conditions. In this paper, we pro- consists of continuously changing patterns and concept drifts.
pose an expansive smart traffic management platform (STMP) In a traffic context, concept drifts are the changes to the
based on the unsupervised online incremental machine learning, distributions of data in a traffic data stream over time [3].
deep learning, and deep reinforcement learning to address these Based on the nature of fluctuations in data streams, these
limitations. The STMP integrates the heterogeneous big data
streams, such as the IoT, smart sensors, and social media, changes are further classified as recurrent and non-recurrent
to detect concept drifts, distinguish between the recurrent and concept drifts. For example, traffic congestion changes due
non-recurrent traffic events, and impact propagation, traffic flow to peak/off-peak traffic is a recurrent concept drift whereas
forecasting, commuter sentiment analysis, and optimized traffic an accident or breakdown is a non-recurrent concept drift.
control decisions. The platform is successfully demonstrated Special importance should be placed into identifying non-
on 190 million records of smart sensor network traffic data
generated by 545,851 commuters and corresponding social media recurrent concept drifts as it could affect the entire road
data on the arterial road network of Victoria, Australia. network. Existing literature reports a number of supervised
machine learning algorithms that detect drifts and adapt to new
Index Terms— Smart traffic management, concept drift, unsu-
pervised incremental learning, deep learning, deep reinforcement concepts [3]–[6]. Although real-time concept drift detection is
learning, impact propagation, traffic optimization, traffic fore- crucial for effective decision making in transportation, feed-
casting, traffic control, social media analytics. back on the type of traffic incident is only received following
an unknown delay. This severely limits the applicability of
I. I NTRODUCTION the supervised learning nature of these algorithms. Therefore,
we postulate that concept drift detection in road traffic requires
R OAD traffic conditions and flow management con-
tinue to be an important area of research with many
practical implications. During the last decade, the techno-
unsupervised online incremental machine learning to address
the challenges of real-time, unlabeled, volatile data streams.
In this work, we distinguish online learning and incremen-
logical landscape of transportation has gradually integrated
tal learning. Online learning updates the model using each
disruptive technology paradigms into current transportation
incoming data point that arrives during the operation, without
management systems, leading to Intelligent Transportation
storing [7]. As such, online learning is utilized to handle
Systems (ITS) [1], [2]. The emergence of Internet of
large volumes of streaming data arriving at high velocity.
Manuscript received July 15, 2018; revised November 7, 2018 and Incremental learning is learning from batches of data at distinct
April 8, 2019; accepted May 31, 2019. This work was supported in part by the time intervals, and has the capability to stabilize the historical
Australian Government Research Training Program Scholarship and in part by
the Data to Decisions Cooperative Research Centre (D2D CRC) through the knowledge of the learning model over novel learnings [8].
Analytics and Decision Support Program. The Associate Editor for this paper Hence, the model is updated to any new data point that is
was J. Sanchez-Medina. (Corresponding author: Tharindu Bandaragoda.) received while keeping its existing knowledge intact. Further,
D. Nallaperuma, R. Nawaratne, T. Bandaragoda, A. Adikari, S. Nguyen,
T. Kempitiya, D. De Silva, and D. Alahakoon are with the Research Centre it is essential that non-recurrent concept drifts are identified
for Data Analytics and Cognition, La Trobe Business School, Bundoora, VIC and utilized for updated traffic propagation and traffic flow
3083, Australia (e-mail: [email protected]). prediction models in a real-time manner.
D. Pothuhera is with the Department of Information Management and
Technology, VicRoads, Kew, VIC 3101, Australia. To this end, we further address several key concerns which
Digital Object Identifier 10.1109/TITS.2019.2924883 are underexplored in current ITS, to support the development
This work is licensed under a Creative Commons Attribution 3.0 License. For more information, see http://creativecommons.org/licenses/by/3.0/
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
NALLAPERUMA et al.: ONLINE INCREMENTAL MACHINE LEARNING PLATFORM FOR BIG DATA-DRIVEN SMART TRAFFIC MANAGEMENT 3
NALLAPERUMA et al.: ONLINE INCREMENTAL MACHINE LEARNING PLATFORM FOR BIG DATA-DRIVEN SMART TRAFFIC MANAGEMENT 5
where Q(s, a) is the primary Q network, Q̃(s, a) is the target the event. Twitter API is queried with re and te to collect the
network, and γ is the discount factor [50]. This trick has been relevant georeferenced tweets of the event. Note that, there are
shown to significantly improve the effectiveness of Deep Q- advanced methods of localizing the non-georeferenced tweets
learning. [41], however, such approaches are beyond the scope of this
The reward obtained for each time interval (e.g., 10 minutes) work. The radius re is set based on the impact propagation
is calculated as, analysis of the accident by drawing a bounding circle covering
the road segments with significant relative incident impact.
Wt − Wt −1 i f (Wt − Wt −1 ) < 0 The emotion extraction is based on Ekman’s model of
rt = (8)
−P ∗ (W t − Wt −1 ) other wi se emotions where it presents six basic emotions (“Joy”, “Anger”,
“Surprise”, “Disgust”, “Fear”, “Sadness”) which are known as
where Wt is the average waiting of vehicles traveling on the
universal emotions [42]. An emotion vocabulary was formed
traffic network in the time interval t, and P is the penalty for
by creating a dictionary of emotional expression per each
slowing down the traffic. Based on the recorded states, actions,
emotion category. Emotion expressions were obtained from
and rewards, DRL will update its parameters (i.e., connection
the LIWC dictionary [43] and other published research studies
weights) to optimize its decisions by using back-propagation.
[44]. Text mining and NLP techniques were used to extract
In this method, the traffic controller can be triggered via time-
the emotion expressions from social media contents. Having
based (e.g., every five minutes) or event-based mechanisms
identified the emotions of a particular tweet, the weight of
(e.g., by anomaly detection).
each emotion (we ) was calculated by taking the average
of emotional expressions present in each tweet in order to
F. Social Media Based Commuter Emotion Analysis determine the strength of the emotions expressed. Afterwards,
In the proposed architecture, once an event is detected using the accumulated emotion intensity level was calculated and
the concept drift module (Section II-B), the emotion analysis is used to determine if the emotional behavior was significant.
used to capture the emotional behaviors of commuters from the Emotion intensity (It ) can be defined based on the aggregation
live social media stream. The importance and the applicability of emotions over a specified time period, commencing at
of social media in transport sector have been stated in [38] time t, where n denotes the number of tweets and m denotes
where it mentions that social media data from commuters the number of emotions in the category, as expressed below.
potentially enrich other data sources by providing a different n m
view of the commuters and their behaviors which are not cap- It,t +t = wi j (9)
tured from other traffic data sources. This analysis is conducted i=0 j =0
as a supporting module to investigate commuter behaviors
during detected events, as social media platforms such as III. E XPERIMENTS
Twitter are widely used communication platforms to notify This section demonstrates the proposed STMP platform
about significant events such as accidents, traffic congestions using real traffic data from the arterial road network of the
and roadblocks [10]–[12] and in several studies, sentiment State of Victoria, Australia.
analysis using Twitter has been carried out as a measure of The information on traffic has been acquired from the
commuter satisfaction and to observe commuter behavior and Bluetooth Traffic Monitoring System (BTMS) that is used to
opinion patterns in order to improve the transportation services monitor the road traffic of arterial roads in Victoria. BTMS is
[39], [40]. This work extends the traditional sentiment analysis a type of automatic vehicle detectors that is used to estimate
by extracting deeper emotion from twitter, thus enabling a travel times in a road network [45], [46]. For a comprehensive
more granular analysis of user opinion and feeling. Fig. 5 is understanding of the Bluetooth sites refer [47].
a high-level illustration of the process. As shown in Fig. 6, BTMS consists of a network of
Once an event is identified by the concept drift detection, Bluetooth traffic scanners that are placed in the junctions of
the Twitter data stream is analyzed to collect tweets that are arterial roads. These Bluetooth scanners capture the Bluetooth
relevant to the event. Such tweets are defined as originating devices that transit the scanning zone, which are either Blue-
within a radius re of the event for a time interval te since tooth enabled vehicle stereo systems or the mobile devices
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
NALLAPERUMA et al.: ONLINE INCREMENTAL MACHINE LEARNING PLATFORM FOR BIG DATA-DRIVEN SMART TRAFFIC MANAGEMENT 7
Fig. 7. Schematic of the road network in the area of interest selected for
traffic analysis. Fig. 8. Recurrent and non-recurrent concept drift detection.
TABLE I TABLE II
S UMMARY OF R ECURRENT C ONCEPT D RIFT D ETECTION I NCIDENT I MPACT P ROPAGATION FOR AN I NCIDENT AT
WARRIGAL R D -WAVERLEY R D I NTERSECTION
Fig. 9. Evidence for non-recurrent concept drift detection, left: usual traffic C. Impact Propagation Analysis
congestion is reduced in Warrigal Rd due to an accident in Monash Fwy, right:
accident on Warrigal Rd which has created traffic congestion on Warrigal Rd In Section II-D the impact propagation across the road
and Monash Fwy. network due to a non-recurrent incident was derived based on
the traffic flow towards that location from other road segments.
This selected incident has occurred at the Warrigal Rd-Waverly This capability is demonstrated based on the Warrigal Rd -
Rd intersection (see Fig. 7). Waverley Rd intersection (see Fig. 7) where the traffic incident
The tweets relevant to this incident were collected using was identified in the previous section (non-recurrent concept
the technique delineated in Section II-F, in which the two drift at timestamps [t32] in Fig. 8). Based on the Eq. 5
parameters are set as follows. The radius of the event (re ) (Sec. II-C) the relative incident impact was derived for the
is set to 3.7km which is the distance to the furthest location road segments that are close to the incident location.
(Warrigal Rd - Burwood Hwy intersection) with a significant Table II presents the road segments with a significant impact
relative incident impact identified by the impact propagation from the incident with its respective relative incident impact
analysis in the next section (see Table II). The event duration determined for the weekday 8 am traffic and weekday 8 pm
(te ) is set experimentally to 1 hour to be suitable for both major traffic. Note that two time-of-the-day values were selected to
and minor incidents. From the collected tweets, it was found compare and contrast differences in incident impact propaga-
that these non-recurrent traffic flow changes had occurred due tion due to different traffic behaviors.
to an accident (Fig. 9). Tweets in Fig. 9 (right) shows that As shown in Table II, the incident impact is mainly prop-
there was a communication gap of more than 45 minutes to agated along the Warrigal Rd in both directions (south and
gather information on the situation. Due to the data-driven northbound), compared to relatively less impact on Waverley
nature of the algorithm, concept drifts on traffic flow can be Rd (east and westbound). This is because Warrigal Rd carries
detected almost real time. Being able to provide a real-time a high traffic flow as it has entrance to the freeway. At 8 am,
notification on non-recurrent traffic flow changes will allow the highest impact on Warrigal Rd is for Batesford Rd to
better communications and effective optimizations. Waverley Rd traffic flow, as that traffic is bound to cross the
The proposed algorithm has the capability of detecting Warrigal Rd-Waverley Rd intersection and enter the Monash
concept drifts with different granularities such as daily or Fwy towards the City. In contrast, impact at 8 pm is highest
hourly. This can be achieved by changing the growth threshold on Monash Fwy to Waverley Rd, which consists of the traffic
and distance measure. With the demonstrated granularity, out coming from the city and exiting Monash Fwy to Warrigal Rd.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
NALLAPERUMA et al.: ONLINE INCREMENTAL MACHINE LEARNING PLATFORM FOR BIG DATA-DRIVEN SMART TRAFFIC MANAGEMENT 9
TABLE III
E VALUATION OF A CCURACY BASED ON THE T IME -L AG
S ELECTED FOR P REDICTION
Fig. 10. Traffic flow prediction performance comparison on the time period
Oct. 25 to Oct. 31, 2017.
The incident impact on Waverley Rd is relatively less, which
mostly impacts the eastbound traffic while the westbound
traffic is only impacted at 8 am. The DNN trained upon input of 3 time-lags (i.e., 45 minutes
prior traffic flow data) was selected for the examination
D. Traffic Forecasting in Fig. 10 to compare with the ground truth traffic flow
data. Based on the graphical representation of the prediction
The primary objective of the traffic forecasting experiment
performance, it is clear that the DNN model can successfully
is to evaluate the proposed DNN’s fitness in short-term traffic
model traffic flow with normal fluctuation, however, there are
flow forecasting. Based on the impact propagation analysis
limitations in predicting traffic flow that has severe fluctuations
conducted in the previous step, it was identified that the
which occur at a higher frequency. This limitation is expected
traffic flow of the road segment on Warrigal Rd (annotated
to be addressed when additional traffic data (e.g. roadworks,
as B to C in Fig. 7), in which heavy traffic congestion befall
incidents, events) is used for training the DNN model.
in frequent time periods, is critically impacted by the road
segments; A to B, C to D and C to E. Therefore, in this
experiment we have evaluated the effect of the identified road E. Intelligent Traffic Control
segments with respect to the traffic flow prediction of road To illustrate the effectiveness of DRL for adaptive traffic
segment B to C. In the experiment, the data horizon is taken control, we apply it to the traffic network around the selected
as 15 minutes, where the data extension was 24 days (1st to shopping center (Fig. 7) to control the traffic light phases.
24th October 2017). The selected area has 1805 lanes so the input dimension (i.e.,
Based on the Eq. 10, traffic flow is determined for the the state S) of the proposed DNN model is 1805. In our
selected road segments using 15-minute sampling intervals experiments, we perform 500 simulation runs (or epochs)
and utilized as the input sequences of the DNN. The data to learn the DNN model. Five master control programs are
was divided into training and testing subsets, in which the considered, i.e., M = 5. The first program is the default
first 24 days’ data were utilized for training of the model and program determined by the SUMO’s generator [36]. The sec-
the remaining 7 days’ data was utilized for testing the model ond program set all phase durations to 60 seconds. The
performance. To validate the effectiveness of the algorithm, third program doubles the phase durations for phases with
the model performance was measured by means of Mean durations greater than 40 seconds. The fourth program double
Squared Error (MSE) and Mean Absolute Error (MAE). the phase durations for all phases. The last program set the
Table III demonstrates the prediction performance with phase durations for all phases to 30 seconds. The parameters
respect to the measures selected. We evaluate the prediction of the proposed DRL algorithm are: discount factor γ = 0.95,
performance based on the multiple time-lags, which have initial exploration factor = 1, and decay factor = 0.99.
presented to the DNN as input, ranging from 15 minutes to The network weights are updated using Adam optimizer with
2 hours. The time lag with the highest performance is marked the learning rate of 0.001 at the end of each simulation run
in bold. The findings reveal that with the time lag increases using the experience replayed from the agent’s memory. For
the model performance. However, after the time lag of 3, the reward function in Eq. 8, we use the penalty P = 5.
the model performance begins to decrease, and it starts to The results of DRL is shown in Fig. 11. It is easy to see
drastically reduce after the time lag of 5. Based on the internals that the average waiting time is roughly reduced by half after
of the algorithms and the findings, it is evident that the DNN the 500 epochs. It demonstrates the ability of the proposed
with three LSTM layers can successfully model dependencies DRL to learn effective control decisions even for a very large
between time series input (i.e., traffic flow data as input) for traffic network.
a time lag of 3. However, the current model is not capable The learning mechanism of the proposed DRL can be
enough to model complex dependencies over 3-time lags, adapted to different networks and other control problems
hence, the decline of the performance of the prediction model. in intelligent traffic systems. Instead of periodically rese-
Fig. 10 presents the traffic flow prediction performance lect the control program, we can modify the algorithm to
comparison on the time period Oct. 25 to Oct. 31, 2017, adjust the control program based on the signal from con-
in order to examine the prediction performance visually. cept drift or abnormally detection. Compared to conventional
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TABLE IV
H IGHER E MOTION I NTENSITY T IMELINE
NALLAPERUMA et al.: ONLINE INCREMENTAL MACHINE LEARNING PLATFORM FOR BIG DATA-DRIVEN SMART TRAFFIC MANAGEMENT 11
We acknowledge a number of potential areas for improve- [18] R. Yu, Y. Li, C. Shahabi, U. Demiryurek, and Y. Liu, “Deep learning:
ment. Further research directions involve fusing data from A generic approach for extreme condition traffic forecasting,” in Proc.
SIAM Int. Conf. Data Mining, Jun. 2017, pp. 777–785.
heterogeneous data sources such as security cameras, weather [19] G. Michau, A. Nantes, A. Bhaskar, E. Chung, P. Abry, and P. Borgnat,
information, and other transportation-related data sources. “Bluetooth data in an urban context: Retrieving vehicle trajectories,”
Also, the interpretability of AI modules, especially ones based IEEE Trans. Intell. Transp. Syst., vol. 18, no. 9, pp. 2377–2386,
Sep. 2017.
on complicated techniques such as deep neural networks, are [20] D. Nallaperuma, D. D. Silva, D. Alahakoon, and X. Yu, “A cognitive
worth investigating in the future to gain the acceptance of the data stream mining technique for context-aware IoT systems,” in Proc.
platforms. 43rd Annu. Conf. IEEE Ind. Electron. Soc., Nov. 2017, pp. 4777–4782.
[21] D. Nallaperuma, D. D. Silva, D. Alahakoon, and X. Yu, “Intelligent
ACKNOWLEDGMENT detection of driver behavior changes for effective coordination between
autonomous and human driven vehicles,” in Proc. 43rd Annu. Conf.
Authors would like to thank VicRoads–Victorian Road IEEE Ind. Electron. Soc., Oct. 2018, pp. 3120–3125.
Authority, Australia for their consultations in domain knowl- [22] A. Câmpan and G. Şerban, “Adaptive clustering algorithms,” in Proc.
Adv. Artif. Intell., Oct. 2006, pp. 407–418.
edge and providing us with real road traffic data. Authors [23] D. D. Silva and D. Alahakoon, “Incremental knowledge acquisition and
would also like to acknowledge the in-kind support from the self learning from text,” in Proc. Int. Joint Conf. Neural Netw. (IJCNN),
Data to Decisions Cooperative Research Centre (D2D CRC) Jul. 2010, pp. 1–8.
as part of their analytics and decision support Program. [24] D. Alahakoon, S. K. Halgamuge, and B. Srinivasan, “Dynamic self-
organizing maps with controlled growth for knowledge discovery,” IEEE
Trans. Neural Netw., vol. 11, no. 3, pp. 601–614, May 2000.
R EFERENCES
[25] C. S. Wirasinghe, “Determination of traffic delays from shock-wave
[1] I. Lana, J. D. Ser, M. Velez, and E. I. Vlahogianni, “Road traffic analysis,” Transp. Res., vol. 12, no. 5, pp. 343–348, Oct. 1978.
forecasting: Recent advances and new challenges,” IEEE Intell. Transp. [26] B. Pan, U. Demiryurek, C. Shahabi, and C. Gupta, “Forecasting spa-
Syst. Mag., vol. 10, no. 2, pp. 93–109, Apr. 2018. tiotemporal impact of traffic incidents on road networks,” in Proc. IEEE
[2] E. I. Vlahogianni, M. G. Karlaftis, and J. C. Golias, “Short-term 13th Int. Conf. Data Mining, Dec. 2013, pp. 587–596.
traffic forecasting: Where we are and where we’re going,” Transp. Res. [27] W. Huang, G. Song, H. Hong, and K. Xie, “Deep architecture for traffic
C, Emerg. Technol., vol. 43, pp. 3–19, Jun. 2014. flow prediction: Deep belief networks with multitask learning,” IEEE
[3] J. Gama, I. Žliobaite, A. Bifet, M. Pechenizkiy, and A. Bouchachia, Trans Intell. Transp. Syst., vol. 15, no. 5, pp. 2191–2201, Oct. 2014.
“A survey on concept drift adaptation,” ACM Comput. Surv., vol. 46, [28] X. Ma, Z. Tao, Y. Wang, H. Yu, and Y. Wang, “Long short-term memory
no. 4, p. 44, Apr. 2014. neural network for traffic speed prediction using remote microwave
[4] J. L. Lobo, J. D. Ser, M. N. Bilbao, C. Perfecto, and S. Salcedo-Sanz, sensor data,” Transp. Res. Part C, Emerg. Technol., vol. 54, pp. 187–197,
“DRED: An evolutionary diversity generation method for concept drift May 2015.
adaptation in online learning environments,” Appl. Soft Comput., vol. 68, [29] S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural
pp. 693–709, Jul. 2018. Comput., vol. 9, no. 8, pp. 1–32, 1997.
[5] M. M. Masud et al., “Addressing concept-evolution in concept-drifting [30] M. Hermans and B. Schrauwen, “Training and analysing deep recurrent
data streams,” in Proc. IEEE Int. Conf. Data Mining, Dec. 2010, neural networks,” in Proc. Adv. Neural Inf. Process. Syst., Jun. 2013,
pp. 929–934. pp. 190–198.
[6] A. Bifet, G. Holmes, B. Pfahringer, R. Kirkby, and R. Gavaldà, [31] A. Graves, A.-R. Mohamed, and G. Hinton, “Speech recognition with
“New ensemble methods for evolving data streams,” in Proc. 15th deep recurrent neural networks,” in Proc. IEEE Int. Conf. Acoust.,
ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining, Jun. 2009, Speech Signal Process., May 2013, pp. 6645–6649.
pp. 139–148. [32] Z. Zhao et al., “LSTM network: A deep learning approach for short-
[7] M. Sayed-Mouchaweh, “Learning in dynamic environments,” in Learn- term traffic forecast,” IET Intell. Transp. Syst., vol. 11, no. 2, pp. 68–75,
ing from Data Streams in Dynamic Environments, Springer, New York, 2017.
NY, USA: 2016. [33] L. Baskar, B. De Schutter, J. Hellendoorn, and Z. Papp, “Traffic control
[8] M. A. Maloof and R. S. Michalski, “Incremental learning with par- and intelligent vehicle highway systems: A survey,” IET Intell. Transp.
tial instance memory,” Artif. Intell., vol. 154, nos. 1–2, pp. 95–126, Syst., vol. 5, no. 1, pp. 38–52, Mar. 2011.
Apr. 2004. [34] V. Mnih et al., “Human-level control through deep reinforcement learn-
[9] K. N. Qureshi and A. H. Abdullah, “A survey on intelligent transporta- ing,” Nature, vol. 518, pp. 529–533, Feb. 2015.
tion systems,” Middle-East J. Sci. Res., vol. 15, no. 5, pp. 629–642,
[35] M. Haklay and P. Weber, “OpenStreetMap: User-generated street maps,”
2013.
IEEE Pervasive Comput., vol. 7, no. 4, pp. 12–18, Oct. 2008.
[10] C. Khatri, “Real-time road traffic information detection through
social media,” Jan. 2018, arXiv:1801.05088. [Online]. Available: [36] A. Wegener, M. Piórkowski, M. Raya, H. Hellbrück, S. Fischer, and
https://arxiv.org/abs/1801.05088 J.-P. Hubaux, “TraCI: An interface for coupling road traffic and network
[11] D. Wang, A. Al-Rubaie, J. Davies, and S. S. Clarke, “Real time road simulators,” in Proc. 11th Commun. Netw. Simulation Symp., Apr. 2008,
traffic monitoring alert based on incremental learning from tweets,” pp. 155–163.
in Proc. IEEE Symp. Evolving Auton. Learn. Syst. (EALS), Dec. 2014, [37] H. V. Hasselt, A. Guez, and D. Silver, “Deep reinforcement learning with
pp. 50–57. double Q-Learning,” Sep. 2015, arXiv:1509.06461. [Online]. Available:
[12] P.-T. Chen, F. Chen, and Z. Qian, “Road traffic congestion monitoring https://arxiv.org/abs/1509.06461
in social media with hinge-loss Markov random fields,” in Proc. IEEE [38] S. M. Grant-Muller, A. Gal-Tzur, E. Minkov, S. Nocera, T. Kuflik,
Int. Conf. Data Mining, Dec. 2014, pp. 80–89. and I. Shoor, “Enhancing transport data collection through social
[13] M. J. Lighthill and G. B. Whitham, “On kinematic waves. II. A media sources: Methods, challenges and opportunities for textual data,”
theory of traffic flow on long crowded roads,” Proc. Roy. Soc. London, IET Intell. Transp. Syst., vol. 9, no. 4, pp. 407–417, Nov. 2014.
Ser. A, Math. Phys. Sci., vol. 229, pp. 317–345, May 1955. [39] F. Ali, D. Kwak, P. Khan, S. M. R. Islam, K. H. Kim, and K. S. Kwak,
[14] C. F. Daganzo, “The cell transmission model. Part I: A simple dynamic “Fuzzy ontology-based sentiment analysis of transportation and city
representation of highway traffic,” Transp. Res. Part B Methodol., feature reviews for safe traveling,” Transp. Res. C, Emerg. Technol.,
vol. 28, no. 4, pp. 269–287, 1994. vol. 77, pp. 33–48, Apr. 2017.
[15] P. I. Richards, “Shock waves on the highway,” Oper. Res., vol. 4, no. 1, [40] F. C. Pereira, A. L. C. Bazzan, and M. Ben-Akiva, “The role of context
pp. 42–51, 1956. in transport prediction,” IEEE Intell. Syst., vol. 29, no. 1, pp. 76–80,
[16] A. Aw, A. Klar, M. Rascle, and T. Materne, “Derivation of contin- Jan. 2014.
uum traffic flow models from microscopic follow-the-leader models,” [41] H. Abdelhaq, C. Sengstock, and M. Gertz, “EvenTweet: Online localized
SIAM J. Appl. Math., vol. 63, no. 1, pp. 259–278, 2002. event detection from twitter,” Proc. VLDB Endowment, vol. 6, no. 12,
[17] Y. Lv, Y. Duan, W. Kang, Z. Li, and F.-Y. Wang, “Traffic flow prediction pp. 1326–1329, Aug. 2013.
with big data: A deep learning approach,” IEEE Trans. Intell. Transp. [42] P. Ekman, “An argument for basic emotions,” Cognit. Emotion, vol. 6,
Syst., vol. 16, no. 2, pp. 865–873, Apr. 2015. nos. 3–4, pp. 169–200, 1992.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
[43] J. H. Kahn, R. M. Tobin, A. E. Massey, and J. A. Anderson, “Measuring Su Nguyen received the Ph.D. degree in artificial
emotional expression with the linguistic inquiry and word count,” intelligence and data analytics from the Victoria
Amer. J. Psychol., vol. 120, no. 2, pp. 263–286, Jul. 2007. University of Wellington, New Zealand, in 2013.
[44] R. Valitutti, “WordNet-affect: An affective extension of WordNet,” He is currently a Senior Research Fellow with the
in Proc. 4th Int. Conf. Lang. Resour. Eval., May 2004, pp. 1083–1086. Centre for Research in Data Analytics and Cogni-
[45] U. Mori, A. Mendiburu, M. Álvarez, and J. A. Lozano, “A review of tion, La Trobe University, Australia. His primary
travel time estimation and forecasting for advanced traveller information research interests include computational intelligence,
systems,” Transportmetrica A, Transp. Sci., vol. 11, no. 2, pp. 119–157, optimization, data analytics, large-scale simulation,
2015. and their applications in operations management and
[46] A. Bhaskar and E. Chung, “Fundamental understanding on the use of social media.
Bluetooth scanner as a complementary transport data,” Transp. Res. C,
Emerg. Technol., vol. 37, pp. 42–72, Dec. 2013.
[47] VicRoads. Bluetooth Sites–VicRoads Open Data. Accessed:
Feb. 26, 2019. [Online]. Available: https://vicroadsopendata-
vicroadsmaps.opendata.arcgis.com/datasets/48fd4d7e1127453ea
5f9bdc757ab00e7_0
[48] A. Nantes, M. P. Miska, A. Bhaskar, and E. Chung, “Noisy Bluetooth
traffic data?” Road Transp. Res., A J. Austral. New Zealand Res. Pract., Thimal Kempitiya received the bachelor’s degree
vol. 23, no. 1, pp. 33–43, Mar. 2014. (Hons.) from the Department of Computer Sci-
[49] C. Hiemstra and J. D. Jones, “Testing for linear and nonlinear Granger ence and Engineering, University of Moratuwa, Sri
causality in the stock price-volume relation, ” J. Finance, vol. 49, no. 5, Lanka, in 2015. He is currently a senior software
pp. 1639–1664, Dec. 1994. engineer in machine learning and digital human
simulation fields. His research interests include data
mining, business analytics, deep learning, incremen-
Dinithi Nallaperuma received the master’s degree tal learning, and human simulation.
in advanced software engineering from the Univer-
sity of Westminster, U.K., and the Ph.D. degree
from the Research Center for Data Analytics and
Cognition, La Trobe University, Australia. Prior to
commencing her Ph.D. degree, she was a Research
Engineer with the National University of Singapore.
Her research interests include the Internet of Things,
data stream mining, unsupervised learning, and cog-
Daswin De Silva received the Ph.D. degree in
nitive computing.
machine learning and artificial intelligence from
Monash University. He is currently a Senior Lecturer
with La Trobe University, Australia. Besides his
Rashmika Nawaratne received the bachelor’s academic and professional expertise in analytics, he
degree (Hons.) from the Department of Computer is also actively involved in industry research and
Science and Engineering, University of Moratuwa, engagement. His research interests include cognitive
Sri Lanka, in 2014. He is currently pursuing the computing, autonomous learning algorithms, incre-
Ph.D. degree with the Research Centre for Data Ana- mental knowledge acquisition, stream mining, social
lytics and Cognition, La Trobe University, Australia. media, and text mining.
Prior to commencing his Ph.D. degree, he was a
Technical Lead with the Software Product Devel-
opment Organization. His research interests include
incremental learning, video analytics, deep learning,
the Internet of Things, and data stream mining.