Big Data and Climate Change: Hossein Hassani, Xu Huang and Emmanuel Silva
Big Data and Climate Change: Hossein Hassani, Xu Huang and Emmanuel Silva
cognitive computing
Review
Big Data and Climate Change
Hossein Hassani 1, * , Xu Huang 2 and Emmanuel Silva 3
1 Research Institute of Energy Management and Planning, University of Tehran, Tehran 1417466191, Iran
2 Faculty of Business and Law, De Montfort University, Leicester LE1 9BH, UK; [email protected]
3 Fashion Business School, London College of Fashion, University of the Arts London, London WC1V 7EY, UK;
[email protected]
* Correspondence: [email protected]
Received: 8 January 2019; Accepted: 30 January 2019; Published: 2 February 2019
Abstract: Climate science as a data-intensive subject has overwhelmingly affected by the era of big data
and relevant technological revolutions. The big successes of big data analytics in diverse areas over the
past decade have also prompted the expectation of big data and its efficacy on the big problem—climate
change. As an emerging topic, climate change has been at the forefront of the big climate data analytics
implementations and exhaustive research have been carried out covering a variety of topics. This paper
aims to present an outlook of big data in climate change studies over the recent years by investigating
and summarising the current status of big data applications in climate change related studies. It is also
expected to serve as a one-stop reference directory for researchers and stakeholders with an overview of
this trending subject at a glance, which can be useful in guiding future research and improvements in the
exploitation of big climate data.
1. Introduction
Big data analytics have been rapidly developing along with the emerging needs of big data
technologies in numerous subjects (see, for example, [1,2]). The accessibility, availability and exponentially
growing quantity of big data have further promoted the corresponding technological advancements
and practical implementations. Earth is a complex dynamical system [3]; thereafter, big data analytics
encountered more challenges in climate science than other subjects regardless of the extensive resources
of big climate data. Climate change as an emerging topic and also a data-intensive subject has been the
research focus of big data scholars over the past several decades [4,5]. Exhaustive big data analytics
applications have been carried out on big climate data, while the Internet of Things, cloud computing,
big data tools to investigate climate, as well as intelligent analytics platforms and new technological
progressions, have further emphasized its significance and possible impacts on climate science and big
data science development (see, for example, [6,7]).
Given the context of combating climate change, existing research has applied big data analytics in
mainly the aspects of energy efficiency, intelligent agriculture, smart urban planning, weather forecast, natural
disaster management, etc. Although overall this is not a new subject and there is a large amount of existing
literature, there is no recent review to the best of our knowledge that particularly investigates the topic of
big data in climate change, not to mention that the novel developments are progressing rapidly everyday
along with the technological advancements. Therefore, this paper contributes to the existing literature by
providing the most up-to-date overview of big data applications in climate change related studies at a glance
with the most recently published research that reflects the cutting edge of this topic. It is of note that over
80% of the listed applications are after 2016, which makes this review the latest comprehensive review of
big data in climate change that is significantly different from the previously existing literature. This paper
also contributes by serving as the one stop directory for researchers to gain the most up-to-date overview
of this topic. Furthermore, we aim to summarize the popular practice court of research in this domain, and
also seek to identify the non-mainstream applications that lack thorough exploration. It is expected that, by
providing this comprehensive review, both researchers and practitioners can gain better knowledge of the
current research trend and identify the research gaps with valuable potential.
As can be seen in Figure 1, it is identified that the applications of big data in climate change have two
fundamental elements: the big climate data resources and the big data analytics techniques. We classify
these studies by means of value creation as well as the specific topic of application. For convenient access
of applications and clear guiding purposes, it is summarized that big data in climate change mainly
function in four aspects of value creation: observing and monitoring, understanding, and predicting and
optimizing, whilst the applications are grouped into five topics: energy efficiency and intelligence; smart
farming and agriculture and forestry; sustainable urban planning and infrastructure; natural disaster and
disease assessment; and other advanced supports.
The remainder of this review paper is organized such that the values of big data to climate change
study are summarized in Section 2 along with the trends of this focused topic. Section 3 lists a detailed
review of the big data applications in climate change studies by topics. Finally, the paper concludes in
Section 4 with current challenges and directions of future research.
monitoring and analysing institutions over the world. A detailed report that introduces the core of global
scale climate research and cyber-infrastructure can be found in [11].
The abundant resources above have enabled us to gain knowledge of what the big climate data
are how the data are collected, and what the data can be used for. However, these barely scratch
the surface of big data analytics. Big data have also been playing a vital role in predicting when it
is incorporated with climate science, for instance weather forecasting, natural disasters monitoring and
early warning, energy consumption forecasting, traffic forecasting, etc. By applying corresponding data
mining techniques (the detailed introduction of data mining techniques can be found in [12,13]), it allows
knowledge discovery of the potential relationships and causal inferences, which further contribute to the
modelling and predicting [14].
Accurate forecasts can aid in adaptive policy making in relation to climate changes, whilst the value
creation feature of big data puts emphasis on optimizing. Being able to understand and predict based on
sufficient knowledge extracted from big data or drawing inferences across different cases/applications are
relatively straightforward. However, optimization requires comprehensive theoretical understanding as
well as adequate big data analytics skills to structure the optimal model/infrustructure so as to maximise
performance, efficiency and utility, or, in some cases, for achieving sustainable development. In recent
years, exhaustive relevant applications have been carried out on energy efficiency management, natural
resource management, smart grid, smart farming, etc. More details of applications are provided in
Section 3.
2.3. Trends
Google Trends provide the real-time index of interest for keywords that people search worldwide,
and it has become a great tool for both academic research and practical implementations in marketing,
journalism, entrepreneurship, etc. As can be seen in Figure 2, the worldwide Google Trends of big data
have shown the interest index over time, which is generally between 80 to 100 with 100 the peak popularity.
It reflects the emerging and trending topic of big data and its relevant terms over the recent years. Although
climate change has the average interest index around 40 over time, the rapidly ascending interest from 20
with a wide span over 60 within three years also confirms the increasing popularity and growing attention
to climate.
Yaqoob et al. (2016) [15] presented the emerging trend and sources of big data, i.e., Internet of
Things [16], multimedia [17] and social media [18,19]. The authors also summarized the state-of-the-art
big data processing technologies by categorizing them into the batch based and the stream based
processing technologies (more detailed introduction of each technology and a list of big data vendors
can be found in [20], a brief summary of the open source tools and initiatives can be seen in [21]).
Another tool that is developed by the United States Environmental Protection Agency in 2015 named
EnviroAtlas [22] is a web-based ecosystem service framework that brings together environmental,
economic and demographic data.
In regard to big data analytics in climate science, some recent developments focus on the introduction
of different platforms/services that adopt advanced technological progressions like cloud computing [23],
in memory processing, real-time processing, etc. Schnase et al. [24] introduced Modern-Era Retrospective
Analysis for Research and Applications (MERRA) analytic services that will enable cloud computing
and analytics-as-a-service for climate science, it enables the capability of providing complete life cycle
management of large scale scientific repositories, as well as incorporating MapReduce analytics for research
and applications targeting a broad range of users. Another platform proposed by Hu et al. [25] named
ClimateSpark is an in-memory distributed computing framework that aims to facilitate big climate data
analytics and improve computational efficiency. Manogaran et al. [26] proposed the spatial cumulative sum
Big Data Cogn. Comput. 2019, 3, 12 5 of 17
algorithm to be planted in a scalable big data processing framework for seasonal climate change detection,
and the new algorithm has been validated, outperforming the existing change detection approaches. Later,
in [27], the authors introduced the In-Mapper based combiner algorithm with higher performance than
the widely used MapReduce algorithm in terms of processing big climate data. The proposed algorithm
was implemented in seeking correlation between the climate parameters and incidence of dengue by
integrating climate and health data.
80
60
40
20
0
Jan Apr Jul Oct Jan Apr Jul Oct Jan Apr Jul Oct
2015 2016 2017
Worldwide Google Trends of Climate Change
100
80
60
40
20
0
Jan Apr Jul Oct Jan Apr Jul Oct Jan Apr Jul Oct
2015 2016 2017
Figure 2. Google Trends index of big data and climate change since 2015.
multidimensional big metering data so as to achieve user privacy in a semi-trusted cloud with fast response
and high search quality.
A decision support system for energy efficiency management is developed in [53] that enables the
capability of effectively processing cross domain data and high-level energy/climate data management
and analytics. The novel high level architecture addressed by the authors integrated the single-entry,
processing and semantification point for multiple-sourced data, data storage cluster, data access control,
advanced analytics services and dashboard interface service.
A specific focus of the energy efficiency assessment of China industry sectors can be found in [54],
where the authors applied k-means clustering and the multi-dimensional association rules algorithm to
evaluate the environmental performance and seek possible energy leakages and faults so to further guide
regional energy conservation.
In terms of the water energy efficiency management, a systematic review can be found in [55] that
summarized the big data analytics applications in rainfall data management, flood risk management,
water productivity optimizing, drinking water network management, etc. A recent paper [56] thoroughly
investigated the big data challenges and coping strategies in water-energy metering intelligence. Moreover,
it demonstrated four different case studies in details as showcase applications of big water-energy
data analytics.
Li et al. [66] presented a case study of big data analytics application in forest ecosystem service at Anhui
province, China. Another case study in China was proposed in [67], which focuses on the ecosystem
service in a karst region—Guangxi provice of China—and identifies the positive influence of ecological
big data analytics in ecosystem services. Xie et al. [68] focused on the influencing factors of ecological
land change in China’s Beijing–Tianjin–Hebei region. With the big climate/ecosystem data, the authors
discovered substantial differences across factors regarding its impacts on ecological land changes in
order to recommend on better strategies of natural resource management. A related research that also
targeted China in [69], which introduced the big data analytics with provincial multisourced data on
determining the optimal farmland conversion to reach the best balance between farmland protection and
economic growth.
is also vital to gain the most up-to-date knowledge of big data analytics applications in natural disaster
and/or disease management.
There are existing reviews on data mining applications in terms of combating natural
disasters [88,89]—for instance, floods, storms, landslides, volcanic eruptions, tsunamis, earthquakes,
etc., in which the authors summarized the relevant big data analytics applications in predicting, detecting
and management strategy improvement for a variety of natural disasters, along with identifying existing
challenges and future research directions. A review that specially targeted climate prediction applications
in China can be found in [90].
With regard to computational development, Miyoshi [91] reported the encouraging performance of a
Japanese flagship K supercomputer on big climate data processing comparing to the system operated at
the world’s weather prediction centers. The performance reached 120 times more rapid speed and the big
data assimilation system is also validated with encouraging performance.
One of the highlights is disaster resilience, and Ofli et al. [92] addressed the importance of aerial
imagery data by unmanned aerial vehicles for timely disaster response, and a hybrid crowdsourcing and
real-time machine learning framework was proposed to aid in the existing artificial intelligence disaster
response platform. In a recent piece of research, Papadopoulos et al. [93] employed big data analytics in
multisourced online press and social media big data, in order to validate the disaster resilience in supply
chain for sustainability.
Some recent research exploited the big data analytics in disaster assessment. Nguyen et al. [94]
applied the deep convolutional neural networks (deep-CNN) on processing post disaster social media
images, and the damage assessment showed effective and accurate performance, which also outperformed
the existing technique bag-of-visual-words (BoVW). A case study in South Korea [95] predicted the
regional level building damage risk under weather extremes or natural disasters using decision tree
analysis. In regards to flood damage assessment mapping, the case study of the Myanmar flood in [96]
emphasised the significance of global open geo-information, while later Cian et al. [97] adopted the earth
observation big data and proposed the normalized difference flood index for flood mapping. Given the
significant amount of emissions caused by fire disturbance on a global scale, a recent research work by
Ramo et al. [98] investigated the global burned area mapping using classification data mining techniques.
It has been systematically reviewed in [99] of the big data analytics in global infectious disease
surveillance, whilst its corresponding big data applications related to climate science still stay in their
early stage of development. Traore et al. [100] applied data mining techniques on satellite imagery data for
identifying risk areas that are exposed to epidemic crisis. Similarly, Manogaran and Lopez [101] focused
on dengue and proposed a big climate data based surveillance system for continuous monitoring and
timely warning.
disposing, etc. The authors demonstrated the framework in detail and also validated the framework
with real case studies showing significant improvement of material efficiency, energy efficiency, emission
reduction, customer service improvement and economic benefits growth. Later, Liang et al. [104]adopted
the Fruit Fly Optimisation and proposed the novel cyber physical system for machining optimisation
during the manufacturing lifecycle. Another framework named the Intelligent Immune System is
introduced in [105]. It was also designed for improving productivity and energy saving, employed
artificial neural networks for identifying abnormal patterns and planted the re-scheduling algorithm for
follow-up processing.
A special focus on the electrical and electronic equipment waste/recycling management with big
data analytics is addressed in [106]. The authors proposed and illustrated the waste recovery/recycling
framework based on the Internet of Things and big data technologies, and also discussed its economical
and technological challenges of implementation.
independent, and the prediction class can be determined for the one with the highest posterior probability.
Neural Networks are inspired by the structure and mechanism of real neurons in gathering and processing
information, and it enables the estimation of the posterior probabilities for completing the classification
task [114]. The kNN approach seeks to identify the nearest neighbours of the observation object so to
determine its class label [115].
Association Rule Mining was firstly proposed in [116] on the supermarket data set and aimed to
investigate into co-occurrences among data objects. According to [117], it is a technique for identifying the
simultaneous occurrence that happens more frequently than the average co-occurrence frequency existing
in the data set.
Regression is also considered an important big data analysis technique due to its capabilities in
dimension reduction, information extraction, estimation and prediction. The fundamental concept of
regression is to investigate the relationship between two or more variables so to assist forecasting and
decision-making. There are a variety of well established regression techniques like linear and nonlinear
regression, lasso regression, logistic regression and regression tree, etc.
Social Network Analysis (SNA) is based on the principle of graph theory and it is a relatively new big
data analysis technique. It investigates the connections and contents among objects in a massive stack
of information in order to structure a social network. The basic social network is formed by nodes and
its related nodes connected by links (namely edges) [118,119]. The most frequently used measurement
techniques to analyze the patterns consisted of the social network including degree, density and centrality.
Big Data Cogn. Comput. 2019, 3, 12 12 of 17
Author Contributions: All authors have equally contributed substantially to the work reported.
Funding: This research received no external funding.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Hassani, H.; Huang, X.; Silva, E. Digitalisation and Big Data Mining in Banking. Big Data Cogn. Comput. 2018,
2, 18. [CrossRef]
2. Hassani, H.; Huang, X.; Silva, E. Big-Crypto: Big Data, Blockchain and Cryptocurrency. Big Data Cogn. Comput.
2018, 2, 34. [CrossRef]
3. Faghmous, J.H.; Kumar, V. A big data guide to understanding climate change: The case for theory-guided data
science. Big Data 2014, 2, 155–163. [CrossRef] [PubMed]
4. Götz, M.; Richerzhagen, M.; Bodenstein, C.; Cavallaro, G.; Glock, P.; Riedel, M.; Benediktsson, J.A. On scalable
data mining techniques for earth science. Procedia Comput. Sci. 2015, 51, 2188–2197. [CrossRef]
5. Wu, J.; Guo, S.; Li, J.; Zeng, D. Big data meet green challenges: Greening big data. IEEE Syst. J. 2016, 10, 873–887.
[CrossRef]
6. Almgren, K.; Alshahrani, S.; Lee, J. Weather Data Analysis using Hadoop to Mitigate Event Planning
Disasters. Available online: https://scholarworks.bridgeport.edu/xmlui/handle/123456789/1105 (accessed on
1 February 2019).
7. Oury, D.T.M.; Singh, A. Data Analysis of Weather Data Using Hadoop Technology. In Smart Computing and
Informatics; Springer: Singapore, 2018; pp. 723–730.
8. Guo, H.D.; Zhang, L.; Zhu, L.W. Earth observation big data for climate change research. Adv. Clim. Chang. Res.
2015, 6, 108–117. [CrossRef]
Big Data Cogn. Comput. 2019, 3, 12 13 of 17
9. Faghmous, J.H.; Kumar, V. Spatio-temporal data mining for climate data: Advances, challenges, and
opportunities. In Data Mining and Knowledge Discovery for Big Data; Springer: Berlin/Heidelberg, Germany, 2014;
pp. 83–116.
10. Sun, Q.; Miao, C.; Duan, Q.; Ashouri, H.; Sorooshian, S.; Hsu, K.L. A review of global precipitation data sets:
Data sources, estimation, and intercomparisons. Rev. Geophys. 2018, 56, 79–107. [CrossRef]
11. Schnase, J.L.; Lee, T.J.; Mattmann, C.A.; Lynnes, C.S.; Cinquini, L.; Ramirez, P.M.; Webster, W.P. Big data
challenges in climate science: Improving the next-generation cyberinfrastructure. IEEE Geosci. Remote Sens. Mag.
2016, 4, 10–22. [CrossRef]
12. Witten, I.H.; Frank, E.; Hall, M.A.; Pal, C.J. Data Mining: Practical Machine Learning Tools and Techniques; Morgan
Kaufmann: Burlington, MA, USA, 2016.
13. Hassani, H.; Huang, X.; Silva, E.S.; Ghodsi, M. A review of data mining applications in crime. Stat. Anal. Data
Min. ASA Data Sci. J. 2016, 9, 139–154. [CrossRef]
14. Shi, G.R.; Yang, X.S. Optimization and data mining for fracture prediction in geosciences. Procedia Comput. Sci.
2010, 1, 1359–1366. [CrossRef]
15. Yaqoob, I.; Hashem, I.A.T.; Gani, A.; Mokhtar, S.; Ahmed, E.; Anuar, N.B.; Vasilakos, A.V. Big data: From
beginning to future. Int. J. Inf. Manag. 2016, 36, 1231–1247. [CrossRef]
16. Ahmed, E.; Yaqoob, I.; Hashem, I.A.T.; Khan, I.; Ahmed, A.I.A.; Imran, M.; Vasilakos, A.V. The role of big data
analytics in Internet of Things. Comput. Netw. 2017, 129, 459–471. [CrossRef]
17. Zhu, W.; Cui, P.; Wang, Z.; Hua, G. Multimedia big data computing. IEEE Multimed. 2015, 22, 96-c3. [CrossRef]
18. Fan, W.; Gordon, M.D. The power of social media analytics. Commun. ACM 2014, 57, 74–81. [CrossRef]
19. Khan, N.; Yaqoob, I.; Hashem, I.A.T.; Inayat, Z.; Ali, M.; Kamaleldin, W.; Alam, M.; Shiraz, M.; Gani, A. Big data:
Survey, technologies, opportunities, and challenges. Sci. World J. 2014. [CrossRef] [PubMed]
20. Chen, C.P.; Zhang, C.Y. Data-intensive applications, challenges, techniques and technologies: A survey on Big
Data. Inf. Sci. 2014, 275, 314–347. [CrossRef]
21. Fan, W; Bifet, A. Mining big data: Current status, and forecast to the future. ACM sIGKDD Explor. Newsl. 2013,
14, 1–5. [CrossRef]
22. Pickard, B.R.; Daniel, J.; Mehaffey, M.; Jackson, L.E.; Neale, A. EnviroAtlas: A new geospatial tool to foster
ecosystem services science and resource management. Ecosyst. Serv. 2015, 14, 45–55. [CrossRef]
23. Yang, C.; Yu, M.; Hu, F.; Jiang, Y.; Li, Y. Utilizing cloud computing to address big geospatial data challenges.
Comput. Environ. Urban Syst. 2017, 61, 120–128. [CrossRef]
24. Schnase, J.L.; Duffy, D.Q.; Tamkin, G.S.; Nadeau, D.; Thompson, J.H.; Grieg, C.M.; McInerney, M.A.; Webster, W.P.
MERRA analytic services: Meeting the big data challenges of climate science through cloud-enabled climate
analytics-as-a-service. Comput. Environ. Urban Syst. 2017, 61, 198–211. [CrossRef]
25. Hu, F.; Yang, C.P.; Duffy, D.; Schnase, J.L.; Li, Z. ClimateSpark: An In-memory Distributed Computing Framework
for Big Climate Data Analytics. Comput. Geosci. 2018, 115, 154–166. [CrossRef]
26. Manogaran, G.; Lopez, D. Spatial cumulative sum algorithm with big data analytics for climate change detection.
Comput. Electr. Eng. 2018, 65, 207–221. [CrossRef]
27. Manogaran, G.; Lopez, D.; Chilamkurti, N. In-Mapper combiner based MapReduce algorithm for processing of
big climate data. Future Gener. Comput. Syst. 2018, 86, 433–445. [CrossRef]
28. Wu, X.; Zhu, X.; Wu, G.Q.; Ding, W. Data mining with big data. IEEE Trans. Knowl. Data Eng. 2014, 26, 97–107.
29. Chen, M.; Mao, S.; Liu, Y. Big data: A survey. Mob. Netw. Appl. 2014, 19, 171–209. [CrossRef]
30. Gandomi, A.; Haider, M. Beyond the hype: Big data concepts, methods, and analytics. Int. J. Inf. Manag. 2015,
35, 137–144. [CrossRef]
31. Hassani, H.; Huang, X.; Ghodsi, M. Big Data and Causality. Ann. Data Sci. 2017, 5, 1–24. [CrossRef]
32. Hassani, H.; Silva, E.S. Big Data: A big opportunity for the petroleum and petrochemical industry. OPEC Energy Rev.
2018, 42, 74–89. [CrossRef]
33. Zhou, K.; Fu, C.; Yang, S. Big data driven smart energy management: From big data to big insights. Renew. Sustain.
Energy Rev. 2016, 56, 215–225. [CrossRef]
Big Data Cogn. Comput. 2019, 3, 12 14 of 17
34. Schuelke-Leech, B.A.; Barry, B.; Muratori, M.; Yurkovich, B.J. Big Data issues and opportunities for electric
utilities. Renew. Sustain. Energy Rev. 2015, 52, 937–947. [CrossRef]
35. Tardioli, G.; Kerrigan, R.; Oates, M.; James, O.D.; Finn, D. Data driven approaches for prediction of building
energy consumption at urban level. Energy Procedia 2015, 78, 3378–3383. [CrossRef]
36. Ma, Z.; Xie, J.; Li, H.; Sun, Q.; Si, Z.; Zhang, J.; Guo, J. The role of data analysis in the development of intelligent
energy networks. IEEE Netw. 2017, 31, 88–95. [CrossRef]
37. Tu, C.; He, X.; Shuai, Z.; Jiang, F. Big data issues in smart grid—A review. Renew. Sustain. Energy Rev. 2017,
79, 1099–1107. [CrossRef]
38. Reka, S.S.; Dragicevic, T. Future effectual role of energy delivery: A comprehensive review of Internet of Things
and smart grid. Renew. Sustain. Energy Rev. 2018, 91, 90–108. [CrossRef]
39. Wen, L.; Zhou, K.; Yang, S.; Li, L. Compression of smart meter big data: A survey. Renew. Sustain. Energy Rev.
2018, 91, 59–69. [CrossRef]
40. Koseleva, N.; Ropaite, G. Big data in building energy efficiency: Understanding of big data and main challenges.
Procedia Eng. 2017, 172, 544–549. [CrossRef]
41. Molina-Solana, M.; Ros, M.; Ruiz, M.D.; Gómez-Romero, J.; Martín-Bautista, M.J. Data science for building
energy management: A review. Renew. Sustain. Energy Rev. 2017, 70, 598–609. [CrossRef]
42. Fan, C.; Xiao, F.; Li, Z.; Wang, J. Unsupervised Data Analytics in Mining Big Building Operational Data for
Energy Efficiency Enhancement: A Review. Energy Build. 2018, 159, 296–308. [CrossRef]
43. Wei, Y.; Zhang, X.; Shi, Y.; Xia, L.; Pan, S.; Wu, J.; Han, M.; Zhao, X. A review of data-driven approaches for prediction
and classification of building energy consumption. Renew. Sustain. Energy Rev. 2018, 82, 1027–1047. [CrossRef]
44. Tian, W.; Heo, Y.; De Wilde, P.; Li, Z.; Yan, D.; Park, C.S.; Feng, X.; Augenbroe, G. A review of uncertainty analysis
in building energy assessment. Renew. Sustain. Energy Rev. 2018, 93, 285–301. [CrossRef]
45. Fan, C.; Xiao, F.; Yan, C. Research and Applications of Data Mining Techniques for Improving Building
Operational Performance. Curr. Sustain. Renew. Energy Rep. 2018, 5, 181–188. [CrossRef]
46. Schmidt, M.; Ahlund, C. Smart buildings as Cyber-Physical Systems: Data-driven predictive control strategies
for energy efficiency. Renew. Sustain. Energy Rev. 2018, 90, 742–756. [CrossRef]
47. Linder, L.; Vionnet, D.; Bacher, J.P.; Hennebert, J. Big Building Data—A Big Data Platform for Smart Buildings.
Energy Procedia 2017, 122, 589–594. [CrossRef]
48. Terroso-Saenz, F.; González-Vidal, A.; Ramallo-González, A.P.; Skarmeta, A.F. An open IoT platform for the
management and analysis of energy data. Future Gener. Comput. Syst. 2019, 92, 1066–1079. [CrossRef]
49. Zakovorotnyi, A.; Seerig, A. Building energy data analysis by clustering measured daily profiles. Energy Procedia
2017, 122, 583–588. [CrossRef]
50. Fan, C.; Xiao, F. Mining Gradual Patterns in Big Building Operational Data for Building Energy Efficiency
Enhancement. Energy Procedia 2017, 143, 119–124. [CrossRef]
51. Ashouri, M.; Haghighat, F.; Fung, B.C.; Lazrak, A.; Yoshino, H. Development of building energy saving advisory:
A data mining approach. Energy Build. 2018, 172, 139–151. [CrossRef]
52. Jiang, R.; Lu, R.; Choo, K.K.R. Achieving high performance and privacy-preserving query over encrypted
multidimensional big metering data. Future Gener. Comput. Syst. 2018, 78, 392–401. [CrossRef]
53. Marinakis, V.; Doukas, H.; Tsapelas, J.; Mouzakitis, S.; Sicilia, A.; Madrazo, L.; Sgouridis, S. From big data to
smart energy services: An application for intelligent energy management. Future Gener. Comput. Syst. 2018.
[CrossRef]
54. Liu, G.; Yang, J.; Hao, Y.; Zhang, Y. Big data-informed energy efficiency assessment of China industry sectors
based on K-means clustering. J. Clean. Prod. 2018, 183, 304–314. [CrossRef]
55. Eggimann, S.; Mutzner, L.; Wani, O.; Schneider, M.Y.; Spuhler, D.; Moy de Vitry, M.; Beutler, P.; Maurer, M. The
potential of knowing more: A review of data-driven urban water management. Environ. Sci. Technol. 2017,
51, 2538–2553. [CrossRef] [PubMed]
56. Stewart, R.A.; Nguyen, K.; Beal, C.; Zhang, H.; Sahin, O.; Bertone, E.; Vieira, A.S.; Castelletti, A.; Cominola, A.;
Giuliani, M.; et al. Integrated intelligent water-energy metering systems and informatics: Visioning a digital
multi-utility service provider. Environ. Model. Softw. 2018, 105, 94–117. [CrossRef]
Big Data Cogn. Comput. 2019, 3, 12 15 of 17
57. Lokers, R.; Knapen, R.; Janssen, S.; van Randen, Y.; Jansen, J. Analysis of Big Data technologies for use in
agro-environmental science. Environ. Model. Softw. 2016, 84, 494–504. [CrossRef]
58. Wolfert, S.; Ge, L.; Verdouw, C.; Bogaardt, M.J. Big data in smart farming—A review. Agric. Syst. 2017, 153, 69–80.
[CrossRef]
59. Kamilaris, A.; Kartakoullis, A.; Prenafeta-Boldu, F.X. A review on the practice of big data analysis in agriculture.
Comput. Electron. Agric. 2017, 143, 23–37. [CrossRef]
60. Kamilaris, A.; Prenafeta-Boldu, F.X. Deep learning in agriculture: A survey. Comput. Electron. Agric. 2018, 147,
70–90. [CrossRef]
61. Pham, X.; Stack, M. How data analytics is transforming agriculture. Bus. Horiz. 2018, 61, 125–133. [CrossRef]
62. Popović, T.; Latinović, N.; Pešić, A.; Zečević, Ž.; Krstajić, B.; Djukanović, S. Architecting an IoT-enabled platform
for precision agriculture and ecological monitoring: A case study. Comput. Electron. Agric. 2017, 140, 255–265.
[CrossRef]
63. Franklin, J.; Serra-Diaz, J.M.; Syphard, A.D.; Regan, H.M. Big data for forecasting the impacts of global change
on plant communities. Glob. Ecol. Biogeogr. 2017, 26, 6–17. [CrossRef]
64. Song, M.; Cen, L.; Zheng, Z.; Fisher, R.; Liang, X.; Wang, Y.; Huisingh, D. How would big data support societal
development and environmental sustainability? Insights and practices. J. Clean. Prod. 2017, 142, 489–500.
[CrossRef]
65. Li, L.; Hao, T.; Chi, T. Evaluation on China’s forestry resources efficiency based on big data. J. Clean. Prod. 2017,
142, 513–523. [CrossRef]
66. Li, T.; Cui, Y.; Liu, A. Spatiotemporal dynamic analysis of forest ecosystem services using “big data”: A case
study of Anhui province, central-eastern China. J. Clean. Prod. 2017, 142, 589–599. [CrossRef]
67. Zhang, M.; Wang, K.; Liu, H.; Zhang, C.; Yue, Y.; Qi, X. Effect of ecological engineering projects on ecosystem
services in a karst region: A case study of northwest Guangxi, China. J. Clean. Prod. 2018, 183, 831–842. [CrossRef]
68. Xie, H.; He, Y.; Xie, X. Exploring the factors influencing ecological land change for China’s Beijing–Tianjin–Hebei
region using big data. J. Clean. Prod. 2017, 142, 677–687. [CrossRef]
69. Xiao, Y.; Wu, X.Z.; Wang, L.; Liang, J. Optimal farmland conversion in China under double restraints of economic
growth and resource protection. J. Clean. Prod. 2017, 142, 524–537. [CrossRef]
70. Hughes, S. A meta-analysis of urban climate change adaptation planning in the US. Urban Clim. 2015, 14, 17–29.
[CrossRef]
71. Hashem, I.A.T.; Chang, V.; Anuar, N.B.; Adewole, K.; Yaqoob, I.; Gani, A. The role of big data in smart city. Int. J.
Inf. Manag. 2016, 36, 748–758. [CrossRef]
72. Kitchin, R. The real-time city? Big data and smart urbanism. GeoJournal 2014, 79, 1–14. [CrossRef]
73. Albino, V.; Berardi, U.; Dangelico, R.M. Smart cities: Definitions, dimensions, performance, and initiatives.
J. Urban Technol. 2015, 22, 3–21. [CrossRef]
74. Al Nuaimi, E.; Al Neyadi, H.; Mohamed, N.; Al-Jaroodi, J. Applications of big data to smart cities. J. Internet Serv.
Appl. 2015, 6, 25. [CrossRef]
75. Iqbal, R.; Doctor, F.; More, B.; Mahmud, S.; Yousuf, U. Big data analytics: Computational intelligence techniques and
application areas. Technol. Forecast. Soc. Chang. 2018. [CrossRef]
76. Haarstad, H. Constructing the sustainable city: Examining the role of sustainability in the ‘smart city’discourse.
J. Environ. Policy Plan. 2017, 19, 423–437. [CrossRef]
77. Kudva, S.; Ye, X. Smart Cities, Big Data, and Sustainability Union. Big Data Cogn. Comput. 2017, 1, 4. [CrossRef]
78. Bibri, S.E.; Krogstie, J. Smart sustainable cities of the future: An extensive interdisciplinary literature review.
Sustain. Cities Soc. 2017, 31, 183–212. [CrossRef]
79. Bibri, S.E. The IoT for Smart Sustainable Cities of the Future: An Analytical Framework for Sensor-Based Big
Data Applications for Environmental Sustainability. Sustain. Cities Soc. 2017, 38, 230–253. [CrossRef]
80. Kaur, G.; Tomar, P.; Singh, P. Design of Cloud-Based Green IoT Architecture for Smart Cities. In Internet of Things
and Big Data Analytics Toward Next-Generation Intelligence; Springer: Cham, Switzerland, 2018; pp. 315–333.
81. Lim, C.; Kim, K.J.; Maglio, P.P. Smart cities with big data: Reference models, challenges, and considerations.
Cities 2018, 82, 86–99. [CrossRef]
Big Data Cogn. Comput. 2019, 3, 12 16 of 17
82. Zhu, C.; Zhou, H.; Leung, V.C.; Wang, K.; Zhang, Y.; Yang, L.T. Toward big data in green city. IEEE Commun. Mag.
2017, 55, 14–18. [CrossRef]
83. Huang, L.; Bi, S.; Qian, L.P.; Xia, Z. Adaptive Scheduling in Energy Harvesting Sensor Networks for Green Cities.
IEEE Trans. Ind. Inform. 2018, 14, 1575–1584. [CrossRef]
84. Zhang, Y.; Mi, Z. Environmental benefits of bike sharing: A big data-based analysis. Appl. Energy 2018,
220, 296–301. [CrossRef]
85. Walther, G.R.; Post, E.; Convey, P.; Menzel, A.; Parmesan, C.; Beebee, T.J.; Fromentin, J.M.; Hoegh-Guldberg, O.;
Bairlein, F. Ecological responses to recent climate change. Nature 2002, 416, 389. [CrossRef]
86. Van Aalst, M.K. The impacts of climate change on the risk of natural disasters. Disasters 2006, 30, 5–18. [CrossRef]
87. Helmer, M.; Hilhorst, D. Natural disasters and climate change. Disasters 2006, 30, 1–4. [CrossRef] [PubMed]
88. Goswami, S.; Chakraborty, S.; Ghosh, S.; Chakrabarti, A.; Chakraborty, B. A review on application of data mining
techniques to combat natural disasters. Ain Shams Eng. J. 2016, 9, 365–378. [CrossRef]
89. Akter, S.; Wamba, S.F. Big data and disaster management: A systematic review and agenda for future research.
Ann. Oper. Res. 2017, 1–21. [CrossRef]
90. Wang, X.; Song, L.; Wang, G.; Ren, H.; Wu, T.; Jia, X.; Wu, H.; Wu, J. Operational climate prediction in the era of
big data in China: Reviews and prospects. J. Meteorol. Res. 2016, 30, 444–456. [CrossRef]
91. Miyoshi, T.; Kunii, M.; Ruiz, J.; Lien, G.Y.; Satoh, S.; Ushio, T.; Bessho, K.; Seko, H.; Tomita, H.; Ishikawa, Y. “Big Data
Assimilation” revolutionizing severe weather prediction. Bull. Am. Meteorol. Soc. 2016, 97, 1347–1354. [CrossRef]
92. Ofli, F.; Meier, P.; Imran, M.; Castillo, C.; Tuia, D.; Rey, N.; Briant, J.; Millet, P.; Reinhard, F.; Parkan, M.; et al.
Combining human computing and machine learning to make sense of big (aerial) data for disaster response. Big
Data 2016, 4, 47–59. [CrossRef]
93. Papadopoulos, T.; Gunasekaran, A.; Dubey, R.; Altay, N.; Childe, S.J.; Fosso-Wamba, S. The role of Big Data in
explaining disaster resilience in supply chains for sustainability. J. Clean. Prod. 2017, 142, 1108–1118. [CrossRef]
94. Nguyen, D.T.; Ofli, F.; Imran, M.; Mitra, P. Damage assessment from social media imagery data during disasters.
In Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and
Mining 2017, Sydney, Australia, 31 July–3 August 2017; pp. 569–576.
95. Kim, K.; Yoon, S. Assessment of Building Damage Risk by Natural Disasters in South Korea Using Decision Tree
Analysis. Sustainability 2018, 10, 1072. [CrossRef]
96. Li, S.; Cui, Y.; Liu, M.; He, H.; Ravan, S. Integrating Global Open Geo-Information for Major Disaster Assessment:
A Case Study of the Myanmar Flood. ISPRS Int. J. Geo-Inf. 2017, 6, 201. [CrossRef]
97. Cian, F.; Marconcini, M.; Ceccato, P. Normalized Difference Flood Index for rapid flood mapping: Taking
advantage of EO big data. Remote Sens. Environ. 2018, 209, 712–730. [CrossRef]
98. Ramo, R.; Garcia, M.; Rodríguez, D.; Chuvieco, E. A data mining approach for global burned area mapping. Int.
J. Appl. Earth Obs. Geoinf. 2018, 73, 39–51. [CrossRef]
99. Hay, S.I.; George, D.B.; Moyes, C.L.; Brownstein, J.S. Big data opportunities for global infectious disease
surveillance. PLoS Med. 2013, 10, e1001413. [CrossRef] [PubMed]
100. Traore, B.B.; Kamsu-Foguem, B.; Tangara, F. Data mining techniques on satellite images for discovery of risk
areas. Expert Syst. Appl. 2017, 72, 443–456. [CrossRef]
101. Manogaran, G.; Lopez, D. Disease surveillance system for big climate data processing and dengue transmission.
Int. J. Ambient Comput. Intell. 2017, 8, 88–105. [CrossRef]
102. Zhang, N.; Chen, Z. Sustainability characteristics of China’s Poyang Lake Eco-Economics Zone in the big data
environment. J. Clean. Prod. 2017, 142, 642–653. [CrossRef]
103. Zhang, Y.; Ren, S.; Liu, Y.; Sakao, T.; Huisingh, D. A framework for Big Data driven product lifecycle management.
J. Clean. Prod. 2017, 159, 229–240. [CrossRef]
104. Liang, Y.C.; Lu, X.; Li, W.D.; Wang, S. Cyber Physical System and Big Data enabled energy efficient machining
optimisation. J. Clean. Prod. 2018, 187, 46–62. [CrossRef]
105. Wang, S.; Liang, Y.C.; Li, W.D.; Cai, X.T. Big Data enabled Intelligent Immune System for energy efficient
manufacturing management. J. Clean. Prod. 2018, 195, 507–520. [CrossRef]
Big Data Cogn. Comput. 2019, 3, 12 17 of 17
106. Gu, F.; Ma, B.; Guo, J.; Summers, P.A.; Hall, P. Internet of things and Big Data as potential solutions to the
problems in waste electrical and electronic equipment management: An exploratory study. Waste Manag. 2017,
68, 434–448. [CrossRef]
107. Hassani, H.; Gheitanchi, S.; Yeganegi, M.R. On the Application of Data Mining to Official Data. J. Data Sci. 2010,
8, 75–89.
108. Hassani, H.; Saporta, G.; Silva, E.S. Data mining and official statistics: The past, the present and the future. Big
Data 2014, 2, 34–43. [CrossRef] [PubMed]
109. Pang-Ning, T.; Steinbach, M.; Kumar, V. Introduction to Data Mining, 1st ed.; Pearson: London, UK, 2014.
110. Kotsiantis, S.B.; Zaharakis, I.; Pintelas, P. Supervised machine learning: A review of classification techniques.
Emerg. Artif. Intell. Appl. Comput. Eng. 2017, 160, 3–24.
111. Quinlan, J.R. C4.5: Programs for Machine Learning; Morgan Kaufmann: Burlington, MA, USA, 1992.
112. Cortes, C.; Vapnik, V. Support Vector Networks. Mach. Learn. 1995, 20, 273–297. [CrossRef]
113. Langley, P.; Iba, W.; Thompson, K. An analysis of Bayesian classifiers. In Proceedings of the 10th National
Conference on Artificial Intelligence, San Jose, CA, USA, 12–16 July 1992; pp. 223–228.
114. Zhang, G.P. Neural networks for classification: A survey. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 2000,
30, 451–462. [CrossRef]
115. Han, E.H.S.; Karypis, G.; Kumar, V. Text categorization using weight adjusted k-nearest neighbor classification.
In Pacific-Asia Conference on Knowledge Discovery and Data Mining; Springer: Berlin/Heidelberg, Germany, 2001;
pp. 53–65.
116. Agrawal, R.; Imieliński, T.; Swami, A. Mining Association Rules between sets of items in large databases.
In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, Washington, DC,
USA, 26–28 May 1993; pp. 207–216.
117. Yun, H.; Ha, D.; Hwang, B.; Ryu, K.H. Mining Association Rules on significant rare data using relative support. J.
Syst. Softw. 2003, 67, 181–191. [CrossRef]
118. Mena, J. Investigative Data Mining for Security and Criminal Detection; Butterworth-Heinemann: Oxford, UK, 2003.
119. Fard, A.M.; Ester, M. Collaborative mining in multiple social networks data for criminal group discovery.
In Proceedings of the International Conference on Computational Science and Engineering, Vancouver, BC,
Canada, 29–31 August 2009; pp. 582–587.
120. Guo, H.; Liu, Z.; Jiang, H.; Wang, C.; Liu, J.; Liang, D. Big Earth Data: A new challenge and opportunity for
Digital Earth’s development. Int. J. Digit. Earth 2017, 10, 1–12. [CrossRef]
121. Ford, J.D.; Tilleard, S.E.; Berrang-Ford, L.; Araos, M.; Biesbroek, R.; Lesnikowski, A.C.; MacDonald, G.K.; Hsu,
A.; Chen, C.; Bizikova, L. Opinion: Big data has big potential for applications to climate change adaptation. Proc.
Natl. Acad. Sci. USA 2016, 113, 10729–10732. [CrossRef]
122. Merchant, C.J.; Paul, F.; Popp, T.; Ablain, M.; Bontemps, S.; Defourny, P.; Hollmann, R.; Lavergne, T.; Laeng, A.;
de Leeuw, G.; et al. Uncertainty information in climate data records from Earth observation. Earth Syst. Sci. Data
2017, 9, 511. [CrossRef]
123. Lee, J.G.; Kang, M. Geospatial big data: Challenges and opportunities. Big Data Res. 2015, 2, 74–81. [CrossRef]
124. Wang, H.; Xu, Z.; Fujita, H.; Liu, S. Towards felicitous decision making: An overview on challenges and trends of
Big Data. Inf. Sci. 2016, 367, 747–765. [CrossRef]
125. Khan, S.; Shakil, K.A.; Alam, M. Cloud-Based Big Data Analytics—A Survey of Current Research and Future
Directions. In Big Data Analytics; Springer: Singapore, 2018; pp. 595–604.
126. Maksimovic, M. Greening the future: Green Internet of Things (G-IoT) as a key technological enabler of
sustainable development. In Internet of Things and Big Data Analytics Toward Next-Generation Intelligence; Springer:
Cham, Switzerland, 2018; pp. 283–313.
c 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution (CC
BY) license (http://creativecommons.org/licenses/by/4.0/).