RoleofBigDataandPredictiveAnalyticsV1 60-Revised
RoleofBigDataandPredictiveAnalyticsV1 60-Revised
net/publication/309809606
CITATIONS READS
15 6,827
3 authors, including:
Some of the authors of this publication are also working on these related projects:
Building energy, environment and sustainability linkages in management education in India - An innovative curriculum based approach View project
All content following this page was uploaded by Yogesh Patil on 19 November 2016.
Abstract :
Big Data has emerged as an important area of interest of study and research among practitioners and
academicians. Exponential growth of data is fueled by the exponential growth of internet and digital
devices. Advancement in technology is making it economically feasible to store and analyze huge
amounts of data. Big Data includes a mix of structured, semi-structured and unstructured real time data
originating from variety of sources. Predictive Analytics provides methodology in tapping intelligence
from large data sets. Many visionary companies such as Google, Amazon etc. have realized the potential
of Big Data and Analytics in gaining competitive advantage. These techniques provide several
opportunities like discovering patterns or better optimization algorithms. Managing and analyzing Big
data also constitutes few challenges - namely size, quality, reliability and completeness of data. This
paper provides an extensive review of literature on Big Data and Predictive Analytics. It gives details of
fundamental concepts in this emerging field. Finally, we have concluded with findings of our study and
outline future research directions in this field.
Keywords : Big Data, Big Data and Analytics, Data Science, Predictive Analytics, Advanced Analytics,
Social Media Analytics
Role of Big Data and Predictive Analytics
1.0 INTRODUCTION:
In the recent years Big Data Analytics has emerged as an important area of interest among practitioners
and academicians. Exponential growth of digital devices, penetration of internet, tablet computers and
smart phones are spawning large volumes of data round the clock. Contrary to traditional data, Big Data
comes from variety of data sources in different forms. The volume, variety and velocity of this data pose
unique challenges for those managing data centers. Nevertheless, computing, storage and analysis
capabilities have caught up to meet these challenges. Storage of large datasets has become easy and
economical.
Along with traditional business data, firms are realizing value from social media data obtained from sites
such as Twitter or Facebook. These mediums have exhibited potential of gathering business intelligence
required for designing competitive strategies. In this paper, we have narrated different ways in which
firms can derive intelligence which helps business managers make informed decisions. This can translate
into improved ROI for business. This paper provides conceptual underpinnings about Big Data, Predictive
Analytics, applications of Big Data Analytics, challenges and opportunities and further research direction.
This field has a great potential to address future challenges for business and society. It provides certain
unique advantages compared to statistical sampling method.
The present paper is organized as follows. The next section (section 2) provides review of extant
literature. In this section, we review literature on Big Data, sources of data and Predictive Analytics. The
third section provides concepts about Big Data Analytics and further delves into Predictive Analytics.
Fourth section will discuss Opportunities and challenges dealing with Big Data and Predictive Analytics.
Fifth section will present conclusion followed by future research opportunities in this field.
1|Page
Role of Big Data and Predictive Analytics
research papers. After initial review of over 400 papers, we have narrowed down to over hundred research
papers in the field of Big Data and Analytics for further study. Besides, we have conducted few short
interviews with practitioners working in this field to get insights from the industry. After completing our
study and analysis, research papers from different journals were classified in Table 3 (Annexure 1). The
research papers were selected from year 1992 to 2016, with majority of them published after year 2013.
In fact, above 50% of the papers have been published in the last 3 years, which is a good indicator of
significance of this topic (indicated by red color in Figure 1 below). These papers were reviewed to
collect, refine and review various aspects of Big Data and Analytics.
Figure 1: Year-wise classification of research papers on Big Data and Predictive Analytics
we have reached an era of “Smart Phones” and “Internet of Things”. Prime purpose of early applications
of information systems was record keeping and efficient processing of business transactions. Since then,
several breakthroughs in computer science and engineering have led to information revolution in last few
decades. Chen et al. (2012) summarize this into 3 distinct phases as narrated in table 1 below.
Phase Description
I – Till year 1999 Database systems to collect, analyze and report structured data in RDBMS
systems
II – 2000- 2010 Wide use of Internet, entry and growth of internet firms Yahoo, Google, Amazon
etc., web based business applications, ecommerce, supply chains
III – 2010- onwards Entry of smart phones, RFID, Sensor technologies, Internet of Things
Table 1: Phases of Data Evolution Chen et al., (2012)
The era of Big data seem to have started around year 2000. Several developments and trends led to the
evolution of big data as depicted in figure 3 below. Big Data and Analytics are the natural outcome of the
above evolution process. This includes advancements in computing hardware, digital storage capabilities,
high speed software solutions, internet and mobile technologies.
When compared with traditional data, Big Data differs not only on the size but also in its form. It gets
added continuously rather than relatively static data in the legacy or ERP systems (Davenport, 2014).
with terra-bytes and petabytes of data created in a short span of few hours. Traditional database
management technologies are unable to scale up to the demand of storage, analysis or management of
such large volumes of continuous data from a variety of data sources. Figure 4 below gives an overview
of how Big Data is created from variety of sources and how analytics can be performed to enable decision
making.
4|Page
Role of Big Data and Predictive Analytics
data is organized into well-defined table structures in a relational database. Traditional RDBMS systems
use ETL tools and processes to extract, transform and load the data into data warehouse.
Unstructured Data: Unstructured data originates from variety of sources such as social media, text
messages, emails, attachments, videos, images and sound files. In terms of volume and velocity this is
huge and accounts for over 80% of the large datasets. Analysis and mining of this data is more
challenging than structured data.
Semi-structured data: This originates from variety of different sources which is a mix of structured and
unstructured data. Various information systems have been in use for speed, efficiency and accuracy of
information exchange with stakeholders. Firms use emails for communication, RFID technology for faster
processing in logistics (Deng et al., 2010) or sensor devices for tracking objects. Thus there are several
sources of unstructured data - emails, XML documents, server logs, communication log from RFID tags,
GPS devices, etc. There are some tags with IP address, date and time stamp, user information which is
structured. Besides, there are error messages, SQL statements, event logs which are in unstructured text
formats. Hadoop, HDFS (Hadoop Distributed File System) and MapReduce provide technological
framework to process large volumes of unstructured data.
5|Page
Role of Big Data and Predictive Analytics
Variety: Big Data originates from Variety of different sources: enterprise systems (such as ERP), social
media as well many other digital devices. This list includes text, video, audio, location, date and time
data, emails, sensors, RFID data, web applications, etc. Data is in different structured or unstructured
formats based on the source.
Veracity: Data needs Veracity – that is to understand how much percentage of data is accurate.
Value: Finally, businesses need to learn how we can design models to improve business outcomes and
derive “value” from Big Data. Obtaining value from the large heterogeneous data leads to the success of
any industry (Weber et al., 2014).
6|Page
Role of Big Data and Predictive Analytics
B. New forms of data in semi-structured format have emerged: websites, clickstreams, weblogs, XML
files, blogs, emails, etc. Further addition of social media data such as Twitter, Facebook, Instagram,
and LinkedIn have led to exponential growth in data volume, speed and type.
C. When it comes to Big Data, data flow is continuous and it comes from variety of sources. There is no
fixed source or structure to the data. Facebook records billions of posts, likes, millions of photo
uploads every hour (Kitchin, 2014).
D. The volume, variety and velocity of collection of data have far outstripped capacity of manual
analysis. In some cases it has even exceeded the capacity of conventional databases. Analyzing such
huge volumes of data require specialized technological framework such as Hadoop, which is used by
technology leaders such as Microsoft, IBM and Oracle for managing Big Data (Chen et al., 2012).
Table 2 below compares Traditional Data and Big Data:
Parameters Traditional Data Big Data
Structure of data Structures are defined Mix of Structured, semi-structured and
unstructured data
Data Volume Based on business volumes and Very high, in petabytes and even more
extent of digitization
Variety of Data Data source from database systems Besides data from business information
Sources systems, text (emails, documents),
weblogs, sensors, RFID, etc.
Velocity Low to moderate based on volume High velocity
of business
Flow Fixed Continuous round the clock
accumulation of data
Structured Data Structured Data Structured, Semi-structured and
Unstructured data
Sources of data Organizational data, trading Organizational data, RFID, Sensor data,
partners data Google searches, Social media (Linked
in, Facebook, Twitter, Whatsapp, etc.)
Analytics Provide historical view, status Real-time, direct feedback from the
reports consumer, sentiment analysis, opinions
Functions Advises senior executives on Customer facing functions get direct
internal business decisions, market feedback which can be used for
focused on analyzing data for planning market strategies, planning etc.,
7|Page
Role of Big Data and Predictive Analytics
8|Page
Role of Big Data and Predictive Analytics
9|Page
Role of Big Data and Predictive Analytics
10 | P a g e
Role of Big Data and Predictive Analytics
Text Analytics techniques derives real-time and meaningful information from unstructured data sources
such as documents, emails, web pages and social media. It is being pursued in some of the emerging areas
such as sentiment analysis, opinion mining or for extracting information from text sources. (Chen et al.,
2012). In the recent years, soon after product launch, sentiment analysis with social media data provides
early indicators of consumer feedback about product.
As the data on social media is growing and it contains valuable information for business firms,
govrnments as well as NGOs it is being tapped for deriving value. It requires a different process of data
collection and analysis due to its large volume, continuous flow and variety of data to arrive at
meaningful information. We discuss this in detail in the next sub-section.
11 | P a g e
Role of Big Data and Predictive Analytics
12 | P a g e
Role of Big Data and Predictive Analytics
of this activity, analyze the data associated with it and plan their marketing and branding strategy
accordingly. For high price products wherein consumer purchase involvement is higher, firms are better
off providing information about products rather than using entertaining content for advertisements on the
social media (Lally, 2007). For low cost commodity items, firms can use entertaining methods to attract
consumer attention (Coursaris et al., 2016).
13 | P a g e
Role of Big Data and Predictive Analytics
competitive advantage as well as some challenges in dealing with these large and continuous data sets.
We discuss these in detail in this section.
4.1 OPPORTUNITIES:
Huge volume of data used to be a technological problem just few years ago, now it presents an
opportunity (Russom, 2011). Big Data provides many opportunities and competitive advantages. Early
mover Amazon.com started collecting customer information, preferences, purchase history, search history
and books reviews. Based on this data, it provides product recommendations which motivates customer to
buy similar or related product, improving chances of additional purchases from the same customer.
Next generation retailers will be able to track behavior of individual customers and develop models for
prediction or influencers. Walmart makes use of “Social Genome” that tracks connections between
people, products, brands and other related entities. Social Genome is used to make product
recommendation to customers when they are online or in store (Direction S., 2012). Big data provides
several advantages over traditional method of data collection such as drawing samples.
14 | P a g e
Role of Big Data and Predictive Analytics
information on weather and traffic which complements very well with information from enterprise
systems.
15 | P a g e
Role of Big Data and Predictive Analytics
essential to address the hurdles of big data like noise, heterogeneity, correlation and inefficiency (Fan et
al., 2014).
4.2.3 Reliability
Another challenge is about Reliability of data. Most of the unstructured data is often unreliable, prone to
outages and losses. The data comes from different sources such as social media, smart phones, emails or
text messages (Boyd et al., 2012). In manufacturing environment, data comes from heterogeneous sources
such as information systems and variety of sensors (Le and Pang, 2013). This makes data mining process
quite intense, requires mining through a large volume of unrelated data to arrive at small piece of relevant
and meaningful information. The process can be compared with finding a needle in a haystack.
16 | P a g e
Role of Big Data and Predictive Analytics
Some of the sites like Twitter provide access to limited filtered datasets based on certain criteria, which is
again a subset of data. Researchers must be able to account for the biases in their interpretation of data
(Behar and Gordon, 1996).
17 | P a g e
Role of Big Data and Predictive Analytics
Social Media Analytics is an emerging field which showcases several ways in which firms can derive
value. It has potential opportunities in getting firsthand information about their products from market
through sentiment analysis, by getting competitive intelligence or promoting brand using social media as
a platform. Depending on how effectively firms can tap into this medium, they can have competitive
advantage.
The nature of Big Data presents some unique challenges as well as opportunities. As described above, Big
Data can be collected, stored and analyzed through a systematic process. To derive insights from the
ocean of data requires years of experience and knowledge of the interdisciplinary field of Analytics.
Business firms and governments need to build infrastructure and framework for this purpose. Like any
strategic project, analytics projects must be led from the top for its success and gaining the advantage.
There remain few obstacles in managing large datasets like quality and reliability of data as well as
availability of knowledgeable data science professionals. Data size and completeness of data, lack of
business support, inadequate staffing and skills and problems with database software are few other
barriers in implementing Big Data Analytics strategy. Big data constitutes several other challenges like
data life cycle management, redundancy of data, analytical mechanism, confidentiality of data, energy
management, cooperation and data representation (Chen et al., 2014). However, with strong leadership
and willingness these can be overcome. Further, Big Data and Predictive Analytics provide several
opportunities to study, investigate and research in different fields.
18 | P a g e
Role of Big Data and Predictive Analytics
ANNEXURE 1
# of
Sr. Journal Research
Papers
1 Web Sources 19
2 Anita Mcgahan 1
3 Strategy and Leadership 1
4 Big Data 1
5 Aslib Journal of Information Management 1
6 Big Data and Society 1
7 Fourth Quarter 1
8 Ecological Economics 1
9 Decision Support Systems 1
10 Communications of The ACM 1
11 Chemical Leasing Goes Global 1
12 Geojournal 1
13 Harvard Business Review 1
14 Industrial Management and Data Systems 1
15 Information, Communication and Society 1
16 Mckinsey Quarterly 1
17 Journal of Parallel and Distributed Computing 1
18 Journal of Industrial Ecology 1
19 Jama 1
20 International Journal of Information Management 1
21 MIT Sloan Management Review 1
22 Mobile Networks and Applications 1
23 National Science Review 1
24 Plosbiol 1
25 New York, NY: Mcgraw-Hill 1
26 Performance Evaluation Review 1
27 Phd Thesis 1
28 Science 1
29 Significance 1
19 | P a g e
Role of Big Data and Predictive Analytics
# of
Sr. Journal Research
Papers
30 Technometric 1
31 The Journal of Economic Perspectives 1
32 IEEE Transactions 6
33 Industrial and Commercial Training 1
34 Information Sciences 1
35 Information Systems 1
36 Journal of Applied Management and Entrepreneurship 1
37 Refereed Sessions 5
38 Aslib Journal of Information Management 1
39 Strategic Direction 2
40 U.S. Patent And Trademark Office 1
41 The Journal of Economic Perspectives 1
42 International Journal of Production Economics 3
43 Journal of Business Logistics 2
44 John Wiley & Sons 2
45 MIS Quarterly 1
46 Online Information Review 3
47 Proceedings 4
The International Journal of Advanced Manufacturing
48 2
Technology
49 Journal of Business Logistics 1
50 Decision Support System 1
51 International Journal of Automation and Logistics 8
Total 96
Table 3 : Classification of Research Papers on Big Data and Predictive Analytics:
The growth in the quantity and diversity of data has led to data (Waller and Journal of
sets larger than manageable by conventional hands on Fawcett, 2013) Business
20 | P a g e
Role of Big Data and Predictive Analytics
Define Big Data as a cultural, technological, and scholarly Boyd and Craw. Information,
phenomenon that rests on the interplay of: (2012) Communication &
(1) Technology: maximizing computation power and algorithmic Society, 2012
accuracy to gather, analyze, link, and compare large data sets.
(2) Analysis: drawing on large data sets to identify patterns in
order to make economic, social, technical, and legal claims.
(3) Mythology: the widespread belief that large data sets offer a
higher form of intelligence and knowledge that can generate
insights that were previously impossible, with the aura of truth,
objectivity, and accuracy.
Big data is a term that refers to an explosion of available Fan et al., (2014) Natural Science
information Review
Datasets that are too large for traditional data processing systems Provost and Data Science and
to process and therefore require new technologies. Fawcett (2013) Big Data Journal
Define Big data in terms of attributes of data – volume, variety Russom, (2011) TDWI Research
and velocity of data Report
Big Data and Big Data analytics have been used to describe the Chen et al., MIS Quarterly –
data sets and analytical techniques in applications that are so large (2012) December 2012
from terabytes to exabytes and complex (from sensor to social
media data) that they require advanced and unique data storage,
management, analysis and visualization technologies.
Huge in volume, consisting of terabytes or petabytes of data; Kitchin (2014) BIG DATA &
high in velocity, being created in or near real-time; Society
diverse in variety, being structured and unstructured in nature;
exhaustive in scope, striving to capture entire populations or
systems (n=all);
21 | P a g e
Role of Big Data and Predictive Analytics
Big data is a vague term constituting huge mass which has spread Harford(2014) Significance
among entrepreneurs, scientists, government and the media.
Too large to handle with widely available, conventional software McGahan (2013) Rotman
programs such as Excel, and thus requires specialized analytics. Management,
2013
The collection and interpretation of massive data sets, made Davenport (2014) Strategy &
possible by vast computing power that monitors a variety of Leadership
digital streams – such as sensors, marketplace interactions and
social information exchanges – and analyses them using “smart”
algorithms.
2005 Adomavicius, G., & Product recommendation systems, such as association rule mining,
Tuzhilin, A. (2005). database segmentation and clustering, anomaly detection and graph
mining
2008 Pang, B., & Lee, L. Social media analytics of customer opinions, text analytics and
(2008). sentiment analysis
2011 Russom, P. (2011) Real-time analytics are relatively rare today, but will soon be common.
2012 Minelli, M., Getting deeper insights into buying habits and preferences of
Chambers, M., & customers, Fraud Detection by monitoring credit card transactions in
Dhiraj, A. (2012). near real time,
2012 Chen, H., Chiang, R. eCommerce and Market Intelligence – recommender system
H., & Storey, V. C. eGovernance and Politics – government services, political campaigns
(2012). and e-polling
Science and Technology – Hypothesis Testing, knowledge discovery
22 | P a g e
Role of Big Data and Predictive Analytics
2012 Boyd, D., & Era of Big Data has begun. Big Data offers potential of new insights
Crawford, K. (2012). into diverse areas such as cancer research, terrorism, and climate
change.
Marketers use it for target advertising, insurance providers use it to
optimize their offerings and Wall Street bankers use it to read the
market.
2013 Waller, M. A., & Call for papers to promote research on Big Data and Supply Chain
Fawcett, S. E. Management
(2013).
2013 Provost, F., & Data driven decision making – Walmart predicted pre-hurricane need of
Fawcett, T. (2013) certain products in addition to those on usual list
2014 Fan, J., Han, F., & Genomics, Neuroscience, Economics and Finance, predict stock market
Liu, H. (2014). trends, using social network data to predict box-office revenue, leading
economic indicators, economic and social states of society, Target
predicts customer needs by analysis of transaction records
2014 Hazen, B. T., Boone, Propose interdisciplinary research to address data quality problems in
C. A., Ezell, J. D., & the context of SCM (Supply Chain Management) and DPB (Data
Jones-Farmer, L. A. Science, Predictive Analytics and Big data)
(2014)
2014 Kitchin, R. (2014) Explain that Big Data includes entire data domain and provides view of
entire population. Recognizes a tectonic shift that is driven by data
rather than knowledge. Role Big data is playing in humanity and social
sciences. Provides numerous examples of practical applications of Big
Data - Amazon’s recommendation system based on association rule
23 | P a g e
Role of Big Data and Predictive Analytics
2015 Zhong, R. Y., Huang, Logistics complexity arises due to interaction of logistic network with
G. Q., Lan, S., Dai, external system ( Rai et al., 2015). Big data approach for logistics
Q. Y., Chen, X., & trajectory discovery from RFID- enabled production data. RFID has led
Zhang, T. (2015) to satisfy the urgent requirements of logistics ( Dang et al., 2010).
Table 5 - Big Data and its use in Operations and other domains:
BIBLIOGRAPHY:
BDA: Big Data Analytics
BI: Business Intelligence
DPB: Data science, Predictive analytics and Big data
GIS: Geographic Information System
RL: Reverse Logistics
SCM: Supply Chain Management
REFERENCES:
3PLs Investing Heavily in Big Data Capabilties to Ensure Seamless Supply Chain Integration. 2014.
Retrieved January 26, 2016, From http://www.supplychain247.com/photos/3pls _investing_
heavily_in_big_data_capabilties/3
Abbott, D. (2014). Applied Predictive Analytics: Principles and Techniques for the Professional Data
Analyst.John Wiley & Sons.
Adomavicius, G., &Tuzhilin, A. (2005). Toward the next generation of recommender systems: A
survey of the state-of-the-art and possible extensions. Knowledge and Data Engineering, IEEE
Transactions on, 17(6), 734-749.
Alstete, J. W., & Cannarozzi, E. G. M. (2014). Big data in managerial decision-making: concerns and
concepts to reduce risk. International Journal of Business Continuity and Risk
Management, 5(1), 57-71.
An inconvenient truth. Paramount, 2007. Al Gore.
24 | P a g e
Role of Big Data and Predictive Analytics
Assunção, M. D., Calheiros, R. N., Bianchi, S., Netto, M. A., &Buyya, R. (2015). Big Data computing
and clouds: Trends and future directions. Journal of Parallel and Distributed Computing, 79, 3-
15.
Automobile Industry in India.2015. Retrieved November, 2015, From
http://www.ibef.org/http://www.ibef.org/industry/india-automobiles.aspx
Bag, S., & Anand, N. (2015). Modelling barriers of sustainable supply chain network design using
interpretive structural modelling: an insight from food processing sector in India. International
Journal of Automation and Logistics, 1(3), 234-255.
Batra, S. (2014). Big Data Analytics and its Reflections on DIKW Hierarchy.Review of
Management, 4(1/2), 5.
Berg, W. F., Carlin, J. D., Kalmbach, M. T., & Schroeder, M. D. (2015). U.S. Patent No. 8,989,067.
Washington, DC: U.S. Patent and Trademark Office.
Bharathi, S. V., & Mandal, T. (2015). Prioritising and ranking critical factors for sustainable cloud
ERP adoption in SMEs. International Journal of Automation and Logistics, 1(3), 294-316.
Borra, E., & Rieder, B. (2014). Programmed method: developing a toolset for capturing and analyzing
tweets. Aslib Journal of Information Management, 66(3), 262-278.
Borra, E., &Rieder, B. (2014). Programmed method: developing a toolset for capturing and analyzing
tweets. Aslib Journal of Information Management,66(3), 262-278.
Bose, R. (2009). Advanced analytics: opportunities and challenges.Industrial Management & Data
Systems, 109(2), 155-172.
Boyd, D., & Crawford, K. (2012). Critical questions for big data: Provocations for a cultural,
technological, and scholarly phenomenon. Information, communication & society, 15(5), 662-
679.
Bröhl-Kerner, H. (2008). Intelligent replacement-making optimal use of household appliances.
Refereed Sessions I-II Monday 10 March, 277.
Bughin, J., Chui, M., &Manyika, J. (2010). Clouds, big data, and smart assets: Ten tech-enabled
business trends to watch. McKinsey Quarterly, 56(1), 75-86.
Business Pandit S. (2015).Walmart Company Statistics. Retrieved October 20, 2015,
by Gareth Bell, I. (2012). Interview with Marshall Sponder, author of Social Media Analytics.
Strategic Direction, 28(6), 32-35.
Chae, B. K. (2015). Insights from hashtag# supplychain and Twitter Analytics: Considering Twitter
and Twitter data for supply chain practice and research. International Journal of Production
Economics, 165, 247-259.
25 | P a g e
Role of Big Data and Predictive Analytics
Chen, C. P., & Zhang, C. Y. (2014). Data-intensive applications, challenges, techniques and
technologies: A survey on Big Data. Information Sciences,275, 314-347.
Chen, H., Chiang, R. H., &Storey, V. C. (2012). Business Intelligence and Analytics: From Big Data
to Big Impact. MIS quarterly, 36(4), 1165-1188.
Chen, M., Mao, S., & Liu, Y. (2014). Big data: A survey. Mobile Networks and Applications, 19(2),
171-209.
Corporate & Financial Facts.Retrieved January 205, 2016, From http://news.walmart.com/walmart-
facts /corporate-financial-fact-sheet
Coursaris, C. K., van Osch, W., & Balogh, B. A. (2016). Informing Brand Messaging Strategies via
Social Media Analytics. Online Information Review, 40(1).
Davenport, T. H. (2006). Competing on analytics.harvard business review, (84), 98-107.
Davis B. (2016).Tetradata, International Blog. Tetradata, International Blog Retrieved January 25,
2016, From http://blogs.teradata.com/ international/how-much-data-we-create-
daily/industry/india-automobiles.aspx
Demirkan, H., &Delen, D. (2013).Leveraging the capabilities of service-oriented decision support
systems: Putting analytics and big data in cloud. Decision Support Systems, 55(1), 412-421.
Deng, H. F., Deng, W., Li, H., & Yang, H. J. (2010). Authentication and access control in RFID based
logistics-customs clearance service platform. International Journal of Automation and
Computing, 7(2), 180-189.
Direction, S. Big data needn't be a big headache: How to tackle mind‐blowing amounts of information.
Strategic Direction, 28(8).
Direction, S. So, are the geeks inheriting the earth?: Big Data, big decisions and best practice for
information management. Strategic Direction, 29(9).
Dou, W., Zhang, X., Liu, J., & Chen, J. (2015).HireSome-II: Towards privacy-aware cross-cloud
service composition for big data applications.Parallel and Distributed Systems, IEEE
Transactions on, 26(2), 455-466.
Dubey, R., & Gunasekaran, A. (2015). Education and training for successful career in Big Data and
Business Analytics. Industrial and Commercial Training, 47(4), 174-181.
Dubey, R., Gunasekaran, A., Childe, S. J., Wamba, S. F., & Papadopoulos, T. (2015). The impact of
big data on world-class sustainable manufacturing. The International Journal of Advanced
Manufacturing Technology, 1-15.
Dyche J. (2012).Big Data and Discovery.Retrieved December 12, 2012, From
http://jilldyche.com/2012/12/04/big-data-and-discovery/
26 | P a g e
Role of Big Data and Predictive Analytics
E. Prescott, M. (2014). Big data and competitive advantage at Nielsen.Management Decision, 52(3),
573-601.
Embry, K. (2015). Moonshot! Game-Changing Strategies to Build Billion-Dollar Businesses.Journal
of Applied Management and Entrepreneurship, 20(2), 130.
Fan, J., Han, F., & Liu, H. (2014).Challenges of big data analysis.National science review, 1(2), 293-
314.
Fox, S., & Do, T. (2013). Getting real about Big Data: applying critical realism to analyse Big Data
hype. International Journal of Managing Projects in Business, 6(4), 739-760.
Franqueira, T. (2009). Creative Places for Collaborative Cities (Doctoral dissertation, PhD thesis,
Politecnico di Milano).
From http://www.statisticbrain.com/wal-mart-company-statistics/
Gandomi, A., &Haider, M. (2015). Beyond the hype: Big data concepts, methods, and
analytics. International Journal of Information Management,35(2), 137-144.
Geerdink, B. (2013). A Reference Architecture for Big Data Solutions.
Genome.In Wikipedia. Retrieved Jnauary 26, 2016 , From https://en.wikipedia.org/wiki/Genome
Gilbert, S., & Paulin, D. (2015, January). Tweet to learn: Expertise and centrality in conference
Twitter networks. In System Sciences (HICSS), 2015 48th Hawaii International Conference
on (pp. 1920-1929).IEEE.
Girtelschmid, S., Steinbauer, M., Kumar, V., Fensel, A., & Kotsis, G. (2014). On the application of
Big Data in future large-scale intelligent Smart City installations. International Journal of
Pervasive Computing and Communications, 10(2), 168-182.
H. Davenport, T. (2014). How strategists use “big data” to support internal business decisions,
discovery and production. Strategy & Leadership, 42(4), 45-50.
Hahn, G. J., & Packowski, J. (2015). A perspective on applications of in-memory analytics in supply
chain management. Decision Support Systems, 76, 45-52.
Harford, T. (2014). Big data: A big mistake?. Significance, 11(5), 14-19.
Harris D. (2013).Why Apple, eBay, and Walmart have some of the biggest data warehouses you’ve
ever seen. Retrieved march 27, 2013, From https://gigaom.com/2013/03/27/why-apple-ebay-
and-walmart-have-some-of-the-biggest-data-warehouses-youve-ever-seen/
Hashem, I. A. T., Yaqoob, I., Anuar, N. B., Mokhtar, S., Gani, A., & Khan, S. U. (2015). The rise of
“big data” on cloud computing: Review and open research issues. Information Systems, 47, 98-
115.
Hazen, B. T., Boone, C. A., Ezell, J. D., & Jones-Farmer, L. A. (2014). Data quality for data science,
predictive analytics, and big data in supply chain management: An introduction to the problem
27 | P a g e
Role of Big Data and Predictive Analytics
28 | P a g e
Role of Big Data and Predictive Analytics
Lazer, David, Ryan Kennedy, Gary King, and Alessandro Vespignani. "The parable of Google Flu:
traps in big data analysis." Science 343, no. 14 March (2014).
Le, C. V., & Pang, C. K. (2013). An energy data-driven decision support system for high-performance
manufacturing industries. International Journal of Automation and Logistics, 1(1), 61-79.
Li, J., Tao, F., Cheng, Y., & Zhao, L. (2015). Big data in product lifecycle management. The
International Journal of Advanced Manufacturing Technology, 81(1-4), 667-684.
Liu, Z., Chen, Y., Bash, C., Wierman, A., Gmach, D., Wang, Z., ...&Hyser, C. (2012, June).
Renewable and cooling aware workload management for sustainable data centers. In ACM
SIGMETRICS Performance Evaluation Review (Vol. 40, No. 1, pp. 175-186).ACM.
Manning C. (1999).Foundations of Statistical Natural Language Processing. Retrieved June 18, 1999
From
http://www.amazon.com/gp/product/0262133601/ref=as_li_ss_tl?ie=UTF8&camp=1789&creati
ve=390957&creativeASIN=0262133601&linkCode=as2&tag=algoshopa-
20&l=as2&o=1&a=0262133601
Manzini, E., Jégou, F., &Penin, L. (2008).Creative Communities for Sustainable Lifestyles.Refereed
Sessions I-II Monday 10 March, 259.
Marr B. (2015).Big Data, Walmart And The Future Of Retail. Volume(). Retrieved February 19,
2015, From https://www.linkedin.com/pulse/big-data-walmart-future-retail-bernard-marr
Marshall, A., Mueck, S., & Shockley, R. (2015). How leading organizations use big data and analytics
to innovate. Strategy & Leadership, 43(5), 32-39.
Myers, S. A., Sharma, A., Gupta, P., & Lin, J. (2014, April). Information network or social network?:
The structure of the twitter follow graph. InProceedings of the companion publication of the
23rd international conference on World wide web companion (pp. 493-498). International World
Wide Web Conferences Steering Committee.
Nair, P. R. (2012). Supply Chain Analytics. CSI Communications, 33(9), 11.
Neuroscience.In Wikipedia. Retrieved Jnauary 26, 2016, From
https://en.wikipedia.org/wiki/Neuroscience
Opresnik, D., & Taisch, M. (2015). The value of Big Data in servitization.International Journal of
Production Economics, 165, 174-184.
P. Noordhuis, M. Heijkoop, A. Lazovik, Mining twitter in the cloud: A case study, Cloud Computing
(CLOUD), 2010, in: Proceedings of IEEE 3rd International Conference on, IEEE, Miami, FL,
2010, pp. 107–114.
Pang and Lee B. (2008).Opinion mining and sentiment analysis.Volume (2). Retrieved 2008, From
http://www.cs.cornell.edu/home/llee/opinion-mining-sentiment-analysis-survey.html
29 | P a g e
Role of Big Data and Predictive Analytics
Provost, F., & Fawcett, T. (2013).Data science and its relationship to big data and data-driven decision
making. Big Data, 1(1), 51-59.
Rai, S. S., Sharma, V., & Ganguly, K. (2016). Logistics complexity in Indian garment supply chain.
International Journal of Automation and Logistics.
Rogers, D. P. (2012). HR Metrics and Analytics. The Encyclopedia of Human Resource Management:
Short Entries, 252-257.
Russom, P. (2011). Big data analytics. TDWI Best Practices Report, Fourth Quarter.
Sahay, B. S., & Ranjan, J. (2008). Real time business intelligence in supply chain analytics.
Information Management & Computer Security, 16(1), 28-48.
Schoenherr, T., & Speier‐Pero, C. (2015). Data science, predictive analytics, and big data in supply
chain management: Current state and future potential. Journal of Business Logistics, 36(1), 120-
132.
Schwager, P. (2008). Chemical Leasing and cleaner production.Chemical Leasing Goes Global, 9-15.
Sentiment 140.For Academics. Retrieved January 26, 2016,http://help.sentiment140.com/for-students
Stephens, Z. D., Lee, S. Y., Faghri, F., Campbell, R. H., Zhai, C., Efron, M. J., ...& Robinson, G. E.
(2015). Big data: astronomical or genomical?. PLoSBiol, 13(7), e1002195.
Tan, K. H., Zhan, Y., Ji, G., Ye, F., & Chang, C. (2015). Harvesting big data to enhance supply chain
innovation capabilities: An analytic infrastructure based on deduction graph. International
Journal of Production Economics, 165, 223-233.
teradata.com/international/how-much-data-we-create-daily
The Four V's of Big Data . Retrieved January 25, 2016, From http://www.ibmbigdatahub.com
/infographic/four-vs-big-data
The Road to Big Data Visibility Doesn’t Run through Your 3PL Technology.2014. Retrieved August
20, 2014, From
http://www.supplychain247.com/article/the_road_to_big_data_visibility_doesnt_run_through_y
our_3pl_technology/gt_nexus
Tischa, A., &Kalteneggerb, I. (2008). Energy Saving Performance Contracting for Federally Owned
Public Buildings–Factors of Success. Refereed Sessions I-II Monday 10 March, 337.
Tiwari, S. (2015). Framework for adopting sustainability in the supply chain.International Journal of
Automation and Logistics, 1(3), 256-272.
Unlocking The Big Promise of Big Data – Anita McGahan (2013)
Van den Burg, S. W. K. (2008, February). Towards future sustainable consumption policies; a three-
tier approach. In Proceedings: Refereed Sessions I-II, Sustainable Consumption and Production:
30 | P a g e
Role of Big Data and Predictive Analytics
Framework for Action, 2nd Conference of the Sustainable Consumption Research Exchange
(SCORE (pp. 215-224).
VanRijmenam M. (2012).Walmart Is Making Big Data Part Of Its DNA. Retrieved March 15, 2012,
From https://datafloq.com/read/walmart-making-big-data-part-dna/509
Varian, H. R. (2014). Big data: New tricks for econometrics. The Journal of Economic Perspectives, 3-
27.
Venkatesh, V. G., Bhattacharya, S., Sethi, M., & Dua, S. (2015). Performance measurement of
sustainable third party reverse logistics provider by data envelopment analysis: a case study of an
Indian apparel manufacturing group. International Journal of Automation and Logistics, 1(3),
273-293.
Vera-Baquero, A., Colomo Palacios, R., Stantchev, V., & Molloy, O. (2015). Leveraging big-data for
business process analytics. The Learning Organization, 22(4), 215-228.
Vermeir, I., &Verbeke, W. (2008). Sustainable food consumption among young adults in Belgium:
Theory of planned behaviour and the role of confidence and values. Ecological Economics,
64(3), 542-553.
Vilajosana, I., Llosa, J., Martinez, B., Domingo-Prieto, M., Angles, A., &Vilajosana, X. (2013).
Bootstrapping smart cities through a self-sustainable model based on big data
flows. Communications Magazine, IEEE, 51(6), 128-134.
Waller, M. A., & Fawcett, S. E. (2013). Click Here for a Data Scientist: Big Data, Predictive
Analytics, and Theory Development in the Era of a Maker Movement Supply Chain. Journal of
Business Logistics, 34(4), 249-252.
Waller, M. A., & Fawcett, S. E. (2013). Data science, predictive analytics, and big data: a revolution
that will transform supply chain design and management. Journal of Business Logistics, 34(2),
77-84.
Wamba, S. F., Akter, S., Edwards, A., Chopin, G., & Gnanzou, D. (2015). How ‘big data’can make
big impact: Findings from a systematic review and a longitudinal case study. International
Journal of Production Economics, 165, 234-246.
Wang, L., Zhan, J., Luo, C., Zhu, Y., Yang, Q., He, Y., ...& Zheng, C. (2014, February).
Bigdatabench: A big data benchmark suite from internet services. In High Performance
Computer Architecture (HPCA), 2014 IEEE 20th International Symposium on (pp. 488-
499).IEEE.
Wang, R. Y. (1998). A product perspective on total data quality management.Communications of the
ACM, 41(2), 58-65.
31 | P a g e
Role of Big Data and Predictive Analytics
Weber, G. M., Mandl, K. D., &Kohane, I. S. (2014).Finding the missing link for big biomedical
data. Jama, 311(24), 2479-2480.
Wu, Xindong, Xingquan Zhu, Gong-Qing Wu, and Wei Ding. "Data mining with big
data." Knowledge and Data Engineering, IEEE Transactions on 26, no. 1 (2014): 97-107.
Yin, X. F., Fu, X., Ponnambalam, L., & Goh, R. S. M. (2016). A k-means clustering for supply chain
risk management with embedded network connectivity. International Journal of Automation and
Logistics, 2(1-2), 108-121.
Zhong, R. Y., Huang, G. Q., Lan, S., Dai, Q. Y., Chen, X., & Zhang, T. (2015). A big data approach
for logistics trajectory discovery from RFID-enabled production data. International Journal of
Production Economics, 165, 260-272.
32 | P a g e