(25439251 - Data and Information Management) Big Data in Health Care - Applications and Challenges
Liang Hong, Mengqi Luo, Ruixue Wang, Peixin Lu, Wei Lu*, Long Lu*
1 Introduction
Big Data, the generic term for structured and unstructured
data sets that are so large and complex that traditional
software, algorithms, and data repositories are inadequate
to collect, process, analyze, and store them (Asante-Korang & Jacobs,
2016; Kyoungyoung Jee & Gang Hoon Kim, 2013;
Khoury & Ioannidis, 2014; Tan, Gao, & Koch, 2015),
has become an intensively studied area in recent years.
With the development of
Open Access. © 2018 Liang Hong et al., published by Sciendo. This work is licensed under the Creative Commons Attribution-
NonCommercial-NoDerivatives 3.0 License.
Personal health record (PHR) | As its name suggests, it is the health-related data and information of patients (Tang, Ash, Bates, Overhage, & Sands, 2006) and about people’s lifelong health information. It is available for further use (Chen et al., 2012) | Allergies and adverse drug reactions, chronic diseases, family history, illnesses and hospitalizations, imaging reports, laboratory test results, medications and dosing, prescription record, surgeries and other procedures, vaccinations and observations of daily living, and reported by patients (Rumsfeld, Joynt, & Maddox, 2016) | Cloud computing, Health Insurance Portability and Accountability Act (HIPAA), and HL7 (Chen et al., 2012); stored in paper like printed laboratory reports, copies of clinic notes, and health histories created by the individual; electronic devices such as personal computer-based software, CD, DVD, and smart card; web applications such as HealthVault and PatientsLikeMe; and cloud servers (Chen et al., 2012)
Big Data in medical experiment
Molecular biology experiment | Interaction and regulation of biological activity within cells, such as interactions between DNA, RNA, proteins, and biosynthesis | Molecular cloning, polymerase chain reaction (PCR), macromolecule blotting and probing, microarrays, and next-generation sequencing | NCBI
Human body samples | Data and samples of cells, tissues, and organs in human body (Bagayoko, Dufour, Chaacho, Bouhaddou, & Fieschi, 2010) | Cells, tissues, and organs | Mayo Clinic Biobanks (http://specimencentral.com/biobank-directory/)
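The PHR content listed in the table can be pictured as a simple record type. The following is only a minimal sketch in Python, assuming one list of free-text entries per content category; the class name, field names, and sample entries are hypothetical and do not come from any PHR standard or product.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PersonalHealthRecord:
    """Illustrative PHR container; every name here is hypothetical."""
    patient_id: str
    allergies_and_adverse_drug_reactions: List[str] = field(default_factory=list)
    chronic_diseases: List[str] = field(default_factory=list)
    family_history: List[str] = field(default_factory=list)
    illnesses_and_hospitalizations: List[str] = field(default_factory=list)
    imaging_reports: List[str] = field(default_factory=list)
    laboratory_test_results: List[str] = field(default_factory=list)
    medications_and_dosing: List[str] = field(default_factory=list)
    prescription_records: List[str] = field(default_factory=list)
    surgeries_and_other_procedures: List[str] = field(default_factory=list)
    vaccinations: List[str] = field(default_factory=list)
    observations_of_daily_living: List[str] = field(default_factory=list)

    def non_empty_sections(self) -> List[str]:
        """Return the names of the list-valued sections that hold entries."""
        return [name for name, value in vars(self).items()
                if isinstance(value, list) and value]


# Made-up sample entries, for illustration only.
record = PersonalHealthRecord(patient_id="example-001")
record.chronic_diseases.append("type 2 diabetes")
record.vaccinations.append("influenza vaccine")
print(record.non_empty_sections())
```

A real PHR would need typed entries, dates, and provenance for each item; flat lists of strings are used here only to keep the mapping to the content categories visible.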
2012, which is expected to reach 25,000 petabytes by 2020 (Feldman, Martin, & Skotnes, 2012).
PHR comes from a variety of patient health and social information; its main role is as a data source for medical analysis and clinical decision support (Poulymenopoulou et al., 2015). It includes data on allergies and adverse drug reactions (ADRs), chronic diseases, family history, illnesses and hospitalizations, imaging reports, laboratory test results, medications and dosing, prescription records, surgeries and other procedures, vaccinations, and observations of daily living (ODLs). Unlike other document or text data, medical imaging mainly comes from X-ray, CT, histology, PET, radiography, magnetic resonance imaging (MRI), nuclear medicine, ultrasound, elastography, tactile imaging, photoacoustic imaging, echocardiography, and so on. It contains visual elements, which means that the data are usually very large (Kovalev & Kalinovsky, 2015).
such as smartphones with third-party applications (HealthKit from Apple, Google Fit from Google, and S Health from Samsung), Android watches, and Google Glasses have been developed with sensors in the health care area (Safavi & Shukur, 2014). Since people have become more concerned with their own health on a day-to-day basis, ODLs have come to play a key role in recording personal daily health and behavior, signs, and symptoms of patients (Backonja et al., 2012).
Additionally, data on people's sports and diet also contribute significantly to Big Data in public health and behavior. In the Apple iTunes store alone, there are more than 40,000 health care apps available (Aitken & Gauntlett, 2013). It was predicted that by 2017, more than 1.7 billion people would have downloaded health care apps.
In terms of infectious diseases in public health, there is a well-known case in which Google successfully predicted the time and scale of an influenza outbreak by analyzing the search engine results.
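The text does not say how search results were turned into a prediction. As a rough illustration of the general idea only, the sketch below assumes a simple linear relationship between the weekly share of flu-related queries and the reported rate of flu-like illness, fitted by ordinary least squares. The function names and all numbers are invented for this example; this is not the model used in the case mentioned above.

```python
from typing import List, Tuple


def fit_line(x: List[float], y: List[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Synthetic values invented for illustration (not real surveillance data):
# weekly share of flu-related queries (%) and flu-like illness visits (%).
query_share = [1.0, 2.0, 3.0, 4.0]
illness_rate = [1.5, 2.5, 3.5, 4.5]

intercept, slope = fit_line(query_share, illness_rate)


def estimate_illness_rate(share: float) -> float:
    """Estimate the illness rate for a new week from its query share."""
    return intercept + slope * share


print(estimate_illness_rate(5.0))
```

The appeal of such an approach is timeliness: query counts are available almost immediately, whereas clinical reports arrive with a delay. Its weakness is that search behavior can change for reasons unrelated to illness.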
System | Description
HIS | Hospital information system; the system provides quality community for historical data resource, information, and knowledge in healthcare for hospital administration and patient health care (Bagayoko et al., 2010; Kanagaraj & Sumathi, 2011; Sirintrapun & Artz, 2016; Tsumoto, Hirano, & Iwata, 2013)
LIS | Laboratory information system; often used to collect, restore, archive, process, extract, and analyze data in the laboratory; this system aims to improve the turnaround times (TAT) of records, the quality of resource utilization, and public health support (Blaya et al., 2007; Sepulveda & Young, 2013)
RIS | Radiology information system; it is used to capture and store data including images, demographic and clinical information, and so on, also assisting in patient registration, report repository, and physician directory with advanced technology (Nance, Meenan, & Nagy, 2013)
PACS (super sound PACS, endoscope PACS) | Picture archiving and communication systems; it is a common HIS for storing and transferring digital images (Joshi & Yesha, 2012)
EMR | EMR system is used to maintain medical records and to store, process, and retrieve information. Its aim is to ensure accuracy of information in order to provide patient control and transparency, interdepartmental communication, and great reporting capabilities for treatment (Kumar & Aldrich, 2010)
Cost accounting | System for collecting, recording, classifying, analyzing, summarizing, allocating, and evaluating financial cost in the medical area
terms of handling HL7 format data, the open archive information system model was applied (Celesti, Fazio, Romano, & Villari, 2016). HIS presents the ability to capture, store, and process health care data and often requires a large number of techniques to assist it. In other words, one of the major research challenges is how to integrate advanced techniques of information processing into HIS (Roberts, 1985). Cloud computing, a technique for data storage and sharing, is widely used in information systems. The use of cloud computing in HIS is well known and very common for data processing, data backup, and information sharing between different organizations, such as cloud-based PACS and cloud-based EHR systems (He et al., 2010; Joshi & Yesha, 2012; Kanagaraj & Sumathi, 2011). Cloud security requires a high-quality security management platform that covers many aspects, including data security, application security, system security, network security, and physical security. Additionally, novel techniques have been proposed to improve the quality of HIS. For example, in order to achieve data-level interoperability, an adaptive AdapteR Interoperability ENgine (ARIEN) mediation system was proposed (Khan et al., 2014) for HIS with different health care standards. Open-source software is also available for supporting Hospital Information System (HIS) development. According to Bagayoko & Dufour (2010), web infrastructure, server operating systems, developer tools, and databases are commonly used in Europe and North America.
3 Unique Features of Big Data in Health Care
In addition to the “5V” features of Big Data, Big Data in health care has its own unique features, such as heterogeneity, incompleteness, timeliness and longevity, data privacy, and ownership.
3.1 Heterogeneity
Big Data in health care often has incompatible formats, which can be classified into structured and unstructured data. For example, some EHRs collect data in structured formats, and the International Classification of Diseases 10th revision (ICD-10) is structured (Asante-Korang & Jacobs, 2016). However, the majority of Big Data in health care is
Table 3
An Example of Data Privacy Breach
Name Sex Zip code Date of birth Address Sex Zip code Date of birth Disease