SlideShare a Scribd company logo
1
Sujan Perera
Kno.e.sis Center, Wright State University
Big Data and Smart Healthcare
Wright State Honors Institute Symposium
Healthcare is Changing
• Introduction of new federal rules and incentive
programs
• Hospitals are forced to change the process (30-day
readmission, ICD10 adaptation, quality measures)
• Free and Open health information
• Rise of discussions/forums/social media
• 70-75% Americans online have used internet to find
health information1
• Rapid growth of health related devices
• Variety of cheap sensors for health status/activity
monitoring
• IBM Watson
• Adaptation of Watson technology to Healthcare
1 http://www.additiveanalytics.com/blog/infographic-healthcare-social-media
Challenges on the way
• Huge amount of data being generated
• Scientific knowledge, social forums, patient records
• Variety of data formats (text, images, videos)
• Find the signal from noise (actionable information)
• Expert can’t keep up with the new information
• Need expert knowledge to interpret data (esp.
combination of observations)
• Trustworthiness
• Especially on social forums
• Privacy
It is clear that we need mechanisms to automate some
parts of data processing and help humans in decision
making.
This talk will concentrate on how to improve the machine
understanding of unstructured data
Structured vs Unstructured Data
Patient Disorders ICD-9 Code
Patient1 Hypertension 401
Patient2 Atrial fibrillation 427.31
Patient1 Pulmonary hypertension 416
Patient3 Edema 782.3
Patient4 hyperthyroidism 242.9
Coronary artery disease, status post four-vessel coronary
artery bypass graft surgery on , by Dr. X with a left internal
mammary artery to the left anterior descending artery,
sequential vein graft to the ramus and first diagonal, and a vein
graft to the posterior descending artery. He had normal left
ventricular function. He is having some symptoms that are
unclear if they are angina or not. I am therefore going to get
him scheduled for an exercise Cardiolite stress test.
VS
• Structured data is incomplete and not accurate2,3
• 80% of patient data is unstructured1
• Stake holders interested in unstructured data
• Medical professionals
• Scientists
• Insurance Companies
• Policy makers
• Interesting Applications
• Search
• Prediction
• Applications like CAC and CDI
• Data and knowledge mining
• Decision Support
Unstructured Data
1 http://www.zdnet.com/within-two-years-80-percent-of-medical-data-will-be-unstructured-7000013707
2Strengths and Limitations of CMS Administrative Data in Research
3Comparison of clinical and administrative data sources for hospital coronary artery bypass graft surgery report cards
Patient Data Distribution
Structured data
Unstructured data
Lab results
HbA1C, BP,
ECG
• Key indicators for readmission prediction reside in unstructured
patient notes
• facilities
• “Holter monitor was ordered by Lisa. She failed to get this
because she did not have transportation”
• non-compliance
• “Atrial fibrillation with poorly controlled ventricular rate due
to noncompliance.”
• financial status
• “The patient mentioned that Bystolic is expensive and cannot
afford it now.”
How Important is Unstructured Data
• ICD10 adaptation – need to understand the relationships
E08 - Diabetes mellitus due to underlying condition
E08.0 - Diabetes mellitus due to underlying condition with hyperosmolarity
E08.00 - without nonketotic hyperglycemic-hyperosmolar coma (NKHHC)
E08.01 - with coma
E08.1 - Diabetes mellitus due to underlying condition with ketoacidosis
E08.10 - without coma
E08.11 – with coma
• The underlying condition can be congenital rubella, Cushing's syndrome, cystic fibrosis,
malignant neoplasm, malnutrition, pancreatitis
How Important is Unstructured Data
Search Mining
Decision Support
Knowledge Discovery Prediction
NLP
+
Semantics
The Solution
• Semantic Web
– Provides a common framework that allows data to
be shared and reused across application,
enterprise, and community boundaries
– Offers mechanisms to query data and reason over
them
• Natural Language Processing
– Enable computers to understand natural language
input
The Solution
An Example
He is off both Diovan and Lotrel. I am unsure if it is due to underlying renal insufficiency. He
has actually been on atenolol alone for his hypertension.
Raw Text
Concepts
Knowledge
Inference
diovan lotrel
renal
insufficiency
atenolol hypertension
diovanvaltuna
valsartan
antihypertensive
agent
atenolol
tenominatenix
kidney
failure
renal
insufficiency
kidney
disease
disorder
blood pressure
disorder
hypertension
systoloc
hypertension
pulmonary
hypertension
Patient taking diovan
for hypertension
Patient has
kidney disease
Patient is on
antihypertensive drugs
is used to treat
is a
drug
disorder
cTAKES
ezNLP
ezKB
<problem value="Asthma" cui="C0004096"/>
<med value="Losartan" code="52175:RXNORM" />
<med value="Spiriva" code="274535:RXNORM" />
<procedure value="EKG" cui="C1623258" />
ezFIND ezMeasure ezCDIezCAC
www.ezdi.us
ezHealth Platform
Health Outcome Prediction
Thank You
Visit us: www.knoesis.org

More Related Content

PDF
Big implications of Big Data in healthcare
Guires
 
PDF
Big data in healthcare
Xavier Rafael Palou
 
PDF
Big Data Analytics in the Health Domain
BigData_Europe
 
PPTX
The future of healthcare and big data
Charles Barnett
 
PDF
Big Data in Healthcare: Hype and Hope on the Path to Personalized Medicine
New York eHealth Collaborative
 
PDF
Data-Driven Healthcare for Manufacturers
Amit Mishra
 
PDF
Data driven Healthcare for Providers
Amit Mishra
 
PPTX
Big Data applications in Health Care
Leo Barella
 
Big implications of Big Data in healthcare
Guires
 
Big data in healthcare
Xavier Rafael Palou
 
Big Data Analytics in the Health Domain
BigData_Europe
 
The future of healthcare and big data
Charles Barnett
 
Big Data in Healthcare: Hype and Hope on the Path to Personalized Medicine
New York eHealth Collaborative
 
Data-Driven Healthcare for Manufacturers
Amit Mishra
 
Data driven Healthcare for Providers
Amit Mishra
 
Big Data applications in Health Care
Leo Barella
 

What's hot (20)

PPTX
Big data in healthcare: an OTC case
Thomas Euben
 
PDF
Big data analytics in healthcare industry
Bhagath Gopinath
 
PPTX
Big data in health care
yogita gaikwad
 
PDF
Big Data Solutions for Healthcare
Odinot Stanislas
 
PPTX
Big Data in Medicine
Nasir Arafat
 
PDF
Big Data Analytics in Health Care: A Review Paper
AIRCC Publishing Corporation
 
PPTX
Big Data in healthcare - opportunities and issues
Jaco van Duivenboden
 
PPTX
The Hive Think Tank: Unpacking AI for Healthcare
The Hive
 
PDF
AI in Healthcare
Paul Agapow
 
PPTX
Big Data Analytics - Opportunities, Enablers, Challenges and Risks to Conside...
Innovation Enterprise
 
PDF
Data Governance Talking Points: Simple Lessons From the Trenches
Health Catalyst
 
PPTX
Using Big Data to Personalize the Healthcare Experience in Cancer, Genomics a...
DrBonnie360
 
PPTX
Governance And Data Protection In The Health Sector - Billy Hawkes
healthcareisi
 
PDF
Big Data Analytics in Hospitals By Dr.Mahboob ali khan Phd
Healthcare consultant
 
PDF
Benefits of Big Data in Health Care A Revolution
ijtsrd
 
PPTX
Smart health prediction using data mining by customsoft
Custom Soft
 
PDF
Lumiata
YTH
 
PPTX
Csc8710 001 winter2014-mohammed_shahnawazali-ff2687_presentation_final
ff2687
 
PDF
IRJET- Integration of Big Data Analytics in Healthcare Systems
IRJET Journal
 
PDF
Unpacking AI for Healthcare
Lumiata
 
Big data in healthcare: an OTC case
Thomas Euben
 
Big data analytics in healthcare industry
Bhagath Gopinath
 
Big data in health care
yogita gaikwad
 
Big Data Solutions for Healthcare
Odinot Stanislas
 
Big Data in Medicine
Nasir Arafat
 
Big Data Analytics in Health Care: A Review Paper
AIRCC Publishing Corporation
 
Big Data in healthcare - opportunities and issues
Jaco van Duivenboden
 
The Hive Think Tank: Unpacking AI for Healthcare
The Hive
 
AI in Healthcare
Paul Agapow
 
Big Data Analytics - Opportunities, Enablers, Challenges and Risks to Conside...
Innovation Enterprise
 
Data Governance Talking Points: Simple Lessons From the Trenches
Health Catalyst
 
Using Big Data to Personalize the Healthcare Experience in Cancer, Genomics a...
DrBonnie360
 
Governance And Data Protection In The Health Sector - Billy Hawkes
healthcareisi
 
Big Data Analytics in Hospitals By Dr.Mahboob ali khan Phd
Healthcare consultant
 
Benefits of Big Data in Health Care A Revolution
ijtsrd
 
Smart health prediction using data mining by customsoft
Custom Soft
 
Lumiata
YTH
 
Csc8710 001 winter2014-mohammed_shahnawazali-ff2687_presentation_final
ff2687
 
IRJET- Integration of Big Data Analytics in Healthcare Systems
IRJET Journal
 
Unpacking AI for Healthcare
Lumiata
 
Ad

Similar to Big Data and Smart Healthcare (20)

PPT
Content analytics for healthcare
Xavier Constant
 
PDF
Testing the applicability of digital decision support on a nationwide EHR
ICDEcCnferenece
 
PPTX
Big Data in Clinical Research
Mike Hogarth, MD, FACMI, FACP
 
PPTX
Smart Data in Health – How we will exploit personal, clinical, and social “Bi...
Amit Sheth
 
PPT
Turning Big Data Insights into Action through Advanced Analytics
Craig Rhinehart Rhinehart
 
PPTX
Predictive Analytics and Machine Learning for Healthcare - Diabetes
Dr Purnendu Sekhar Das
 
PPT
Manoj Saxena, GM IBM Watson -- Keynote at Innotech 2011
Manoj Saxena
 
PDF
Real-time Analytics for the Healthcare Industry: Arrythmia Detection- Impetus...
Impetus Technologies
 
PPTX
HXR 2016: Data Insights: Mining, Modeling, and Visualizations- Niraj Katwala
HxRefactored
 
PPTX
Clinical Healthcare Data Analytics
dansouk
 
PPTX
Working With Large-Scale Clinical Datasets
Craig Smail
 
PDF
IRJET- Medical Database Mining for Heart Disease Precautions and Early Ca...
IRJET Journal
 
PPTX
Harnessing the potential of unstructured healthcare data
ezDI
 
PDF
InterSystems presentatie: Making sense of non-structured data in the medical ...
InterSystems Benelux
 
PDF
Clinical_Decision_Support_For_Heart_Disease
Sunil Kakade
 
PPTX
The Power and Promise of Unstructured Patient Data
Healthline
 
PPTX
Demystifying Text Analytics and NLP in Healthcare
Health Catalyst
 
PDF
ICIC 2014 SEMCARE - SEMantic Data Platform for HealthCARE
Dr. Haxel Consult
 
DOCX
Heart disease prediction system
SWAMI06
 
Content analytics for healthcare
Xavier Constant
 
Testing the applicability of digital decision support on a nationwide EHR
ICDEcCnferenece
 
Big Data in Clinical Research
Mike Hogarth, MD, FACMI, FACP
 
Smart Data in Health – How we will exploit personal, clinical, and social “Bi...
Amit Sheth
 
Turning Big Data Insights into Action through Advanced Analytics
Craig Rhinehart Rhinehart
 
Predictive Analytics and Machine Learning for Healthcare - Diabetes
Dr Purnendu Sekhar Das
 
Manoj Saxena, GM IBM Watson -- Keynote at Innotech 2011
Manoj Saxena
 
Real-time Analytics for the Healthcare Industry: Arrythmia Detection- Impetus...
Impetus Technologies
 
HXR 2016: Data Insights: Mining, Modeling, and Visualizations- Niraj Katwala
HxRefactored
 
Clinical Healthcare Data Analytics
dansouk
 
Working With Large-Scale Clinical Datasets
Craig Smail
 
IRJET- Medical Database Mining for Heart Disease Precautions and Early Ca...
IRJET Journal
 
Harnessing the potential of unstructured healthcare data
ezDI
 
InterSystems presentatie: Making sense of non-structured data in the medical ...
InterSystems Benelux
 
Clinical_Decision_Support_For_Heart_Disease
Sunil Kakade
 
The Power and Promise of Unstructured Patient Data
Healthline
 
Demystifying Text Analytics and NLP in Healthcare
Health Catalyst
 
ICIC 2014 SEMCARE - SEMantic Data Platform for HealthCARE
Dr. Haxel Consult
 
Heart disease prediction system
SWAMI06
 
Ad

More from Sujan Perera (6)

PPTX
Linking Implicit entities - DBpedia Meetup
Sujan Perera
 
PPTX
Knowledge-driven Implicit Information Extraction
Sujan Perera
 
PPTX
Implicit Entity Recognition in Clinical Documents
Sujan Perera
 
PPTX
Challenges in understanding clinical notes: Why NLP engines fall short and wh...
Sujan Perera
 
PPTX
Implicit Entity Linking in Tweets
Sujan Perera
 
PPT
Semantic Web Class Project
Sujan Perera
 
Linking Implicit entities - DBpedia Meetup
Sujan Perera
 
Knowledge-driven Implicit Information Extraction
Sujan Perera
 
Implicit Entity Recognition in Clinical Documents
Sujan Perera
 
Challenges in understanding clinical notes: Why NLP engines fall short and wh...
Sujan Perera
 
Implicit Entity Linking in Tweets
Sujan Perera
 
Semantic Web Class Project
Sujan Perera
 

Recently uploaded (20)

PPTX
22PCOAM21 Session 2 Understanding Data Source.pptx
Guru Nanak Technical Institutions
 
PDF
Zero carbon Building Design Guidelines V4
BassemOsman1
 
PPTX
database slide on modern techniques for optimizing database queries.pptx
aky52024
 
PDF
The Effect of Artifact Removal from EEG Signals on the Detection of Epileptic...
Partho Prosad
 
PDF
settlement FOR FOUNDATION ENGINEERS.pdf
Endalkazene
 
PPTX
MSME 4.0 Template idea hackathon pdf to understand
alaudeenaarish
 
PDF
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
PDF
FLEX-LNG-Company-Presentation-Nov-2017.pdf
jbloggzs
 
PDF
Introduction to Ship Engine Room Systems.pdf
Mahmoud Moghtaderi
 
PPTX
Online Cab Booking and Management System.pptx
diptipaneri80
 
PDF
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
PPTX
IoT_Smart_Agriculture_Presentations.pptx
poojakumari696707
 
PPTX
Information Retrieval and Extraction - Module 7
premSankar19
 
PDF
AI-Driven IoT-Enabled UAV Inspection Framework for Predictive Maintenance and...
ijcncjournal019
 
PPTX
quantum computing transition from classical mechanics.pptx
gvlbcy
 
DOCX
SAR - EEEfdfdsdasdsdasdasdasdasdasdasdasda.docx
Kanimozhi676285
 
PDF
CAD-CAM U-1 Combined Notes_57761226_2025_04_22_14_40.pdf
shailendrapratap2002
 
PPTX
sunil mishra pptmmmmmmmmmmmmmmmmmmmmmmmmm
singhamit111
 
PDF
Biodegradable Plastics: Innovations and Market Potential (www.kiu.ac.ug)
publication11
 
PDF
Packaging Tips for Stainless Steel Tubes and Pipes
heavymetalsandtubes
 
22PCOAM21 Session 2 Understanding Data Source.pptx
Guru Nanak Technical Institutions
 
Zero carbon Building Design Guidelines V4
BassemOsman1
 
database slide on modern techniques for optimizing database queries.pptx
aky52024
 
The Effect of Artifact Removal from EEG Signals on the Detection of Epileptic...
Partho Prosad
 
settlement FOR FOUNDATION ENGINEERS.pdf
Endalkazene
 
MSME 4.0 Template idea hackathon pdf to understand
alaudeenaarish
 
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
FLEX-LNG-Company-Presentation-Nov-2017.pdf
jbloggzs
 
Introduction to Ship Engine Room Systems.pdf
Mahmoud Moghtaderi
 
Online Cab Booking and Management System.pptx
diptipaneri80
 
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
IoT_Smart_Agriculture_Presentations.pptx
poojakumari696707
 
Information Retrieval and Extraction - Module 7
premSankar19
 
AI-Driven IoT-Enabled UAV Inspection Framework for Predictive Maintenance and...
ijcncjournal019
 
quantum computing transition from classical mechanics.pptx
gvlbcy
 
SAR - EEEfdfdsdasdsdasdasdasdasdasdasdasda.docx
Kanimozhi676285
 
CAD-CAM U-1 Combined Notes_57761226_2025_04_22_14_40.pdf
shailendrapratap2002
 
sunil mishra pptmmmmmmmmmmmmmmmmmmmmmmmmm
singhamit111
 
Biodegradable Plastics: Innovations and Market Potential (www.kiu.ac.ug)
publication11
 
Packaging Tips for Stainless Steel Tubes and Pipes
heavymetalsandtubes
 

Big Data and Smart Healthcare

  • 1. 1 Sujan Perera Kno.e.sis Center, Wright State University Big Data and Smart Healthcare Wright State Honors Institute Symposium
  • 2. Healthcare is Changing • Introduction of new federal rules and incentive programs • Hospitals are forced to change the process (30-day readmission, ICD10 adaptation, quality measures) • Free and Open health information • Rise of discussions/forums/social media • 70-75% Americans online have used internet to find health information1 • Rapid growth of health related devices • Variety of cheap sensors for health status/activity monitoring • IBM Watson • Adaptation of Watson technology to Healthcare 1 http://www.additiveanalytics.com/blog/infographic-healthcare-social-media
  • 3. Challenges on the way • Huge amount of data being generated • Scientific knowledge, social forums, patient records • Variety of data formats (text, images, videos) • Find the signal from noise (actionable information) • Expert can’t keep up with the new information • Need expert knowledge to interpret data (esp. combination of observations) • Trustworthiness • Especially on social forums • Privacy
  • 4. It is clear that we need mechanisms to automate some parts of data processing and help humans in decision making. This talk will concentrate on how to improve the machine understanding of unstructured data
  • 5. Structured vs Unstructured Data Patient Disorders ICD-9 Code Patient1 Hypertension 401 Patient2 Atrial fibrillation 427.31 Patient1 Pulmonary hypertension 416 Patient3 Edema 782.3 Patient4 hyperthyroidism 242.9 Coronary artery disease, status post four-vessel coronary artery bypass graft surgery on , by Dr. X with a left internal mammary artery to the left anterior descending artery, sequential vein graft to the ramus and first diagonal, and a vein graft to the posterior descending artery. He had normal left ventricular function. He is having some symptoms that are unclear if they are angina or not. I am therefore going to get him scheduled for an exercise Cardiolite stress test. VS
  • 6. • Structured data is incomplete and not accurate2,3 • 80% of patient data is unstructured1 • Stake holders interested in unstructured data • Medical professionals • Scientists • Insurance Companies • Policy makers • Interesting Applications • Search • Prediction • Applications like CAC and CDI • Data and knowledge mining • Decision Support Unstructured Data 1 http://www.zdnet.com/within-two-years-80-percent-of-medical-data-will-be-unstructured-7000013707 2Strengths and Limitations of CMS Administrative Data in Research 3Comparison of clinical and administrative data sources for hospital coronary artery bypass graft surgery report cards
  • 7. Patient Data Distribution Structured data Unstructured data Lab results HbA1C, BP, ECG
  • 8. • Key indicators for readmission prediction reside in unstructured patient notes • facilities • “Holter monitor was ordered by Lisa. She failed to get this because she did not have transportation” • non-compliance • “Atrial fibrillation with poorly controlled ventricular rate due to noncompliance.” • financial status • “The patient mentioned that Bystolic is expensive and cannot afford it now.” How Important is Unstructured Data
  • 9. • ICD10 adaptation – need to understand the relationships E08 - Diabetes mellitus due to underlying condition E08.0 - Diabetes mellitus due to underlying condition with hyperosmolarity E08.00 - without nonketotic hyperglycemic-hyperosmolar coma (NKHHC) E08.01 - with coma E08.1 - Diabetes mellitus due to underlying condition with ketoacidosis E08.10 - without coma E08.11 – with coma • The underlying condition can be congenital rubella, Cushing's syndrome, cystic fibrosis, malignant neoplasm, malnutrition, pancreatitis How Important is Unstructured Data
  • 10. Search Mining Decision Support Knowledge Discovery Prediction NLP + Semantics The Solution
  • 11. • Semantic Web – Provides a common framework that allows data to be shared and reused across application, enterprise, and community boundaries – Offers mechanisms to query data and reason over them • Natural Language Processing – Enable computers to understand natural language input The Solution
  • 12. An Example He is off both Diovan and Lotrel. I am unsure if it is due to underlying renal insufficiency. He has actually been on atenolol alone for his hypertension. Raw Text Concepts Knowledge Inference diovan lotrel renal insufficiency atenolol hypertension diovanvaltuna valsartan antihypertensive agent atenolol tenominatenix kidney failure renal insufficiency kidney disease disorder blood pressure disorder hypertension systoloc hypertension pulmonary hypertension Patient taking diovan for hypertension Patient has kidney disease Patient is on antihypertensive drugs is used to treat is a drug disorder
  • 13. cTAKES ezNLP ezKB <problem value="Asthma" cui="C0004096"/> <med value="Losartan" code="52175:RXNORM" /> <med value="Spiriva" code="274535:RXNORM" /> <procedure value="EKG" cui="C1623258" /> ezFIND ezMeasure ezCDIezCAC www.ezdi.us ezHealth Platform
  • 15. Thank You Visit us: www.knoesis.org

Editor's Notes

  • #3: Healthcare is changing by socially, economically, by law and technologically