SlideShare a Scribd company logo
1
Large Scale Data Management,
Computation, and Analysis for
Quantitative Imaging Research
Ashish Sharma, Ph.D.
Department of Biomedical Informatics,
Emory University
Fred Prior, Ph.D.
Department of Biomedical Informatics,
Univ. of Arkansas for Medical Sciences
2
High Quality Data
The Role of Data Curation
Data Dump or
Information Resource
3
Challenges to Advances
in Quantitative Imaging
“Data! Data! Data!” he cried impatiently. “I
can’t make bricks without clay.”
— Sherlock Holmes
• Large, well-curated, repositories
• High Quality Data
• Radiology, Pathology, RT
• Clinical, Genomic
• Derived Feature Sets
• Security and Access Control
• Flexible Access
• Co-Located Computation
• Reproducible pipelines
• Systematic, Reproducible tools for
imaging phenotype/biomarker
exploration
4
Resources Provided by the UAMS + Emory QIN
Features
5
Built on The Cancer Imaging Archive
TCIA encourages and supports cancer-
related open science communities by
hosting and managing the data archive,,
and relevant resources to facilitate
collaborative research.
6
RT Collections and
Curation Process Partial collections:
• RTOG 0522
• NRG-1106 (private)
• NRG-1308 (Private)
TCGA-HNSC
Additional collections in
process or planned
7
Automated RT,
Image Set Curation
Individual  Bulk Correction
Reduce the time needed to
correct data inconsistencies
Curation Time:
Days  few minutes
(15 Collections  120K images)
Extend POSDA to all curation
related checks and activities
8
Digital Pathology
caMicroscope
• Visualization of Pathology
Images and Features
• Data Management
• Native Support for Scanner
Format (Scanner  caMic)
• Dynamic Services (launch
analysis from browser)
• Scientific Mashups
(DataScope and DataCafe)
Supported by NCIP/CTIIP Project; ITCR U24
9
Pragmatic Randomized Trial of Proton versus Photon
Therapy for Locally Advanced Breast Cancer
• Treatment plans for Photon (left) and
Proton (right) Adjuvant therapy for Lung
cancer
• Dose Value Histogram Demonstrates that
with IMRT, 50% of the heart receives ~ 36
Gy, while proton delivers < 3Gy.
Dose Value Histogram
Intensity Modulated
Radiation Therapy
Proton Therapy
10
Data Access
REST APIs to access TCIA
Data
APIs to access protected
data (coming soon)
Mashups from fusion of
diverse datasets to support
• Data exploration
• Hypothesis formulation
• Discovery of latent
relationships and patterns
11
Reproducible
Science
Incentives for data sharing
TCIA assigns DOIs for individual
collections as well as data subsets
that were used in publications
One click access to data via DOI
Use this DOI to get a Nature
Scientific Data publication
Will be developing mechanisms to
track usage
12
Co-Located Data Analysis
13
Radiomics Portal
(Galaxy + Lessons from
Connectome Workbench)
Fast access to co-located
data
Imaging, RT, Radiomic
Features (TCIA)
Clinical and other
structured
(i2b2/Eureka!)
Lung Segmentation Pilot
Tool Dissemination
14
Lung Analysis
Pipeline
1. Segment lung field from CT
2. Use three parallel
segmentation algorithms to
identify all closed structures
3. Voting algorithm produces final
segmented regions
4. Feature Extraction (shape,
texture, margin,...)
5. Feature Classification
(optional)
6. Image-based phenotype
Top, Green  Expert Manual Segmentation
Bottom, Blue  Automated Segmentation
The results of three independent automatic
segmentation algorithms are combined using a
voting mechanism to arrive at final automatic
segmentation.
15
Biomarker Exploration Pipeline
• Data Loading: Radiomics data is
integrated into Eureka! Custom sources
can be incorporated via REST APIs
• Construct Radiomic Phenotypes: Using
the Eureka GUI, users can explore their
radiomics data and construct radiomic
phenotypes, that incorporate clinical,
imaging and computed features. These
phenotypes are captured as graphs and
stored in a graph database (Neo4J)
i2b2
TCIA
MongoDB &
Others…
Radiomic
Phenotypes
16
Eureka! Clinical Analytics
• Open source (Apache 2 license) RESTful clinical phenotyping system
Clinical phenotyping: computing temporal patterns from EHR data
reflecting disease, therapeutic response and prognosis
• Supports loading data and computed phenotypes from data warehouses
and flat files into i2b2, Neo4j and flat files
• One REST service + web interface controlling many ETL processes setup
with configurable metadata stored in an extended i2b2 metadata schema
17
Use Cases
• Construct phenotypes by
fusing clinical and derived
feature data
• Compute phenotypes on
available data
• Ongoing pilot with NLST
data
• LungRADS
• Study changes to LungRADS
screening recommendations by
incorporating demographics
18
Summary
• TCIA is a stable yet continually evolving and expanding
information resource for QIN and other Cancer Imaging
initiatives
• Radiomic & Radiogenomic analyses require information fusion
and HPC pipelines to deal with large subject populations and
advanced computationally intensive algorithms
• We are exploring the potential of OODA graph statistics for
multi-parametric biomarker discovery and validation.
• NCI QIN: Resources for development
and validation of Radiomic Analyses &
Adaptive Therapy — PI: Prior, Sharma
(UAMS, Emory)
• NCIP/Leidos caMicroscope — A Digital
Pathology Integrative Query System;
PI: Sharma
• PCORI, Pragmatic Randomized Trial of
Proton versus Photon Therapy for
Locally Advanced Breast Cancer to
compare the effectiveness of photon vs.
proton Radiotherapy. (PI: Bekelman)
• Leidos Biomedical Research, Contract
16X011 for NCI, Maintenance and
Extension of The Cancer Imaging
Archive (TCIA), (PI: Prior)
Ad

More Related Content

What's hot (20)

High Dimensional Fused-Informatics
High Dimensional Fused-InformaticsHigh Dimensional Fused-Informatics
High Dimensional Fused-Informatics
Joel Saltz
 
Provenance abstraction for implementing security: Learning Health System and ...
Provenance abstraction for implementing security: Learning Health System and ...Provenance abstraction for implementing security: Learning Health System and ...
Provenance abstraction for implementing security: Learning Health System and ...
Vasa Curcin
 
MPS webinar master deck
MPS webinar master deckMPS webinar master deck
MPS webinar master deck
Pistoia Alliance
 
openEHR: UK 100,000 Genomes project
openEHR: UK 100,000 Genomes projectopenEHR: UK 100,000 Genomes project
openEHR: UK 100,000 Genomes project
openEHR Foundation
 
The Future of Personalized Medicine
The Future of Personalized MedicineThe Future of Personalized Medicine
The Future of Personalized Medicine
Edgewater
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance
 
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron PantanowitzComputer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Cirdan
 
Machine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of DataMachine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of Data
Joel Saltz
 
Data Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer DiagnosisData Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer Diagnosis
DataminingTools Inc
 
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
European School of Oncology
 
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
Kees van Bochove
 
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
D3 Consutling
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: EMIF (European Medical In...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: EMIF (European Medical In...tranSMART Community Meeting 5-7 Nov 13 - Session 5: EMIF (European Medical In...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: EMIF (European Medical In...
David Peyruc
 
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Kees van Bochove
 
openEHR v COVID-19
openEHR v COVID-19openEHR v COVID-19
openEHR v COVID-19
openEHR-Japan
 
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUESPREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
cscpconf
 
How Technologies can help us comply
How Technologies can help us complyHow Technologies can help us comply
How Technologies can help us comply
3GDR
 
IRJET- Develop Futuristic Prediction Regarding Details of Health System for H...
IRJET- Develop Futuristic Prediction Regarding Details of Health System for H...IRJET- Develop Futuristic Prediction Regarding Details of Health System for H...
IRJET- Develop Futuristic Prediction Regarding Details of Health System for H...
IRJET Journal
 
openEHR template development for COVID-19
openEHR template development for COVID-19openEHR template development for COVID-19
openEHR template development for COVID-19
openEHR-Japan
 
Research Data Management for Clinical Trials and Quality Improvement
Research Data Management for Clinical Trials and Quality ImprovementResearch Data Management for Clinical Trials and Quality Improvement
Research Data Management for Clinical Trials and Quality Improvement
Margaret Henderson
 
High Dimensional Fused-Informatics
High Dimensional Fused-InformaticsHigh Dimensional Fused-Informatics
High Dimensional Fused-Informatics
Joel Saltz
 
Provenance abstraction for implementing security: Learning Health System and ...
Provenance abstraction for implementing security: Learning Health System and ...Provenance abstraction for implementing security: Learning Health System and ...
Provenance abstraction for implementing security: Learning Health System and ...
Vasa Curcin
 
openEHR: UK 100,000 Genomes project
openEHR: UK 100,000 Genomes projectopenEHR: UK 100,000 Genomes project
openEHR: UK 100,000 Genomes project
openEHR Foundation
 
The Future of Personalized Medicine
The Future of Personalized MedicineThe Future of Personalized Medicine
The Future of Personalized Medicine
Edgewater
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance
 
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron PantanowitzComputer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Cirdan
 
Machine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of DataMachine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of Data
Joel Saltz
 
Data Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer DiagnosisData Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer Diagnosis
DataminingTools Inc
 
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
European School of Oncology
 
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
Kees van Bochove
 
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
D3 Consutling
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: EMIF (European Medical In...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: EMIF (European Medical In...tranSMART Community Meeting 5-7 Nov 13 - Session 5: EMIF (European Medical In...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: EMIF (European Medical In...
David Peyruc
 
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Kees van Bochove
 
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUESPREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
cscpconf
 
How Technologies can help us comply
How Technologies can help us complyHow Technologies can help us comply
How Technologies can help us comply
3GDR
 
IRJET- Develop Futuristic Prediction Regarding Details of Health System for H...
IRJET- Develop Futuristic Prediction Regarding Details of Health System for H...IRJET- Develop Futuristic Prediction Regarding Details of Health System for H...
IRJET- Develop Futuristic Prediction Regarding Details of Health System for H...
IRJET Journal
 
openEHR template development for COVID-19
openEHR template development for COVID-19openEHR template development for COVID-19
openEHR template development for COVID-19
openEHR-Japan
 
Research Data Management for Clinical Trials and Quality Improvement
Research Data Management for Clinical Trials and Quality ImprovementResearch Data Management for Clinical Trials and Quality Improvement
Research Data Management for Clinical Trials and Quality Improvement
Margaret Henderson
 

Viewers also liked (19)

Radiomics: Novel Paradigm of Deep Learning for Clinical Decision Support towa...
Radiomics: Novel Paradigm of Deep Learning for Clinical Decision Support towa...Radiomics: Novel Paradigm of Deep Learning for Clinical Decision Support towa...
Radiomics: Novel Paradigm of Deep Learning for Clinical Decision Support towa...
Wookjin Choi
 
E-book Thesis Sara Carvalho
E-book Thesis  Sara CarvalhoE-book Thesis  Sara Carvalho
E-book Thesis Sara Carvalho
Sara Carvalho
 
Kom vive la fête
Kom vive la fêteKom vive la fête
Kom vive la fête
rcmuziek7
 
University of Toronto - Radiomics for Oncology - 2017
University of Toronto  - Radiomics for Oncology - 2017University of Toronto  - Radiomics for Oncology - 2017
University of Toronto - Radiomics for Oncology - 2017
Andre Dekker
 
Intel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analyticsIntel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analytics
Carestream
 
Impacto assistencial dos laudos estruturados em radiologia: revisão integrativa
Impacto assistencial dos laudos estruturados em radiologia: revisão integrativaImpacto assistencial dos laudos estruturados em radiologia: revisão integrativa
Impacto assistencial dos laudos estruturados em radiologia: revisão integrativa
Thiago Julio, MD
 
useR2011 - Whitcher
useR2011 - WhitcheruseR2011 - Whitcher
useR2011 - Whitcher
rusersla
 
Radiology nurse kpi
Radiology nurse kpiRadiology nurse kpi
Radiology nurse kpi
asfoskenfur
 
The Case for Imaging Analytics
The Case for Imaging AnalyticsThe Case for Imaging Analytics
The Case for Imaging Analytics
Joachim Surich
 
The Complementary Roles of Computer-Aided Diagnosis and Quantitative Image A...
 The Complementary Roles of Computer-Aided Diagnosis and Quantitative Image A... The Complementary Roles of Computer-Aided Diagnosis and Quantitative Image A...
The Complementary Roles of Computer-Aided Diagnosis and Quantitative Image A...
Carestream
 
Biomedical Imaging Informatics
Biomedical Imaging InformaticsBiomedical Imaging Informatics
Biomedical Imaging Informatics
Dr. Rupak Chakravarty
 
Radioterapia
RadioterapiaRadioterapia
Radioterapia
mar-lib E
 
Conquest for dummies: how to setup a pacs server
Conquest for dummies: how to setup a pacs serverConquest for dummies: how to setup a pacs server
Conquest for dummies: how to setup a pacs server
Bogazici University
 
Radioterapia 4D
Radioterapia 4DRadioterapia 4D
Radioterapia 4D
Gemelli Advanced Radiation Therapy
 
BALKAN MCO 2011 - E. Vrdoljak - Radiotherapy
BALKAN MCO 2011 - E. Vrdoljak - RadiotherapyBALKAN MCO 2011 - E. Vrdoljak - Radiotherapy
BALKAN MCO 2011 - E. Vrdoljak - Radiotherapy
European School of Oncology
 
Inteligência de Negócios em Radiologia / Business Intelligence in Radiology
Inteligência de Negócios em Radiologia / Business Intelligence in Radiology Inteligência de Negócios em Radiologia / Business Intelligence in Radiology
Inteligência de Negócios em Radiologia / Business Intelligence in Radiology
Thiago Julio, MD
 
Respiratory Gating with IMRT
Respiratory Gating with IMRTRespiratory Gating with IMRT
Respiratory Gating with IMRT
kathrnrt
 
DICOM Structure Basics
DICOM Structure BasicsDICOM Structure Basics
DICOM Structure Basics
Gunjan Patel
 
4D Radiotherapy
4D Radiotherapy4D Radiotherapy
4D Radiotherapy
fondas vakalis
 
Radiomics: Novel Paradigm of Deep Learning for Clinical Decision Support towa...
Radiomics: Novel Paradigm of Deep Learning for Clinical Decision Support towa...Radiomics: Novel Paradigm of Deep Learning for Clinical Decision Support towa...
Radiomics: Novel Paradigm of Deep Learning for Clinical Decision Support towa...
Wookjin Choi
 
E-book Thesis Sara Carvalho
E-book Thesis  Sara CarvalhoE-book Thesis  Sara Carvalho
E-book Thesis Sara Carvalho
Sara Carvalho
 
Kom vive la fête
Kom vive la fêteKom vive la fête
Kom vive la fête
rcmuziek7
 
University of Toronto - Radiomics for Oncology - 2017
University of Toronto  - Radiomics for Oncology - 2017University of Toronto  - Radiomics for Oncology - 2017
University of Toronto - Radiomics for Oncology - 2017
Andre Dekker
 
Intel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analyticsIntel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analytics
Carestream
 
Impacto assistencial dos laudos estruturados em radiologia: revisão integrativa
Impacto assistencial dos laudos estruturados em radiologia: revisão integrativaImpacto assistencial dos laudos estruturados em radiologia: revisão integrativa
Impacto assistencial dos laudos estruturados em radiologia: revisão integrativa
Thiago Julio, MD
 
useR2011 - Whitcher
useR2011 - WhitcheruseR2011 - Whitcher
useR2011 - Whitcher
rusersla
 
Radiology nurse kpi
Radiology nurse kpiRadiology nurse kpi
Radiology nurse kpi
asfoskenfur
 
The Case for Imaging Analytics
The Case for Imaging AnalyticsThe Case for Imaging Analytics
The Case for Imaging Analytics
Joachim Surich
 
The Complementary Roles of Computer-Aided Diagnosis and Quantitative Image A...
 The Complementary Roles of Computer-Aided Diagnosis and Quantitative Image A... The Complementary Roles of Computer-Aided Diagnosis and Quantitative Image A...
The Complementary Roles of Computer-Aided Diagnosis and Quantitative Image A...
Carestream
 
Radioterapia
RadioterapiaRadioterapia
Radioterapia
mar-lib E
 
Conquest for dummies: how to setup a pacs server
Conquest for dummies: how to setup a pacs serverConquest for dummies: how to setup a pacs server
Conquest for dummies: how to setup a pacs server
Bogazici University
 
Inteligência de Negócios em Radiologia / Business Intelligence in Radiology
Inteligência de Negócios em Radiologia / Business Intelligence in Radiology Inteligência de Negócios em Radiologia / Business Intelligence in Radiology
Inteligência de Negócios em Radiologia / Business Intelligence in Radiology
Thiago Julio, MD
 
Respiratory Gating with IMRT
Respiratory Gating with IMRTRespiratory Gating with IMRT
Respiratory Gating with IMRT
kathrnrt
 
DICOM Structure Basics
DICOM Structure BasicsDICOM Structure Basics
DICOM Structure Basics
Gunjan Patel
 
Ad

Similar to Radiomics Data Management, Computation, and Analysis for QIN F2F 2016 (20)

Public Databases for Radiomics Research: Current Status and Future Directions
Public Databases for Radiomics Research: Current Status and Future DirectionsPublic Databases for Radiomics Research: Current Status and Future Directions
Public Databases for Radiomics Research: Current Status and Future Directions
CancerImagingInforma
 
Crowds Cure Canver: Annotating Data from The Cancer Imaging Archive
Crowds Cure Canver: Annotating Data from The Cancer Imaging ArchiveCrowds Cure Canver: Annotating Data from The Cancer Imaging Archive
Crowds Cure Canver: Annotating Data from The Cancer Imaging Archive
CancerImagingInforma
 
LABORATORY INFORMATION SYSTEM RADIOLOGY INFORMATION SYSTEM
LABORATORY INFORMATION SYSTEM RADIOLOGY INFORMATION SYSTEMLABORATORY INFORMATION SYSTEM RADIOLOGY INFORMATION SYSTEM
LABORATORY INFORMATION SYSTEM RADIOLOGY INFORMATION SYSTEM
Aj Raj
 
2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)
Michael Atkins
 
Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...
lucenerevolution
 
PRISM Project Update
PRISM Project UpdatePRISM Project Update
PRISM Project Update
imgcommcall
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
Michel Dumontier
 
Cancer Analytics Poster
Cancer Analytics PosterCancer Analytics Poster
Cancer Analytics Poster
Michael Atkins
 
Semantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsSemantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical Informatics
Chimezie Ogbuji
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
David Peyruc
 
Building an informatics solution to sustain AI-guided cell profiling with hig...
Building an informatics solution to sustain AI-guided cell profiling with hig...Building an informatics solution to sustain AI-guided cell profiling with hig...
Building an informatics solution to sustain AI-guided cell profiling with hig...
Ola Spjuth
 
Towards Automated AI-guided Drug Discovery Labs
Towards Automated AI-guided Drug Discovery LabsTowards Automated AI-guided Drug Discovery Labs
Towards Automated AI-guided Drug Discovery Labs
Ola Spjuth
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
Ian Foster
 
The Human Variome Database in Australia in 2014 - Graham Taylor
The Human Variome Database in Australia in 2014 - Graham TaylorThe Human Variome Database in Australia in 2014 - Graham Taylor
The Human Variome Database in Australia in 2014 - Graham Taylor
Human Variome Project
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014
Joel Saltz
 
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse EnvironmentsEnabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Luis Marco Ruiz
 
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse EnvironmentsEnabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Luis Marco Ruiz
 
FedCentric_Presentation
FedCentric_PresentationFedCentric_Presentation
FedCentric_Presentation
Yatpang Cheung
 
Translational Biomedical Informatics 2010: Infrastructure and Scaling
Translational Biomedical Informatics 2010: Infrastructure and ScalingTranslational Biomedical Informatics 2010: Infrastructure and Scaling
Translational Biomedical Informatics 2010: Infrastructure and Scaling
The Ohio State University Wexner Medical Center
 
CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECAProject
 
Public Databases for Radiomics Research: Current Status and Future Directions
Public Databases for Radiomics Research: Current Status and Future DirectionsPublic Databases for Radiomics Research: Current Status and Future Directions
Public Databases for Radiomics Research: Current Status and Future Directions
CancerImagingInforma
 
Crowds Cure Canver: Annotating Data from The Cancer Imaging Archive
Crowds Cure Canver: Annotating Data from The Cancer Imaging ArchiveCrowds Cure Canver: Annotating Data from The Cancer Imaging Archive
Crowds Cure Canver: Annotating Data from The Cancer Imaging Archive
CancerImagingInforma
 
LABORATORY INFORMATION SYSTEM RADIOLOGY INFORMATION SYSTEM
LABORATORY INFORMATION SYSTEM RADIOLOGY INFORMATION SYSTEMLABORATORY INFORMATION SYSTEM RADIOLOGY INFORMATION SYSTEM
LABORATORY INFORMATION SYSTEM RADIOLOGY INFORMATION SYSTEM
Aj Raj
 
2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)
Michael Atkins
 
Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...
lucenerevolution
 
PRISM Project Update
PRISM Project UpdatePRISM Project Update
PRISM Project Update
imgcommcall
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
Michel Dumontier
 
Cancer Analytics Poster
Cancer Analytics PosterCancer Analytics Poster
Cancer Analytics Poster
Michael Atkins
 
Semantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsSemantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical Informatics
Chimezie Ogbuji
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
David Peyruc
 
Building an informatics solution to sustain AI-guided cell profiling with hig...
Building an informatics solution to sustain AI-guided cell profiling with hig...Building an informatics solution to sustain AI-guided cell profiling with hig...
Building an informatics solution to sustain AI-guided cell profiling with hig...
Ola Spjuth
 
Towards Automated AI-guided Drug Discovery Labs
Towards Automated AI-guided Drug Discovery LabsTowards Automated AI-guided Drug Discovery Labs
Towards Automated AI-guided Drug Discovery Labs
Ola Spjuth
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
Ian Foster
 
The Human Variome Database in Australia in 2014 - Graham Taylor
The Human Variome Database in Australia in 2014 - Graham TaylorThe Human Variome Database in Australia in 2014 - Graham Taylor
The Human Variome Database in Australia in 2014 - Graham Taylor
Human Variome Project
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014
Joel Saltz
 
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse EnvironmentsEnabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Luis Marco Ruiz
 
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse EnvironmentsEnabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Luis Marco Ruiz
 
FedCentric_Presentation
FedCentric_PresentationFedCentric_Presentation
FedCentric_Presentation
Yatpang Cheung
 
CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECAProject
 
Ad

Recently uploaded (20)

Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
C++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptxC++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptx
aquibnoor22079
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Conic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptxConic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptx
taiwanesechetan
 
VKS-Python Basics for Beginners and advance.pptx
VKS-Python Basics for Beginners and advance.pptxVKS-Python Basics for Beginners and advance.pptx
VKS-Python Basics for Beginners and advance.pptx
Vinod Srivastava
 
computer organization and assembly language.docx
computer organization and assembly language.docxcomputer organization and assembly language.docx
computer organization and assembly language.docx
alisoftwareengineer1
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
Deloitte - A Framework for Process Mining Projects
Deloitte - A Framework for Process Mining ProjectsDeloitte - A Framework for Process Mining Projects
Deloitte - A Framework for Process Mining Projects
Process mining Evangelist
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
Defense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptxDefense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
Customer Segmentation using K-Means clustering
Customer Segmentation using K-Means clusteringCustomer Segmentation using K-Means clustering
Customer Segmentation using K-Means clustering
Ingrid Nyakerario
 
Simple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptxSimple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptx
ssuser2aa19f
 
Developing Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response ApplicationsDeveloping Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response Applications
VICTOR MAESTRE RAMIREZ
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
VKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptxVKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptx
Vinod Srivastava
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTemplate_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
cegiver630
 
Data Science Courses in India iim skills
Data Science Courses in India iim skillsData Science Courses in India iim skills
Data Science Courses in India iim skills
dharnathakur29
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
C++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptxC++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptx
aquibnoor22079
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Conic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptxConic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptx
taiwanesechetan
 
VKS-Python Basics for Beginners and advance.pptx
VKS-Python Basics for Beginners and advance.pptxVKS-Python Basics for Beginners and advance.pptx
VKS-Python Basics for Beginners and advance.pptx
Vinod Srivastava
 
computer organization and assembly language.docx
computer organization and assembly language.docxcomputer organization and assembly language.docx
computer organization and assembly language.docx
alisoftwareengineer1
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
Deloitte - A Framework for Process Mining Projects
Deloitte - A Framework for Process Mining ProjectsDeloitte - A Framework for Process Mining Projects
Deloitte - A Framework for Process Mining Projects
Process mining Evangelist
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
Defense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptxDefense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
Customer Segmentation using K-Means clustering
Customer Segmentation using K-Means clusteringCustomer Segmentation using K-Means clustering
Customer Segmentation using K-Means clustering
Ingrid Nyakerario
 
Simple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptxSimple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptx
ssuser2aa19f
 
Developing Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response ApplicationsDeveloping Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response Applications
VICTOR MAESTRE RAMIREZ
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
VKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptxVKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptx
Vinod Srivastava
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTemplate_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
cegiver630
 
Data Science Courses in India iim skills
Data Science Courses in India iim skillsData Science Courses in India iim skills
Data Science Courses in India iim skills
dharnathakur29
 

Radiomics Data Management, Computation, and Analysis for QIN F2F 2016

  • 1. 1 Large Scale Data Management, Computation, and Analysis for Quantitative Imaging Research Ashish Sharma, Ph.D. Department of Biomedical Informatics, Emory University Fred Prior, Ph.D. Department of Biomedical Informatics, Univ. of Arkansas for Medical Sciences
  • 2. 2 High Quality Data The Role of Data Curation Data Dump or Information Resource
  • 3. 3 Challenges to Advances in Quantitative Imaging “Data! Data! Data!” he cried impatiently. “I can’t make bricks without clay.” — Sherlock Holmes • Large, well-curated, repositories • High Quality Data • Radiology, Pathology, RT • Clinical, Genomic • Derived Feature Sets • Security and Access Control • Flexible Access • Co-Located Computation • Reproducible pipelines • Systematic, Reproducible tools for imaging phenotype/biomarker exploration
  • 4. 4 Resources Provided by the UAMS + Emory QIN Features
  • 5. 5 Built on The Cancer Imaging Archive TCIA encourages and supports cancer- related open science communities by hosting and managing the data archive,, and relevant resources to facilitate collaborative research.
  • 6. 6 RT Collections and Curation Process Partial collections: • RTOG 0522 • NRG-1106 (private) • NRG-1308 (Private) TCGA-HNSC Additional collections in process or planned
  • 7. 7 Automated RT, Image Set Curation Individual  Bulk Correction Reduce the time needed to correct data inconsistencies Curation Time: Days  few minutes (15 Collections  120K images) Extend POSDA to all curation related checks and activities
  • 8. 8 Digital Pathology caMicroscope • Visualization of Pathology Images and Features • Data Management • Native Support for Scanner Format (Scanner  caMic) • Dynamic Services (launch analysis from browser) • Scientific Mashups (DataScope and DataCafe) Supported by NCIP/CTIIP Project; ITCR U24
  • 9. 9 Pragmatic Randomized Trial of Proton versus Photon Therapy for Locally Advanced Breast Cancer • Treatment plans for Photon (left) and Proton (right) Adjuvant therapy for Lung cancer • Dose Value Histogram Demonstrates that with IMRT, 50% of the heart receives ~ 36 Gy, while proton delivers < 3Gy. Dose Value Histogram Intensity Modulated Radiation Therapy Proton Therapy
  • 10. 10 Data Access REST APIs to access TCIA Data APIs to access protected data (coming soon) Mashups from fusion of diverse datasets to support • Data exploration • Hypothesis formulation • Discovery of latent relationships and patterns
  • 11. 11 Reproducible Science Incentives for data sharing TCIA assigns DOIs for individual collections as well as data subsets that were used in publications One click access to data via DOI Use this DOI to get a Nature Scientific Data publication Will be developing mechanisms to track usage
  • 13. 13 Radiomics Portal (Galaxy + Lessons from Connectome Workbench) Fast access to co-located data Imaging, RT, Radiomic Features (TCIA) Clinical and other structured (i2b2/Eureka!) Lung Segmentation Pilot Tool Dissemination
  • 14. 14 Lung Analysis Pipeline 1. Segment lung field from CT 2. Use three parallel segmentation algorithms to identify all closed structures 3. Voting algorithm produces final segmented regions 4. Feature Extraction (shape, texture, margin,...) 5. Feature Classification (optional) 6. Image-based phenotype Top, Green  Expert Manual Segmentation Bottom, Blue  Automated Segmentation The results of three independent automatic segmentation algorithms are combined using a voting mechanism to arrive at final automatic segmentation.
  • 15. 15 Biomarker Exploration Pipeline • Data Loading: Radiomics data is integrated into Eureka! Custom sources can be incorporated via REST APIs • Construct Radiomic Phenotypes: Using the Eureka GUI, users can explore their radiomics data and construct radiomic phenotypes, that incorporate clinical, imaging and computed features. These phenotypes are captured as graphs and stored in a graph database (Neo4J) i2b2 TCIA MongoDB & Others… Radiomic Phenotypes
  • 16. 16 Eureka! Clinical Analytics • Open source (Apache 2 license) RESTful clinical phenotyping system Clinical phenotyping: computing temporal patterns from EHR data reflecting disease, therapeutic response and prognosis • Supports loading data and computed phenotypes from data warehouses and flat files into i2b2, Neo4j and flat files • One REST service + web interface controlling many ETL processes setup with configurable metadata stored in an extended i2b2 metadata schema
  • 17. 17 Use Cases • Construct phenotypes by fusing clinical and derived feature data • Compute phenotypes on available data • Ongoing pilot with NLST data • LungRADS • Study changes to LungRADS screening recommendations by incorporating demographics
  • 18. 18 Summary • TCIA is a stable yet continually evolving and expanding information resource for QIN and other Cancer Imaging initiatives • Radiomic & Radiogenomic analyses require information fusion and HPC pipelines to deal with large subject populations and advanced computationally intensive algorithms • We are exploring the potential of OODA graph statistics for multi-parametric biomarker discovery and validation.
  • 19. • NCI QIN: Resources for development and validation of Radiomic Analyses & Adaptive Therapy — PI: Prior, Sharma (UAMS, Emory) • NCIP/Leidos caMicroscope — A Digital Pathology Integrative Query System; PI: Sharma • PCORI, Pragmatic Randomized Trial of Proton versus Photon Therapy for Locally Advanced Breast Cancer to compare the effectiveness of photon vs. proton Radiotherapy. (PI: Bekelman) • Leidos Biomedical Research, Contract 16X011 for NCI, Maintenance and Extension of The Cancer Imaging Archive (TCIA), (PI: Prior)

Editor's Notes