SlideShare a Scribd company logo
Pathomics, Clinical Studies, and Cancer Surveillance
Joel Saltz MD, PhD
Chair and Distinguished Professor Department of Biomedical Informatics
Professor Department of Pathology
Cherith Endowed Chair
Stony Brook University
HIMA Imaging Science
Pathology Informatics Summit 2022
• No Conflicts to Disclose
Roadmap
• Pathomics and Clinical Research
• Cell Detection, Classification
• Scalability
• FDA
• Modeling human attention
Ingredients of Computational Pathology Biomarkers
• Every diagnostic WSI
contains vast amounts of
data
– Tumor, lymphocyte
infiltration
– Cells – normal, dysplastic,
lymphocytes, necrotic
– Spatial distribution and
morphology of glands,
ducts
– Collagen orientation
Predict treatment
response and outcome
Patient stratification
Treatment selection
Cancer Epidemiology
Computational Pathology
• Just as in flow cytometry cells
are interrogated counted
clustered and classified, using
the newfound ability to
segment and classify cells
enables a spatially and formed
type of highly granular highly
quantitative characterization
of tissue.
• In this case the challenge is not to
reproduce what a human
pathologist chooses as a
classification but instead to use a
highly detailed set of labeled
building blocks to build new
biomarkers
Potential Clinical Applications of AI based TILs
Methods
TILs to predict prognosis (treatment response
and survival)
Spatial patterns of distribution of TILs maps to
ascertain the functional immune status of the
tumor microenvironment
Combine diagnostic criteria and TILs to stratify
patients, guide clinical management, and
select therapy (e.g. immunotherapy)
Collaboration with ECOG-ACRIN to bring AI to
Clinical Trials
Participating SEER Registries: New
Jersey, Kentucky, Georgia, New
York
Collaboration with SEER Registries to Bring AI Pathology to
Surveillance and to Create Real World Clinical Research Datasets
Pathomics, Clinical Studies, and Cancer Surveillance
● Deep learning based computational stain for TILS
●TIL patterns generated from 4,759 TCGA subjects (5,202 H&E slides),
13 cancer types
●Detected TILs correlate with pathologist eye and molecular estimates
●TIL patterns linked to tumor and immune molecular features, cancer
type, and outcome
●Now refined and extended to roughly 20 cancer types
https://github.com/ShahiraAbousamra/til_classification.
Pathomics tissue analytics for
Precision Medicine
(1) Tumor-TILs analyses
(2) High-resolution detection
and classification of tumor cells,
lymphocytes, and stromal cells
in the entirety of whole slide
images
(3) Support the scoring of the
number of TILs in microscopic
regions of interest chosen by
pathologists
(4) Analysis of cell composition
and features in different tumor
niches
Legend
red = lymphocytes
blue = stromal cells
green = tumor cells
TIL Pattern Descriptions
Quantitative – Arvind Rao
• Agglomerative clustering
• Cluster indices representing cluster
number, density, cluster size, distance
between clusters
• Traditional spatial statistics measures
• R package clusterCrit by Bernard
Desgraupes - Ball-Hall, Banfield-
Raftery, C Index, and Determinant
Ratio indices
Qualitative (Alex Lazar, Raj Gupta)
‘‘Brisk, diffuse’’ diffusely
infiltrative TILs scattered
throughout at least 30%
of the area of the tumor
(1,856 cases);
‘‘Brisk, band-like’’ - band-
like boundaries bordering
the tumor at its
periphery (1,185);
‘‘Nonbrisk, multi-focal’’
loosely scattered TILs
present in less
than 30% but more than
5% of the area of the
tumor (1,083);
‘‘Non-brisk, focal’’ for
TILs scattered throughout
less than 5% but greater
than 1% of the area of
the tumor (874);
‘‘None’’ < 1% TILS - in 143
cases
TIL Pattern Characterization
Cutaneous Malignant Melanoma,TIL map with “Brisk, Band-like” structural pattern
Cutaneous Malignant Melanoma,TIL map with “Brisk, Band-
like” structural pattern
”
Squamous Cell Carcinoma of the Lung, TIL map with “Brisk,
Diffuse” structural pattern
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer Surveillance
Breast Cancer TILS: TCGA and Carolina Breast
Cancer Study
Pathomics, Clinical Studies, and Cancer Surveillance
Carolina Breast Study – TIS and Subtype
Carolina Breast Cancer Study – “Risky Features”
Recurrence in patients with
2 or more high risk features
Ø Low intra-tumoral
strength
Ø TIL deserts
Ø Absence of TIL forests
Ø Low peritumoral strength
Ø Lbsence of lymphoid
aggregates
Similar results for TCGA
Roadmap
• Pathomics and Clinical Research
• Cell Detection, Classification
• Scalability
• FDA
• Modeling human attention
Spatial Contexts -- Cell Detection and Classification
• Classification accuracy is frequently context sensitive
• Training on new tissue types and new cell categories is time
consuming
• Method detects and classifies nuclei
• Training requires “dotting” nuclei
Spatial Contexts -- Nuclear Detection and Classification
International
Conference on
Computer Vision
Github -
https://github.com/Top
oXLab/MCSpatNet
Pipeline encompasses cell detection, cell classifier
and learning category specific spatial statistics
Spatial Statistical Learning -- Cells of a feather flock
together
Roadmap
• Pathomics and Clinical Research
• Cell Detection, Classification
• Scalability
• FDA
• Modeling human attention
A Framework for Automating I/O of Deep
Learning Methods in Biomedical Imaging
Applications
Ana Gainaru, Scott Klasky, Dmitry Ganushin, Tahsin Kurc, Joel Saltz
Goals
• Framework for automating AI workflows by separating the data and
computational planes and offloading the I/O management to
specialized processes
• AI applications often involve complex workflows involving large
datasets, multiple deep learning networks and multiple sets of hyper-
parameters
• Specific focus on applications that involve analysis of very large
imagery or simulation output datasets
• Immediate target is analysis of large collections of gigapixel
Pathology whole slide images
Community Challenge – Breast Cancer Tumor/TIL AI
Methods
Ensemble of 9 models
Input Size
UNet Encoder
512 x 512 pixels 640 x 640 768 x 768
EfficientNet-B2 ✓ ✓ ✓
ResNeXt-50-32x4d ✓ ✓ ✓
SEResNeXt-50-32x4d ✓ ✓ ✓
Jakub Kaczmarzyk
Sweep over hyperparameters (each line is one model)
Deep Learning Workflow Framework
●The framework is separated from AI applications
●Coordinates data reads and pre-processing
associated with concurrently running applications
●The framework reuses the data loaded from
storage for all the applications that require it and
keeps track of the models generated
●I/O framework will be able to manage data
transport and storage in a more efficient manner
●Layered on Oak Ridge National Laboratory
ADIOS-2 framework
Two algorithm ensemble - TIL analysis of 28,379 TCGA WSIs in 1 hour
and 20 minutes on 6000 GPUs (500 nodes) on ORNL Summit
Roadmap
• Pathomics and Clinical Research
• Nuclear Detection, Classification
• Scalability
• FDA
• Modeling human attention
AI Pathology Algorithm Validation FDA Led Collaborations
Dataset of pathologist annotations for algorithms that process whole slide images
• Researchers from FDA, academic colleagues in
collaboration with the Alliance for Digital
Pathology, are collecting pathologist
annotations as data for AI/ML algorithm
validation
• Application - tumor infiltrating lymphocyte
(TIL) detection and quantitation..
• Training materials and workflows to
crowdsource pathologist image annotations on
two modes: an optical microscope and two
digital platforms.
• The microscope platform allows the same ROIs
to be evaluated in both modes.
• Pursue an FDA Medical Device Development
Tool Qualification
Validating Artificial Intelligence Methods for Clinical Use
A Pathologist-Annotated Dataset for Validating
Artificial Intelligence: A Description and Pilot
Study
Roadmap
• Pathomics and Clinical Research
• Cell Detection, Classification
• Scalability
• FDA
• Modeling human attention
Visual Attention Analysis of Pathologists Examining Whole
Slide Images of Cancer
• Multidisciplinary collaboration:
Biomedical Informatics,
Computer Science, Psychology,
Pathology
• Souradeep Chakraborty, Ke Ma,
Rajarsi Gupta, Beatrice Knudsen,
Gregory J. Zelinsky, Joel Saltz,
Dimitris Samaras
• Stony Brook, University of Utah
➢ Understand and model relationship between human
attention and Pathologist classification
➢ Model the spatiotemporal distribution of human visual
attention
➢ Employ attention models to improve precision of AI
Pathology tasks
GOAL: Study pathologists’ attention as they examine whole slide images of prostate cancer tissue
H&E
image
Tumor
segmentation
Visual attention
heatmap
Figure: Processing of the captured attention data to obtain visual attention heatmaps and scanpaths
Data processing
Figure: Processing of the captured attention data to obtain visual attention heatmaps and scanpaths
Data processing
Figure: Processing of the captured attention data to obtain visual attention heatmaps and scanpaths
Data processing
Proposed model for attention prediction
Figure: Our model, ProstAttNet (Prostate Attention Net) for predicting visual attention on WSI patches
➢ We formulate attention prediction as a classification task
○ Goal: classify a WSI patch into one of the N attention intensity bins (N = 5 in our study).
Ø We construct the final attention heatmap by assembling the predicted patch-wise heatmaps followed by
gaussian smoothing and map normalization
Results: Qualitative evaluation
Figure: Comparison of the predicted attention heatmap using our ProstAttNet model with the ground truth
segmentation map and the ground truth attention heatmap
Ø Test dataset à 17 whole slide images (from the TCGA-PRAD dataset)
Ø The predicted attention heatmap correlates well with the ground truth attention heatmap and the ground truth tumor
segmentations
Thanks!
• Pathomics and Clinical Research
• Cell Detection, Classification
• Scalability
• FDA
• Modeling human attention
Stony Brook Deep Learning Pathology Faculty
Dimitis Samaras, Chao Chen, Tahsin Kurc, Prateek Prasanna, Raj Gupta
Vu Nguyen– Graduate Student
Computer Science
Anne Zhao – Assistant
Professor, Dept Pathology
Stony Brook
Raj Gupta – Department of
BMI Stony Brook
Deep Learning and
Tissue
Characterization
Trainee Team circa
2018
Le Hou– software engineer,
google core, Mountain View
Han Le– Software Development
Engineer II, Amazon, Seattle
The Stony Brook Multi-Scale Student Group:
David Belinsky, Mahmudul Hasan
Prantik Howlader
SEER UG3 Team
• Stony Brook
– Joel Saltz MD, PhD
– Tahsin Kurc PhD
– Dimitris Samara PhD
– Erich Bremer
– Le Hou
– Shahira Abousamra
– Raj Gupta
– Han Le
– Bridge Wang
• Emory
– Ashish Sharma PhD
– Ryan Birmingham
– Nan Li
• Georgia SEER Registry
– Kevin Ward MPH, PhD
• Rutgers
– David J. Foran PhD
– Evita Sadimin, MD
– Wenjin Chen, PhD
– Doreen Loh M.S.
– Jian Ren, Ph.D
– Christine Minerowicz, MD
• Rutgers SEER/ NJ State Cancer
Registry
– Antoinette Stroup, PhD
– Adrian Botchway, CTR
– Gerald Harris, PhD
• University Kentucky; Kentucky
SEER Registry
– Eric B. Durbin, DrPH, MS
– Isaac Hands, MPH
– John Williams, MA
– Justin Levens
QuIP ITCR Team
Stony Brook University
Joel Saltz
Tahsin Kurc
Raj Gupta
Dimitris Samaras
Erich Bremer
Fusheng Wang
Tammy DiPrima
Le Hou
NCI / DCEG
Jonas S Almeida
Emory University
Ashish Sharma
Ryan Birmingham
Nan Li
University of Tennessee
Knoxville
Jeremy Logan
Scott Klasky
Harvard University
Rick Cummings
ITCR/IMAT Supplement
Richard Levinson
Raj Gupta
Tahsin Kurc
Joel Saltz
Han Le
Maozheng Zhao
Dimitris Samaras
50
The PRISM Team
• Fred Prior, PhD
• Jonathan Bona, PhD
• Kirk Smith
• Lawrence Tarbox, PhD
• Mathias Brochhausen, PhD
• Roosevelt Dobbins
• Tracy Nolan
• William Bennett
• Ashish Sharma, PhD
• Annie Gu
• Mohanapriya Narapareddy
• Monjoy Saha, PhD
• Pradeeban Kathiravelu, PhD
• Joel Saltz, MD, PhD
• Erich Bremer
• Rajrisi Gupta MD
• Tahsin Kurc, PhD
• Tammy DiPrima
• TJ Fitzgerald, MD
• Fran Laurie
Thanks - Funding
• U24CA180924, U24CA215109, NCIP/Leidos 14X138 and
HHSN261200800001E, UG3CA225021-01 from the NCI;
R01LM011119-01 and R01LM009239 from the NLM, Award
2123920 from the National Science Foundation Bob Beals
and Betsy Barton, Stony Brook Mount Sinai Seed Funding,
Pancreatic Cancer Action Network
• This research used resources provided by the National
Science Foundation XSEDE Science Gateways program and
the Pittsburgh Supercomputer Center BRIDGES-AI facility

More Related Content

PPTX
Digital Pathology, FDA Approval and Precision Medicine
PDF
Twenty Years of Whole Slide Imaging - the Coming Phase Change
PPTX
Pathomics Based Biomarkers, Tools, and Methods
PPTX
AI and whole slide imaging biomarkers
PPTX
Bioinformatics-R program의 실례
PPTX
Twenty Years of Whole Slide Imaging - the Coming Phase Change
PDF
FRACTAL PARAMETERS OF TUMOUR MICROSCOPIC IMAGES AS PROGNOSTIC INDICATORS OF C...
PDF
Fractal Parameters of Tumour Microscopic Images as Prognostic Indicators of C...
Digital Pathology, FDA Approval and Precision Medicine
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Pathomics Based Biomarkers, Tools, and Methods
AI and whole slide imaging biomarkers
Bioinformatics-R program의 실례
Twenty Years of Whole Slide Imaging - the Coming Phase Change
FRACTAL PARAMETERS OF TUMOUR MICROSCOPIC IMAGES AS PROGNOSTIC INDICATORS OF C...
Fractal Parameters of Tumour Microscopic Images as Prognostic Indicators of C...

Similar to Pathomics, Clinical Studies, and Cancer Surveillance (20)

PPTX
Artificial Intelligence in pathology
PPTX
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
PPTX
radiotherapy.pptx
PPTX
ICBO 2014, October 8, 2014
PPTX
Second Review Research in quantifying lymphocytes.pptx
PDF
NEUROSENTINEL PRODIGY (Brain Tumor) using CNN
PPTX
Brain tumor detection and segmentation using deep learning
PPTX
Prediction for breast cancer using various machine learning algorithms
PPTX
High Dimensional Fused-Informatics
PDF
Computational Pathology Workshop July 8 2014
PDF
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
PPTX
morphometry.pptx
PDF
Challenges and opportunities for machine learning in biomedical research
PPTX
Federal Research & Development for the Florida system Sept 2014
PDF
IRJET- Breast Cancer Detection from Histopathology Images: A Review
PPTX
Data Science, Big Data and You
PPTX
u-Breast Cancer Detection Using Deep Learning.pptx
PPTX
Information Technology and Radiology: challenges and future perspectives
PPTX
UNMSymposium2014
PPTX
Pathomics Based Biomarkers and Precision Medicine
Artificial Intelligence in pathology
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
radiotherapy.pptx
ICBO 2014, October 8, 2014
Second Review Research in quantifying lymphocytes.pptx
NEUROSENTINEL PRODIGY (Brain Tumor) using CNN
Brain tumor detection and segmentation using deep learning
Prediction for breast cancer using various machine learning algorithms
High Dimensional Fused-Informatics
Computational Pathology Workshop July 8 2014
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
morphometry.pptx
Challenges and opportunities for machine learning in biomedical research
Federal Research & Development for the Florida system Sept 2014
IRJET- Breast Cancer Detection from Histopathology Images: A Review
Data Science, Big Data and You
u-Breast Cancer Detection Using Deep Learning.pptx
Information Technology and Radiology: challenges and future perspectives
UNMSymposium2014
Pathomics Based Biomarkers and Precision Medicine
Ad

More from Joel Saltz (20)

PPTX
Learning, Training,  Classification,  Common Sense and Exascale Computing
PDF
Integrative Everything, Deep Learning and Streaming Data
PPTX
Machine Learning and Deep Contemplation of Data
PDF
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
PDF
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
PDF
Generation and Use of Quantitative Pathology Phenotype
PDF
Big Data and Extreme Scale Computing
PDF
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
PPT
Exascale Computing and Experimental Sensor Data
PDF
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
PPTX
Data and Computational Challenges in Integrative Biomedical Informatics
PPTX
Integrative Multi-Scale Analyses
PPTX
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
PPTX
Role of Biomedical Informatics in Translational Cancer Research
PPTX
Extreme Spatio-Temporal Data Analysis
PPTX
MICCAI - Workshop on High Performance and Distributed Computing for Medical I...
PPTX
Presentation at UHC Annual Meeting
PPTX
Indiana 4 2011 Final Final
PDF
Wci Pop Sci Feb 2011
PPTX
Actsi bip overview jan 2011
Learning, Training,  Classification,  Common Sense and Exascale Computing
Integrative Everything, Deep Learning and Streaming Data
Machine Learning and Deep Contemplation of Data
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
Generation and Use of Quantitative Pathology Phenotype
Big Data and Extreme Scale Computing
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
Exascale Computing and Experimental Sensor Data
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Data and Computational Challenges in Integrative Biomedical Informatics
Integrative Multi-Scale Analyses
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
Role of Biomedical Informatics in Translational Cancer Research
Extreme Spatio-Temporal Data Analysis
MICCAI - Workshop on High Performance and Distributed Computing for Medical I...
Presentation at UHC Annual Meeting
Indiana 4 2011 Final Final
Wci Pop Sci Feb 2011
Actsi bip overview jan 2011
Ad

Recently uploaded (20)

PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Big Data Technologies - Introduction.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPT
Teaching material agriculture food technology
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Empathic Computing: Creating Shared Understanding
PDF
NewMind AI Weekly Chronicles - August'25 Week I
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Cloud computing and distributed systems.
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
KodekX | Application Modernization Development
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
Diabetes mellitus diagnosis method based random forest with bat algorithm
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Big Data Technologies - Introduction.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
Teaching material agriculture food technology
sap open course for s4hana steps from ECC to s4
Empathic Computing: Creating Shared Understanding
NewMind AI Weekly Chronicles - August'25 Week I
The AUB Centre for AI in Media Proposal.docx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Cloud computing and distributed systems.
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Network Security Unit 5.pdf for BCA BBA.
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
KodekX | Application Modernization Development
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Review of recent advances in non-invasive hemoglobin estimation

Pathomics, Clinical Studies, and Cancer Surveillance

  • 1. Pathomics, Clinical Studies, and Cancer Surveillance Joel Saltz MD, PhD Chair and Distinguished Professor Department of Biomedical Informatics Professor Department of Pathology Cherith Endowed Chair Stony Brook University HIMA Imaging Science Pathology Informatics Summit 2022
  • 2. • No Conflicts to Disclose
  • 3. Roadmap • Pathomics and Clinical Research • Cell Detection, Classification • Scalability • FDA • Modeling human attention
  • 4. Ingredients of Computational Pathology Biomarkers • Every diagnostic WSI contains vast amounts of data – Tumor, lymphocyte infiltration – Cells – normal, dysplastic, lymphocytes, necrotic – Spatial distribution and morphology of glands, ducts – Collagen orientation Predict treatment response and outcome Patient stratification Treatment selection Cancer Epidemiology
  • 5. Computational Pathology • Just as in flow cytometry cells are interrogated counted clustered and classified, using the newfound ability to segment and classify cells enables a spatially and formed type of highly granular highly quantitative characterization of tissue. • In this case the challenge is not to reproduce what a human pathologist chooses as a classification but instead to use a highly detailed set of labeled building blocks to build new biomarkers
  • 6. Potential Clinical Applications of AI based TILs Methods TILs to predict prognosis (treatment response and survival) Spatial patterns of distribution of TILs maps to ascertain the functional immune status of the tumor microenvironment Combine diagnostic criteria and TILs to stratify patients, guide clinical management, and select therapy (e.g. immunotherapy)
  • 7. Collaboration with ECOG-ACRIN to bring AI to Clinical Trials
  • 8. Participating SEER Registries: New Jersey, Kentucky, Georgia, New York Collaboration with SEER Registries to Bring AI Pathology to Surveillance and to Create Real World Clinical Research Datasets
  • 10. ● Deep learning based computational stain for TILS ●TIL patterns generated from 4,759 TCGA subjects (5,202 H&E slides), 13 cancer types ●Detected TILs correlate with pathologist eye and molecular estimates ●TIL patterns linked to tumor and immune molecular features, cancer type, and outcome ●Now refined and extended to roughly 20 cancer types
  • 12. Pathomics tissue analytics for Precision Medicine (1) Tumor-TILs analyses (2) High-resolution detection and classification of tumor cells, lymphocytes, and stromal cells in the entirety of whole slide images (3) Support the scoring of the number of TILs in microscopic regions of interest chosen by pathologists (4) Analysis of cell composition and features in different tumor niches Legend red = lymphocytes blue = stromal cells green = tumor cells
  • 13. TIL Pattern Descriptions Quantitative – Arvind Rao • Agglomerative clustering • Cluster indices representing cluster number, density, cluster size, distance between clusters • Traditional spatial statistics measures • R package clusterCrit by Bernard Desgraupes - Ball-Hall, Banfield- Raftery, C Index, and Determinant Ratio indices Qualitative (Alex Lazar, Raj Gupta) ‘‘Brisk, diffuse’’ diffusely infiltrative TILs scattered throughout at least 30% of the area of the tumor (1,856 cases); ‘‘Brisk, band-like’’ - band- like boundaries bordering the tumor at its periphery (1,185); ‘‘Nonbrisk, multi-focal’’ loosely scattered TILs present in less than 30% but more than 5% of the area of the tumor (1,083); ‘‘Non-brisk, focal’’ for TILs scattered throughout less than 5% but greater than 1% of the area of the tumor (874); ‘‘None’’ < 1% TILS - in 143 cases
  • 14. TIL Pattern Characterization Cutaneous Malignant Melanoma,TIL map with “Brisk, Band-like” structural pattern Cutaneous Malignant Melanoma,TIL map with “Brisk, Band- like” structural pattern ” Squamous Cell Carcinoma of the Lung, TIL map with “Brisk, Diffuse” structural pattern
  • 17. Breast Cancer TILS: TCGA and Carolina Breast Cancer Study
  • 19. Carolina Breast Study – TIS and Subtype
  • 20. Carolina Breast Cancer Study – “Risky Features” Recurrence in patients with 2 or more high risk features Ø Low intra-tumoral strength Ø TIL deserts Ø Absence of TIL forests Ø Low peritumoral strength Ø Lbsence of lymphoid aggregates Similar results for TCGA
  • 21. Roadmap • Pathomics and Clinical Research • Cell Detection, Classification • Scalability • FDA • Modeling human attention
  • 22. Spatial Contexts -- Cell Detection and Classification • Classification accuracy is frequently context sensitive • Training on new tissue types and new cell categories is time consuming • Method detects and classifies nuclei • Training requires “dotting” nuclei
  • 23. Spatial Contexts -- Nuclear Detection and Classification International Conference on Computer Vision Github - https://github.com/Top oXLab/MCSpatNet
  • 24. Pipeline encompasses cell detection, cell classifier and learning category specific spatial statistics
  • 25. Spatial Statistical Learning -- Cells of a feather flock together
  • 26. Roadmap • Pathomics and Clinical Research • Cell Detection, Classification • Scalability • FDA • Modeling human attention
  • 27. A Framework for Automating I/O of Deep Learning Methods in Biomedical Imaging Applications Ana Gainaru, Scott Klasky, Dmitry Ganushin, Tahsin Kurc, Joel Saltz
  • 28. Goals • Framework for automating AI workflows by separating the data and computational planes and offloading the I/O management to specialized processes • AI applications often involve complex workflows involving large datasets, multiple deep learning networks and multiple sets of hyper- parameters • Specific focus on applications that involve analysis of very large imagery or simulation output datasets • Immediate target is analysis of large collections of gigapixel Pathology whole slide images
  • 29. Community Challenge – Breast Cancer Tumor/TIL AI Methods
  • 30. Ensemble of 9 models Input Size UNet Encoder 512 x 512 pixels 640 x 640 768 x 768 EfficientNet-B2 ✓ ✓ ✓ ResNeXt-50-32x4d ✓ ✓ ✓ SEResNeXt-50-32x4d ✓ ✓ ✓ Jakub Kaczmarzyk
  • 31. Sweep over hyperparameters (each line is one model)
  • 32. Deep Learning Workflow Framework ●The framework is separated from AI applications ●Coordinates data reads and pre-processing associated with concurrently running applications ●The framework reuses the data loaded from storage for all the applications that require it and keeps track of the models generated ●I/O framework will be able to manage data transport and storage in a more efficient manner ●Layered on Oak Ridge National Laboratory ADIOS-2 framework Two algorithm ensemble - TIL analysis of 28,379 TCGA WSIs in 1 hour and 20 minutes on 6000 GPUs (500 nodes) on ORNL Summit
  • 33. Roadmap • Pathomics and Clinical Research • Nuclear Detection, Classification • Scalability • FDA • Modeling human attention
  • 34. AI Pathology Algorithm Validation FDA Led Collaborations Dataset of pathologist annotations for algorithms that process whole slide images • Researchers from FDA, academic colleagues in collaboration with the Alliance for Digital Pathology, are collecting pathologist annotations as data for AI/ML algorithm validation • Application - tumor infiltrating lymphocyte (TIL) detection and quantitation.. • Training materials and workflows to crowdsource pathologist image annotations on two modes: an optical microscope and two digital platforms. • The microscope platform allows the same ROIs to be evaluated in both modes. • Pursue an FDA Medical Device Development Tool Qualification
  • 35. Validating Artificial Intelligence Methods for Clinical Use A Pathologist-Annotated Dataset for Validating Artificial Intelligence: A Description and Pilot Study
  • 36. Roadmap • Pathomics and Clinical Research • Cell Detection, Classification • Scalability • FDA • Modeling human attention
  • 37. Visual Attention Analysis of Pathologists Examining Whole Slide Images of Cancer • Multidisciplinary collaboration: Biomedical Informatics, Computer Science, Psychology, Pathology • Souradeep Chakraborty, Ke Ma, Rajarsi Gupta, Beatrice Knudsen, Gregory J. Zelinsky, Joel Saltz, Dimitris Samaras • Stony Brook, University of Utah ➢ Understand and model relationship between human attention and Pathologist classification ➢ Model the spatiotemporal distribution of human visual attention ➢ Employ attention models to improve precision of AI Pathology tasks
  • 38. GOAL: Study pathologists’ attention as they examine whole slide images of prostate cancer tissue H&E image Tumor segmentation Visual attention heatmap
  • 39. Figure: Processing of the captured attention data to obtain visual attention heatmaps and scanpaths Data processing
  • 40. Figure: Processing of the captured attention data to obtain visual attention heatmaps and scanpaths Data processing
  • 41. Figure: Processing of the captured attention data to obtain visual attention heatmaps and scanpaths Data processing
  • 42. Proposed model for attention prediction Figure: Our model, ProstAttNet (Prostate Attention Net) for predicting visual attention on WSI patches ➢ We formulate attention prediction as a classification task ○ Goal: classify a WSI patch into one of the N attention intensity bins (N = 5 in our study). Ø We construct the final attention heatmap by assembling the predicted patch-wise heatmaps followed by gaussian smoothing and map normalization
  • 43. Results: Qualitative evaluation Figure: Comparison of the predicted attention heatmap using our ProstAttNet model with the ground truth segmentation map and the ground truth attention heatmap Ø Test dataset à 17 whole slide images (from the TCGA-PRAD dataset) Ø The predicted attention heatmap correlates well with the ground truth attention heatmap and the ground truth tumor segmentations
  • 44. Thanks! • Pathomics and Clinical Research • Cell Detection, Classification • Scalability • FDA • Modeling human attention
  • 45. Stony Brook Deep Learning Pathology Faculty Dimitis Samaras, Chao Chen, Tahsin Kurc, Prateek Prasanna, Raj Gupta
  • 46. Vu Nguyen– Graduate Student Computer Science Anne Zhao – Assistant Professor, Dept Pathology Stony Brook Raj Gupta – Department of BMI Stony Brook Deep Learning and Tissue Characterization Trainee Team circa 2018 Le Hou– software engineer, google core, Mountain View Han Le– Software Development Engineer II, Amazon, Seattle
  • 47. The Stony Brook Multi-Scale Student Group: David Belinsky, Mahmudul Hasan Prantik Howlader
  • 48. SEER UG3 Team • Stony Brook – Joel Saltz MD, PhD – Tahsin Kurc PhD – Dimitris Samara PhD – Erich Bremer – Le Hou – Shahira Abousamra – Raj Gupta – Han Le – Bridge Wang • Emory – Ashish Sharma PhD – Ryan Birmingham – Nan Li • Georgia SEER Registry – Kevin Ward MPH, PhD • Rutgers – David J. Foran PhD – Evita Sadimin, MD – Wenjin Chen, PhD – Doreen Loh M.S. – Jian Ren, Ph.D – Christine Minerowicz, MD • Rutgers SEER/ NJ State Cancer Registry – Antoinette Stroup, PhD – Adrian Botchway, CTR – Gerald Harris, PhD • University Kentucky; Kentucky SEER Registry – Eric B. Durbin, DrPH, MS – Isaac Hands, MPH – John Williams, MA – Justin Levens
  • 49. QuIP ITCR Team Stony Brook University Joel Saltz Tahsin Kurc Raj Gupta Dimitris Samaras Erich Bremer Fusheng Wang Tammy DiPrima Le Hou NCI / DCEG Jonas S Almeida Emory University Ashish Sharma Ryan Birmingham Nan Li University of Tennessee Knoxville Jeremy Logan Scott Klasky Harvard University Rick Cummings ITCR/IMAT Supplement Richard Levinson Raj Gupta Tahsin Kurc Joel Saltz Han Le Maozheng Zhao Dimitris Samaras
  • 50. 50 The PRISM Team • Fred Prior, PhD • Jonathan Bona, PhD • Kirk Smith • Lawrence Tarbox, PhD • Mathias Brochhausen, PhD • Roosevelt Dobbins • Tracy Nolan • William Bennett • Ashish Sharma, PhD • Annie Gu • Mohanapriya Narapareddy • Monjoy Saha, PhD • Pradeeban Kathiravelu, PhD • Joel Saltz, MD, PhD • Erich Bremer • Rajrisi Gupta MD • Tahsin Kurc, PhD • Tammy DiPrima • TJ Fitzgerald, MD • Fran Laurie
  • 51. Thanks - Funding • U24CA180924, U24CA215109, NCIP/Leidos 14X138 and HHSN261200800001E, UG3CA225021-01 from the NCI; R01LM011119-01 and R01LM009239 from the NLM, Award 2123920 from the National Science Foundation Bob Beals and Betsy Barton, Stony Brook Mount Sinai Seed Funding, Pancreatic Cancer Action Network • This research used resources provided by the National Science Foundation XSEDE Science Gateways program and the Pittsburgh Supercomputer Center BRIDGES-AI facility