SlideShare a Scribd company logo
Chemical Interaction Matrix:
Gerald Lushington / LiS Consulting
http://geraldlushington.com / glushington@yahoo.com
Personalized Medicine
Comprehensive Biochemical
& Chemical Biology Understanding
Big data: NGS, medical outcomes, etc.
Personalized Medicine
Comprehensive Biochemical
& Chemical Biology Understanding
Informatics
& Creativity
HTS &
Chemical
Proteomics
Big data: NGS, medical outcomes, etc.
Example Challenges:
●Toxicology: single toxin may modulate several
different biochemical processes
●Cancer: malignant cells have multiple biochemical
sensitivities that may be targeted
●Spectral disorders (e.g., Autism, Alzheimers, etc.):
distinct phenotypes produce similar symptoms
Discovery Paradigm:
Chemical screening prospective hits
Chemical proteomics prospective targets
How to attain comprehensive understanding?
Data Comprehension Reality
TargetsCompounds
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
How to make sense of diffuse multimode
data?
Mechanism of Action (MOA) discovery:
find compound subsets that conserve
common mechanism
Excellent (but imperfect) example:
TEST (Toxicology Estimation Software Tool)
http://www.epa.gov/nrmrl/std/qsar/qsar.html
TEST
Multiple data sets covering toxicity outcomes
for numerous compounds
Predict toxicity of query compounds via on-the-fly
training to similar pre-characterized analogs
TEST
Multiple data sets covering toxicity outcomes
for numerous compounds
Predict toxicity of query compounds via on-the-fly
training to similar pre-characterized analogs
Use Tanimoto distances over molecular
fingerprints: no validated relevance specific
outcomes
Procedure:
1. Assemble Matrix of compounds vs.
activity & features
MOA method: feature / compound selection
Procedure:
1. Assemble Matrix of compounds vs.
activity & features
2. Normalize
MOA method: feature / compound selection
Procedure:
1. Assemble Matrix of compounds vs.
activity & features
2. Normalize
3. Fold activity into features as per:
Ci = |Act* - Xi*|
X values: 0 = perfect correlation
1 = perfect anticorrelation
MOA method: feature / compound selection
Procedure:
1. Assemble Matrix of compounds vs.
activity & features
2. Normalize
3. Fold activity into features as per:
Ci = |Act* - Xi*|
4. Bicluster
MOA method: feature / compound selection
Procedure:
1. Assemble Matrix of compounds vs.
activity & features
2. Normalize
3. Fold activity into features as per:
Ci = |Act* - Xi*|
4. Bicluster
Clusters Contiguous correlative or anticorrelative
regions or matrix
Within clusters: molecules may share
MOA; features may correlate with
activity
Confidence: correlative & predictive
quality of model derived from cluster
MOA method: feature / compound selection
Example: Oral Bioavailability
Oral update depends on:
● Polar solubility
● Membrane permeability
● Interaction with various transporters
Data (from Tingjun Hou): 773 molecules
http://modem.ucsd.edu/adme/databases/databases_bioavailability.htm
Descriptors (from VolSurf and DVS): 298 features
passing information content and linear independence (R < 0.90) filters
Example: Oral Bioavailability
Oral update depends on:
● Polar solubility
● Membrane permeability
● Interaction with various transporters
Data (from Tingjun Hou): 773 molecules
http://modem.ucsd.edu/adme/databases/databases_bioavailability.htm
Descriptors (from VolSurf and DVS): 298 features
passing information content and linear independence (R < 0.90) filters
Preliminary Model (Weka: Bootstrap Aggregating / RepTree):
Q2
(5-fold) = 0.4712
Example: Oral Bioavailability
Oral update depends on:
● Polar solubility
● Membrane permeability
● Interaction with various transporters
Data (from Tingjun Hou): 773 molecules
http://modem.ucsd.edu/adme/databases/databases_bioavailability.htm
Descriptors (from VolSurf and DVS): 298 features
passing information content and linear independence (R < 0.90) filters
Preliminary Model (Weka: Bootstrap Aggregating / RepTree):
Q2
(5-fold) = 0.4712 CFS & RF:
reduced to
27 features
Q2
(5-fold) = 0.4739
Biclustering: Before and After
Clusters as local training sets:
Clusters as local training sets:
Condense to 18 high quality clusters that cover almost
entire training space (omit only 10 of 768 cpds)
Conclusions
Correlative & predictive performance of subset models
gives strong confidence in MOA conservation in
clusters
Head-to-head comparison with chemical proteomics
data should provide strong basis for target
identification
Questions / Suggestions?
glushington@yahoo.com

More Related Content

PPTX
Omic Data Integration Strategies
PPTX
Complex Systems Biology Informed Data Analysis and Machine Learning
PPTX
Data analysis workflows part 1 2015
PPTX
Metabolomic data analysis and visualization tools
PPTX
Data analysis workflows part 2 2015
PPTX
Connecting Metabolomic Data with Context
PPTX
Systems genetics approaches to understand complex traits
PPT
Chemoinformatic
Omic Data Integration Strategies
Complex Systems Biology Informed Data Analysis and Machine Learning
Data analysis workflows part 1 2015
Metabolomic data analysis and visualization tools
Data analysis workflows part 2 2015
Connecting Metabolomic Data with Context
Systems genetics approaches to understand complex traits
Chemoinformatic

What's hot (20)

PPTX
Revealing disease-associated pathways by network integration of untargeted me...
PPSX
STRUCTURE BASED DRUG DESIGN - MOLECULAR MODELLING AND DRUG DISCOVERY
PPT
Multivarite and network tools for biological data analysis
PPTX
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
PPTX
Chemo informatics scope and applications
PPTX
Deep learning based multi-omics integration, a survey
PPT
Docking
PPT
“Docking Studies and Drug Design”
PPTX
Cheminformatics in drug design
PPT
Structure Based Drug Design
PPT
Drug discovery Using Bioinformatic
PPT
DRUG DESIGN BASED ON BIOINFORMATICS TOOLS
PPTX
ABT 609 PPT
PDF
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
PPT
Text and data integration
PPT
Trifunctional cross linker
PPTX
Machine Learning Powered Metabolomic Network Analysis
PDF
Virtual screening
PPT
Molecular modelling for in silico drug discovery
Revealing disease-associated pathways by network integration of untargeted me...
STRUCTURE BASED DRUG DESIGN - MOLECULAR MODELLING AND DRUG DISCOVERY
Multivarite and network tools for biological data analysis
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Chemo informatics scope and applications
Deep learning based multi-omics integration, a survey
Docking
“Docking Studies and Drug Design”
Cheminformatics in drug design
Structure Based Drug Design
Drug discovery Using Bioinformatic
DRUG DESIGN BASED ON BIOINFORMATICS TOOLS
ABT 609 PPT
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
Text and data integration
Trifunctional cross linker
Machine Learning Powered Metabolomic Network Analysis
Virtual screening
Molecular modelling for in silico drug discovery
Ad

Similar to A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action (20)

PDF
Chemical Proteomics Methods and Protocols 1st Edition Marcus Bantscheff (Auth.)
PDF
Technology R&D Theme 1: Differential Networks
PPT
2011-10-11 Open PHACTS at BioIT World Europe
PPT
PPT
Talk at Yale University April 26th 2011: Applying Computational Models for To...
DOCX
Applied Bioinformatics Assignment 5docx
PDF
A Brief Introduction to Metabolomics
PPTX
cadd-191129134050 (1).pptx
PDF
Java tutorial: Programmatic Access to Molecular Interactions
PPTX
Multivariate data analysis
PPT
2011-11-28 Open PHACTS at RSC CICAG
PPTX
Role of bioinformatics of drug designing
PPTX
Drug discovery anthony crasto
PPT
Promiscuous patterns and perils in PubChem and the MLSCN
PDF
Deep learning methods in metagenomics: a review
PDF
AI and Machine Learning for Secondary Metabolite Prediction
PPT
Significance of computational tools in drug discovery
PPTX
Systems Immunology -- 2014
PPTX
Drug discovery hit to lead
PDF
Trial Types_PIIS0092867420302099.pdf
Chemical Proteomics Methods and Protocols 1st Edition Marcus Bantscheff (Auth.)
Technology R&D Theme 1: Differential Networks
2011-10-11 Open PHACTS at BioIT World Europe
Talk at Yale University April 26th 2011: Applying Computational Models for To...
Applied Bioinformatics Assignment 5docx
A Brief Introduction to Metabolomics
cadd-191129134050 (1).pptx
Java tutorial: Programmatic Access to Molecular Interactions
Multivariate data analysis
2011-11-28 Open PHACTS at RSC CICAG
Role of bioinformatics of drug designing
Drug discovery anthony crasto
Promiscuous patterns and perils in PubChem and the MLSCN
Deep learning methods in metagenomics: a review
AI and Machine Learning for Secondary Metabolite Prediction
Significance of computational tools in drug discovery
Systems Immunology -- 2014
Drug discovery hit to lead
Trial Types_PIIS0092867420302099.pdf
Ad

More from Gerald Lushington (6)

PDF
Report ghl20130320
PDF
Gerald Lushington presentation on Biologically Relevant Chemical Diversity An...
PDF
LiS services
PDF
Open source
PDF
Personalized medicine via molecular interrogation, data mining and systems bi...
PDF
Introduction to Data Mining / Bioinformatics
Report ghl20130320
Gerald Lushington presentation on Biologically Relevant Chemical Diversity An...
LiS services
Open source
Personalized medicine via molecular interrogation, data mining and systems bi...
Introduction to Data Mining / Bioinformatics

Recently uploaded (20)

PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Empathic Computing: Creating Shared Understanding
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPT
Teaching material agriculture food technology
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Spectroscopy.pptx food analysis technology
PPTX
Programs and apps: productivity, graphics, security and other tools
Network Security Unit 5.pdf for BCA BBA.
Empathic Computing: Creating Shared Understanding
Mobile App Security Testing_ A Comprehensive Guide.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
A comparative analysis of optical character recognition models for extracting...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Encapsulation_ Review paper, used for researhc scholars
Advanced methodologies resolving dimensionality complications for autism neur...
Building Integrated photovoltaic BIPV_UPV.pdf
Teaching material agriculture food technology
Reach Out and Touch Someone: Haptics and Empathic Computing
MYSQL Presentation for SQL database connectivity
Unlocking AI with Model Context Protocol (MCP)
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Review of recent advances in non-invasive hemoglobin estimation
Spectroscopy.pptx food analysis technology
Programs and apps: productivity, graphics, security and other tools

A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action

  • 1. Chemical Interaction Matrix: Gerald Lushington / LiS Consulting http://geraldlushington.com / [email protected]
  • 2. Personalized Medicine Comprehensive Biochemical & Chemical Biology Understanding Big data: NGS, medical outcomes, etc.
  • 3. Personalized Medicine Comprehensive Biochemical & Chemical Biology Understanding Informatics & Creativity HTS & Chemical Proteomics Big data: NGS, medical outcomes, etc.
  • 4. Example Challenges: ●Toxicology: single toxin may modulate several different biochemical processes ●Cancer: malignant cells have multiple biochemical sensitivities that may be targeted ●Spectral disorders (e.g., Autism, Alzheimers, etc.): distinct phenotypes produce similar symptoms Discovery Paradigm: Chemical screening prospective hits Chemical proteomics prospective targets How to attain comprehensive understanding?
  • 9. How to make sense of diffuse multimode data? Mechanism of Action (MOA) discovery: find compound subsets that conserve common mechanism Excellent (but imperfect) example: TEST (Toxicology Estimation Software Tool) http://www.epa.gov/nrmrl/std/qsar/qsar.html
  • 10. TEST Multiple data sets covering toxicity outcomes for numerous compounds Predict toxicity of query compounds via on-the-fly training to similar pre-characterized analogs
  • 11. TEST Multiple data sets covering toxicity outcomes for numerous compounds Predict toxicity of query compounds via on-the-fly training to similar pre-characterized analogs Use Tanimoto distances over molecular fingerprints: no validated relevance specific outcomes
  • 12. Procedure: 1. Assemble Matrix of compounds vs. activity & features MOA method: feature / compound selection
  • 13. Procedure: 1. Assemble Matrix of compounds vs. activity & features 2. Normalize MOA method: feature / compound selection
  • 14. Procedure: 1. Assemble Matrix of compounds vs. activity & features 2. Normalize 3. Fold activity into features as per: Ci = |Act* - Xi*| X values: 0 = perfect correlation 1 = perfect anticorrelation MOA method: feature / compound selection
  • 15. Procedure: 1. Assemble Matrix of compounds vs. activity & features 2. Normalize 3. Fold activity into features as per: Ci = |Act* - Xi*| 4. Bicluster MOA method: feature / compound selection
  • 16. Procedure: 1. Assemble Matrix of compounds vs. activity & features 2. Normalize 3. Fold activity into features as per: Ci = |Act* - Xi*| 4. Bicluster Clusters Contiguous correlative or anticorrelative regions or matrix Within clusters: molecules may share MOA; features may correlate with activity Confidence: correlative & predictive quality of model derived from cluster MOA method: feature / compound selection
  • 17. Example: Oral Bioavailability Oral update depends on: ● Polar solubility ● Membrane permeability ● Interaction with various transporters Data (from Tingjun Hou): 773 molecules http://modem.ucsd.edu/adme/databases/databases_bioavailability.htm Descriptors (from VolSurf and DVS): 298 features passing information content and linear independence (R < 0.90) filters
  • 18. Example: Oral Bioavailability Oral update depends on: ● Polar solubility ● Membrane permeability ● Interaction with various transporters Data (from Tingjun Hou): 773 molecules http://modem.ucsd.edu/adme/databases/databases_bioavailability.htm Descriptors (from VolSurf and DVS): 298 features passing information content and linear independence (R < 0.90) filters Preliminary Model (Weka: Bootstrap Aggregating / RepTree): Q2 (5-fold) = 0.4712
  • 19. Example: Oral Bioavailability Oral update depends on: ● Polar solubility ● Membrane permeability ● Interaction with various transporters Data (from Tingjun Hou): 773 molecules http://modem.ucsd.edu/adme/databases/databases_bioavailability.htm Descriptors (from VolSurf and DVS): 298 features passing information content and linear independence (R < 0.90) filters Preliminary Model (Weka: Bootstrap Aggregating / RepTree): Q2 (5-fold) = 0.4712 CFS & RF: reduced to 27 features Q2 (5-fold) = 0.4739
  • 21. Clusters as local training sets:
  • 22. Clusters as local training sets: Condense to 18 high quality clusters that cover almost entire training space (omit only 10 of 768 cpds)
  • 23. Conclusions Correlative & predictive performance of subset models gives strong confidence in MOA conservation in clusters Head-to-head comparison with chemical proteomics data should provide strong basis for target identification Questions / Suggestions? [email protected]