SlideShare a Scribd company logo
A Predictive Ligand-Based Bayesian Model for Human Drug Induced Liver Injury Sean Ekins Collaborations in Chemistry, Jenkintown, PA. Collaborative Drug Discovery, Burlingame, CA. School of Pharmacy, Department of Pharmaceutical Sciences, University of Maryland.
Pharma reached a productivity tipping point Cost of drug development high Failure in clinic due to toxicity How to predict failure earlier
Why We Need Models Define structure activity relationship (SAR) in vitro  models - limited throughput  in silico – in vitro  approach has value in targeting testing of compounds. Computers can beat humans at chess and Jeopardy why not help with predicting toxicity?
The future: crowdsourced drug discovery Williams et al., Drug Discovery World, Winter 2009
What is DILI? Drug metabolism in the liver can convert some drugs into highly reactive intermediates,  In turn can adversely affect the structure and functions of the liver.  Drug-induced liver injury (DILI), is the number one reason drugs are not approved  and also the reason some of them were withdrawn from the market after approval Estimated global annual incidence rate of DILI is 13.9-24.0 per 100,000 inhabitants,  and DILI accounts for an estimated 3-9% of all adverse drug reactions reported to health authorities  Herbal components can cause DILI too https://dilin.dcri.duke.edu/for-researchers/info/
Drug Examples for DILI + and - Troglitazone DILI + Pioglitazone DILI - Rosiglitazone DILI - Sulindac DILI + Aspirin DILI - Diclofenac DILI + Xu et al., Toxicol Sci 105: 97-105 (2008).
Limitations of DILI? Compound has to physically have been made and be available for testing.  The screening system is still relatively low throughput compared with any primary screens  Whole compound or vendor libraries cannot be cost effectively screened for prioritization. Screening system should be representative of the human organ including drug metabolism capability.  Prediction of human therapeutic  C max  is often imprecise before clinical testing in actual patients.  Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
Statistical Methodologies Non Linear regression Genetic algorithms Neural networks Support vector machines Recursive partitioning (trees) Sammon maps Bayesian methods Kohonen maps A rich collection of descriptors. Public and proprietary data. Problems to date – small datasets Understanding applicability chemical space Tools for big datasets P-gp +ve  P-gp -ve Balakin et al.,Curr Drug Disc Technol 2:99-113, 2005.
DILI Computational Models 74 compounds -  classification models (linear discriminant analysis, artificial neural networks, and machine learning algorithms (OneR))  Internal cross-validation (accuracy 84%, sensitivity 78%, and specificity 90%). Testing on 6 and 13 compounds, respectively > 80% accuracy.  (Cruz-Monteagudo et al., J Comput Chem 29: 533-549, 2008).  A second study used binary QSAR (248 active and 283 inactive) Support vector machine models –  external 5-fold cross-validation procedures and 78% accuracy for a set of 18 compounds    (Fourches et al., Chem Res Toxicol 23: 171-183, 2010).  A third study created a knowledge base with structural alerts from 1266 chemicals.  Alerts created were used to predict results for 626 Pfizer compounds (sensitivity of 46%, specificity of 73%, and concordance of 56% for the latest version)    (Greene et al., Chem Res Toxicol 23: 1215-1222, 2010).
DILI data Tested a panel of orally administered drugs at multiples of the maximum therapeutic concentration ( C max ),  taking into account the first-pass effect of the liver and other idiosyncratic toxicokinetic/toxicodynamic factors.  The 100-fold  C max  scaling factor represented a reasonable threshold to differentiate safe versus toxic drugs for an orally dosed drug and with regard to hepatotoxicity.  Concordance of the in vitro human hepatocyte imaging assay technology (HIAT) for 300 drugs and chemicals, ~ 75% with regard to clinical hepatotoxicity, with very few false-positive results  Xu et al., Toxicol Sci 105: 97-105 (2008).
CRIMALDDI Meeting 2010 www.collaborativedrug.com Linking databases www.chemspider.com ~25 million compounds.  Linking to >300 data sources  Underpinning the semantic web.patents and publications, chemical suppliers etc. host for crowdsourced data Focus on data curation quality Used multiple databases to validate structures [email_address] Data curation
Bayesian machine learning Laplacian-corrected Bayesian classifier models were generated using Discovery Studio (version 2.5.5; Accelrys).  Training set = 295, test set = 237 compounds  Uses two-dimensional descriptors to distinguish between compounds that are DILI-positive and those that are DILI-negative ALogP ECFC_6  Apol  logD  molecular weight  number of aromatic rings  number of hydrogen bond acceptors  number of hydrogen bond donors  number of rings  number of rotatable bonds  molecular polar surface area  molecular surface area  Wiener and Zagreb indices Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010 Extended connectivity fingerprints
Bayesian machine learning Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010 Bayesian classification is a simple probabilistic classification model. It is based on Bayes’ theorem h  is the hypothesis or model d  is the observed data p ( h ) is the prior belief (probability of hypothesis  h  before observing any data) p ( d ) is the data evidence (marginal probability of the data) p ( d|h ) is the likelihood (probability of data  d  if hypothesis  h  is true)  p ( h|d ) is the posterior probability (probability of hypothesis  h  being true given the observed data  d )  A weight is calculated for each feature using a Laplacian-adjusted probability estimate to account for the different sampling frequencies of different features.  The weights are summed to provide a probability estimate
Examples of using Bayesian Models Integrated in Silico-in Vitro Strategy for Addressing Cytochrome P450 3A4 Time-Dependent Inhibition Zientek et al., Chem Res Toxicol 23: 664-676 (2010) Challenges predicting ligand-receptor interactions of promiscuous proteins: the nuclear receptor PXR  Ekins S, Kortagere S, Iyer M, Reschly EJ, Lill MA, Redinbo MR and Krasowski MD, PLoS Comput Biol 5(12): e1000594, (2009) . Computational models for drug inhibition of the human apical sodium-dependent bile acid transporter  Zheng X, et al., Mol Pharm, 6: 1591-1603, (2009) Quantitative structure activity relationship for inhibition of human organic cation/carnitine transporter Diao et al., Mol Pharm, 7: 2120-2131, (2010)
Features in DILI + Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010 Avoid Long aliphatic chains Phenols Ketones Diols  -methyl styrene Conjugated structures Cyclohexenones Amides ?
Features in DILI - Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
Results Fingerprints with high Bayesian scores that are present in many DILI compounds appeared to be reactive in nature,  Could cause time-dependent inhibition of cytochromes P450 or be precursors for metabolites that are reactive and may covalently bind to proteins.  Why are long aliphatic chains important for DILI  generally hydrophobic and perhaps enabling increased accumulation? may be hydroxylated and then form other metabolites that are in turn reactive? Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
Test set analysis compounds of most interest   well known hepatotoxic drugs (U.S. Food and Drug Administration Guidance for Industry “Drug-Induced Liver Injury: Premarketing Clinical Evaluation,” 2009), plus their less hepatotoxic comparators, if clinically available.  Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
Training vs test set PCA Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010 Yellow = test Blue = training retinyl  palmitate
Compare to newer drugs  Extracted small molecule drugs from 2006 to 2010 from the Prous Integrity database  Structure validation resulted in a set of 77 molecules (mean molecular weight 427.05  ±  280.31, range 94.11–1994.09) These molecules were distributed throughout the combined training and test sets (N = 532), representative of overlap These combined analyses suggest that the test and training sets used for the DILI model are representative of current medicinal chemistry efforts. Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
Fingolimod (Gilenya) for MS (EMEA and FDA)  Paliperidone for schizophrenia Pirfenidone for Idiopathic pulmonary fibrosis Roflumilast for pulmonary disease Predictions for newly approved EMEA compounds Can we get DILI experimental data for these?
SMARTS FIlters Smartsfilter kindly provided by Dr. Jeremy Yang (University of New Mexico, Albuquerque, NM, http://pasilla.health.unm.edu/tomcat/biocomp/smartsfilter). Substructure Alerts used to filter libraries – remove reactive groups etc. Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
SMARTS Filters vs Rule of 5 Ekins and Freundlich, Pharm Res, In press 2011  Correlation between the number of SMARTS filter failures and the number of Lipinski violations for different types of rules sets with FDA drug set from CDD (N = 2804) Suggests # of Lipinski violations may also be an indicator of undesirable chemical features that result in reactivity
Open source tools for modeling Open source descriptors CDK and C5.0 algorithm ~60,000  molecules with P-gp efflux data from Pfizer MDR <2.5 (low risk) (N = 14,175) MDR > 2.5 (high risk) (N = 10,820) Test set MDR <2.5 (N = 10,441) > 2.5 (N = 7972) Could facilitate model sharing? DILI dataset - another example Gupta RR, et al., Drug Metab Dispos, 38: 2083-2090, 2010   $  $$$$$$
Open source software for molecular descriptors and algorithms Spend only a fraction of the money on QSAR Selectively share your models with collaborators and control access Have someone else host the models / predictions The next opportunities for crowdsourcing… Models Inside company Collaborators Current investments >$1M/yr >$10-100’s M/yr Databases, servers Commercial Descriptors  Algorithms In house  data  generation Data
Pfizer Merck GSK Novartis Lilly BMS Could combining models give greater coverage of ADME/ Tox chemistry space and improve predictions? Lundbeck Allergan Bayer AZ Roche BI Merk KGaA Expanding computational model coverage of chemical space
More collaborations, integrating models into scientific social networks Drug Disc Today, 14: 261-270, 2009
Conclusions  First large-scale testing of DILI machine learning models Concordance lower than with in vitro model Statistics similar to Structural alerts from Pfizer paper SMARTS can be used to filter libraries Machine learning models better than SMARTS  Could use models to filter compounds for further testing in vitro Use published knowledge to predict DILI Combinations of models Combine datasets – create models with Open descriptors and algorithms – equivalent to commercial?? Make models widely available Collaborations to cover wider chemical space Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
Acknowledgments Merck Jim Xu Royal Society of Chemistry Antony J. Williams  Pfizer Rishi Gupta Eric Gifford Ted Liston Chris Waller Accelrys CDD [email_address] http://www.slideshare.net/ekinssean   http://www.collabchem.com/ http://www.collaborations.com/CHEMISTRY.HTM

More Related Content

PPT
Talk at Yale University April 26th 2011: Applying Computational Models for To...
PPT
Finland Helsinki Drug Research slides 2011
PPT
Nc state lecture v2 Computational Toxicology
PDF
An Introduction to:In Vitro-In VivoExtrapolation (IVIVE)
PPT
dkNET Webinar: The Mouse Metabolic Phenotyping Centers: Services and Data 01/...
PDF
Wityak 2015 JMC Lead optimization towards PoC tools for HD with a 4-pyrazol-...
PPTX
Slas talk 2016
PDF
Molecular and data visualization in drug discovery
Talk at Yale University April 26th 2011: Applying Computational Models for To...
Finland Helsinki Drug Research slides 2011
Nc state lecture v2 Computational Toxicology
An Introduction to:In Vitro-In VivoExtrapolation (IVIVE)
dkNET Webinar: The Mouse Metabolic Phenotyping Centers: Services and Data 01/...
Wityak 2015 JMC Lead optimization towards PoC tools for HD with a 4-pyrazol-...
Slas talk 2016
Molecular and data visualization in drug discovery

What's hot (20)

PDF
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
PPT
acs talk open source drug discovery
PDF
working_example_poster
PDF
Novel Hybrid Molecules of Isoxazole Chalcone Derivatives: Synthesis and Study...
PPT
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
PDF
Crimson Publishers-Allometry Scalling in Drug Development
PPTX
Alternatives to Animal Testing
PPT
Animal Experiments and Alternatives
PDF
QSAR Modeling of Bisbenzofuran Compounds using 2D-Descriptors as Antimalarial...
PDF
PAMAM/5-fluorouracil drug conjugate for targeting E6 and E7 oncoproteins in c...
PPTX
Alternative methods to animal toxicity testing
PDF
Articulo4
PDF
Computational design of novel candidate drug molecules for schistosomiasis
PPT
RecA foscused antibacterial screening
PDF
Study on multi-target mechanism of Radix et Rhizoma Rhei (Dahuang) and Semen ...
PPT
s.s.c (Alternative to animal study)
PPTX
Micronucleus Assay
PDF
a-rat-pharmacokinetic-pharmacodynamic-model-for-assessment-of-lipopolysacchar...
PDF
Martinez use of in silico models to support canine drug development
PPTX
Fragment Based Drug Discovery
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
acs talk open source drug discovery
working_example_poster
Novel Hybrid Molecules of Isoxazole Chalcone Derivatives: Synthesis and Study...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Crimson Publishers-Allometry Scalling in Drug Development
Alternatives to Animal Testing
Animal Experiments and Alternatives
QSAR Modeling of Bisbenzofuran Compounds using 2D-Descriptors as Antimalarial...
PAMAM/5-fluorouracil drug conjugate for targeting E6 and E7 oncoproteins in c...
Alternative methods to animal toxicity testing
Articulo4
Computational design of novel candidate drug molecules for schistosomiasis
RecA foscused antibacterial screening
Study on multi-target mechanism of Radix et Rhizoma Rhei (Dahuang) and Semen ...
s.s.c (Alternative to animal study)
Micronucleus Assay
a-rat-pharmacokinetic-pharmacodynamic-model-for-assessment-of-lipopolysacchar...
Martinez use of in silico models to support canine drug development
Fragment Based Drug Discovery
Ad

Viewers also liked (18)

PPTX
Xenobiotic induced liver pathology
PPTX
PDF
FDA 2013 Clinical Investigator Training Course: Serious Drug Induced Liver In...
PPT
Common drug induced liver injury in children -dr. harshad devarbhai
PPTX
Dermo toxicology
PPT
Toxic and drug induced hepatitis
PPT
Liver Trauma (Liver Injury)
PPT
Gastrocon 2016 - Drug Induced Liver Disease
PPTX
Molecular mechanism of drug induced hepatotoxicity
PPTX
Drug induced hepatotoxicity and its regulatory implications
PPTX
Blood Vessel Diseases
PPT
Drug induced hepatitis by muhammad umer
PPT
Hepatotoxicity
PPT
Alcoholic Liver Disease PPT.
PPTX
Pathology healing and repair
PPT
Introduction to pathology
PPTX
Drug induce liver disease mita
PPT
Cirrhosis of liver
Xenobiotic induced liver pathology
FDA 2013 Clinical Investigator Training Course: Serious Drug Induced Liver In...
Common drug induced liver injury in children -dr. harshad devarbhai
Dermo toxicology
Toxic and drug induced hepatitis
Liver Trauma (Liver Injury)
Gastrocon 2016 - Drug Induced Liver Disease
Molecular mechanism of drug induced hepatotoxicity
Drug induced hepatotoxicity and its regulatory implications
Blood Vessel Diseases
Drug induced hepatitis by muhammad umer
Hepatotoxicity
Alcoholic Liver Disease PPT.
Pathology healing and repair
Introduction to pathology
Drug induce liver disease mita
Cirrhosis of liver
Ad

Similar to BioIT Drug induced liver injury talk 2011 (20)

PPT
Unc slides on computational toxicology
PDF
A predictive ligand based Bayesian model for human drug induced liver injury
PPT
Montreal 8th world congress
PDF
The Future of Computational Models for Predicting Human Toxicities
PDF
Genomica Yquimiot
PPT
Network Pharmacology Tri-Con 022212
PPT
Slides for st judes
PPT
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
PPT
Collaborative Database and Computational Models for Tuberculosis Drug Discovery
PDF
Systems Pharmacology as a tool for future therapy development: a feasibility ...
PPTX
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
PPT
Accelrys UGM slides 2011
PPT
TDRtargets.org: an open-access resource for prioritizing possible drug target...
PPTX
Role of bioinformatics of drug designing
PPTX
Alternative to animal toxicit testing.pptx
PDF
Organonachip Engineered Microenvironments For Safety And Efficacy Testing 1st...
PPT
Predicting Drug Candidates Safety : the Role and Usage of Knowledge Bases
PDF
Introduction to the drug discovery process
PDF
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
PPTX
ABT 609 PPT
Unc slides on computational toxicology
A predictive ligand based Bayesian model for human drug induced liver injury
Montreal 8th world congress
The Future of Computational Models for Predicting Human Toxicities
Genomica Yquimiot
Network Pharmacology Tri-Con 022212
Slides for st judes
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Collaborative Database and Computational Models for Tuberculosis Drug Discovery
Systems Pharmacology as a tool for future therapy development: a feasibility ...
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
Accelrys UGM slides 2011
TDRtargets.org: an open-access resource for prioritizing possible drug target...
Role of bioinformatics of drug designing
Alternative to animal toxicit testing.pptx
Organonachip Engineered Microenvironments For Safety And Efficacy Testing 1st...
Predicting Drug Candidates Safety : the Role and Usage of Knowledge Bases
Introduction to the drug discovery process
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
ABT 609 PPT

More from Sean Ekins (20)

PDF
Applying Artificial Intelligence in a Small Drug Discovery Company
PPTX
How to Win a small business grant.pptx
PDF
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
PPTX
A presentation at the Global Genes rare drug development symposium on governm...
PPTX
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
PPTX
Bayesian Models for Chagas Disease
PPTX
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
PPTX
Drug Discovery Today March 2017 special issue
PDF
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
PPTX
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
PPTX
Open zika presentation
PDF
academic / small company collaborations for rare and neglected diseasesv2
PPTX
CDD models case study #3
PPTX
CDD models case study #2
PPTX
CDD Models case study #1
PDF
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...
PDF
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
PDF
The future of computational chemistry b ig
PDF
#ZikaOpen: Homology Models -
PDF
Pros and cons of social networking for scientists
Applying Artificial Intelligence in a Small Drug Discovery Company
How to Win a small business grant.pptx
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
A presentation at the Global Genes rare drug development symposium on governm...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Bayesian Models for Chagas Disease
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Drug Discovery Today March 2017 special issue
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Open zika presentation
academic / small company collaborations for rare and neglected diseasesv2
CDD models case study #3
CDD models case study #2
CDD Models case study #1
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
The future of computational chemistry b ig
#ZikaOpen: Homology Models -
Pros and cons of social networking for scientists

Recently uploaded (20)

PPTX
Electrolyte Disturbance in Paediatric - Nitthi.pptx
PDF
focused on the development and application of glycoHILIC, pepHILIC, and comm...
PDF
Transcultural that can help you someday.
PPTX
Neoplasia III.pptxjhghgjhfj fjfhgfgdfdfsrbvhv
PPTX
09. Diabetes in Pregnancy/ gestational.pptx
PPTX
Post Op complications in general surgery
PDF
Plant-Based Antimicrobials: A New Hope for Treating Diarrhea in HIV Patients...
PPTX
y4d nutrition and diet in pregnancy and postpartum
PPT
Rheumatology Member of Royal College of Physicians.ppt
PPTX
preoerative assessment in anesthesia and critical care medicine
PPT
neurology Member of Royal College of Physicians (MRCP).ppt
PDF
AGE(Acute Gastroenteritis)pdf. Specific.
PPT
nephrology MRCP - Member of Royal College of Physicians ppt
PPTX
NUCLEAR-MEDICINE-Copy.pptxbabaabahahahaahha
PPTX
CARDIOVASCULAR AND RENAL DRUGS.pptx for health study
PDF
04 dr. Rahajeng - dr.rahajeng-KOGI XIX 2025-ed1.pdf
PPTX
Human Reproduction: Anatomy, Physiology & Clinical Insights.pptx
PDF
The_EHRA_Book_of_Interventional Electrophysiology.pdf
PPTX
Manage HIV exposed child and a child with HIV infection.pptx
PDF
OSCE SERIES ( Questions & Answers ) - Set 5.pdf
Electrolyte Disturbance in Paediatric - Nitthi.pptx
focused on the development and application of glycoHILIC, pepHILIC, and comm...
Transcultural that can help you someday.
Neoplasia III.pptxjhghgjhfj fjfhgfgdfdfsrbvhv
09. Diabetes in Pregnancy/ gestational.pptx
Post Op complications in general surgery
Plant-Based Antimicrobials: A New Hope for Treating Diarrhea in HIV Patients...
y4d nutrition and diet in pregnancy and postpartum
Rheumatology Member of Royal College of Physicians.ppt
preoerative assessment in anesthesia and critical care medicine
neurology Member of Royal College of Physicians (MRCP).ppt
AGE(Acute Gastroenteritis)pdf. Specific.
nephrology MRCP - Member of Royal College of Physicians ppt
NUCLEAR-MEDICINE-Copy.pptxbabaabahahahaahha
CARDIOVASCULAR AND RENAL DRUGS.pptx for health study
04 dr. Rahajeng - dr.rahajeng-KOGI XIX 2025-ed1.pdf
Human Reproduction: Anatomy, Physiology & Clinical Insights.pptx
The_EHRA_Book_of_Interventional Electrophysiology.pdf
Manage HIV exposed child and a child with HIV infection.pptx
OSCE SERIES ( Questions & Answers ) - Set 5.pdf

BioIT Drug induced liver injury talk 2011

  • 1. A Predictive Ligand-Based Bayesian Model for Human Drug Induced Liver Injury Sean Ekins Collaborations in Chemistry, Jenkintown, PA. Collaborative Drug Discovery, Burlingame, CA. School of Pharmacy, Department of Pharmaceutical Sciences, University of Maryland.
  • 2. Pharma reached a productivity tipping point Cost of drug development high Failure in clinic due to toxicity How to predict failure earlier
  • 3. Why We Need Models Define structure activity relationship (SAR) in vitro models - limited throughput in silico – in vitro approach has value in targeting testing of compounds. Computers can beat humans at chess and Jeopardy why not help with predicting toxicity?
  • 4. The future: crowdsourced drug discovery Williams et al., Drug Discovery World, Winter 2009
  • 5. What is DILI? Drug metabolism in the liver can convert some drugs into highly reactive intermediates, In turn can adversely affect the structure and functions of the liver. Drug-induced liver injury (DILI), is the number one reason drugs are not approved and also the reason some of them were withdrawn from the market after approval Estimated global annual incidence rate of DILI is 13.9-24.0 per 100,000 inhabitants, and DILI accounts for an estimated 3-9% of all adverse drug reactions reported to health authorities Herbal components can cause DILI too https://dilin.dcri.duke.edu/for-researchers/info/
  • 6. Drug Examples for DILI + and - Troglitazone DILI + Pioglitazone DILI - Rosiglitazone DILI - Sulindac DILI + Aspirin DILI - Diclofenac DILI + Xu et al., Toxicol Sci 105: 97-105 (2008).
  • 7. Limitations of DILI? Compound has to physically have been made and be available for testing. The screening system is still relatively low throughput compared with any primary screens Whole compound or vendor libraries cannot be cost effectively screened for prioritization. Screening system should be representative of the human organ including drug metabolism capability. Prediction of human therapeutic C max is often imprecise before clinical testing in actual patients. Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
  • 8. Statistical Methodologies Non Linear regression Genetic algorithms Neural networks Support vector machines Recursive partitioning (trees) Sammon maps Bayesian methods Kohonen maps A rich collection of descriptors. Public and proprietary data. Problems to date – small datasets Understanding applicability chemical space Tools for big datasets P-gp +ve P-gp -ve Balakin et al.,Curr Drug Disc Technol 2:99-113, 2005.
  • 9. DILI Computational Models 74 compounds - classification models (linear discriminant analysis, artificial neural networks, and machine learning algorithms (OneR)) Internal cross-validation (accuracy 84%, sensitivity 78%, and specificity 90%). Testing on 6 and 13 compounds, respectively > 80% accuracy. (Cruz-Monteagudo et al., J Comput Chem 29: 533-549, 2008). A second study used binary QSAR (248 active and 283 inactive) Support vector machine models – external 5-fold cross-validation procedures and 78% accuracy for a set of 18 compounds (Fourches et al., Chem Res Toxicol 23: 171-183, 2010). A third study created a knowledge base with structural alerts from 1266 chemicals. Alerts created were used to predict results for 626 Pfizer compounds (sensitivity of 46%, specificity of 73%, and concordance of 56% for the latest version) (Greene et al., Chem Res Toxicol 23: 1215-1222, 2010).
  • 10. DILI data Tested a panel of orally administered drugs at multiples of the maximum therapeutic concentration ( C max ), taking into account the first-pass effect of the liver and other idiosyncratic toxicokinetic/toxicodynamic factors. The 100-fold C max scaling factor represented a reasonable threshold to differentiate safe versus toxic drugs for an orally dosed drug and with regard to hepatotoxicity. Concordance of the in vitro human hepatocyte imaging assay technology (HIAT) for 300 drugs and chemicals, ~ 75% with regard to clinical hepatotoxicity, with very few false-positive results Xu et al., Toxicol Sci 105: 97-105 (2008).
  • 11. CRIMALDDI Meeting 2010 www.collaborativedrug.com Linking databases www.chemspider.com ~25 million compounds. Linking to >300 data sources Underpinning the semantic web.patents and publications, chemical suppliers etc. host for crowdsourced data Focus on data curation quality Used multiple databases to validate structures [email_address] Data curation
  • 12. Bayesian machine learning Laplacian-corrected Bayesian classifier models were generated using Discovery Studio (version 2.5.5; Accelrys). Training set = 295, test set = 237 compounds Uses two-dimensional descriptors to distinguish between compounds that are DILI-positive and those that are DILI-negative ALogP ECFC_6 Apol logD molecular weight number of aromatic rings number of hydrogen bond acceptors number of hydrogen bond donors number of rings number of rotatable bonds molecular polar surface area molecular surface area Wiener and Zagreb indices Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010 Extended connectivity fingerprints
  • 13. Bayesian machine learning Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010 Bayesian classification is a simple probabilistic classification model. It is based on Bayes’ theorem h is the hypothesis or model d is the observed data p ( h ) is the prior belief (probability of hypothesis h before observing any data) p ( d ) is the data evidence (marginal probability of the data) p ( d|h ) is the likelihood (probability of data d if hypothesis h is true) p ( h|d ) is the posterior probability (probability of hypothesis h being true given the observed data d ) A weight is calculated for each feature using a Laplacian-adjusted probability estimate to account for the different sampling frequencies of different features. The weights are summed to provide a probability estimate
  • 14. Examples of using Bayesian Models Integrated in Silico-in Vitro Strategy for Addressing Cytochrome P450 3A4 Time-Dependent Inhibition Zientek et al., Chem Res Toxicol 23: 664-676 (2010) Challenges predicting ligand-receptor interactions of promiscuous proteins: the nuclear receptor PXR Ekins S, Kortagere S, Iyer M, Reschly EJ, Lill MA, Redinbo MR and Krasowski MD, PLoS Comput Biol 5(12): e1000594, (2009) . Computational models for drug inhibition of the human apical sodium-dependent bile acid transporter Zheng X, et al., Mol Pharm, 6: 1591-1603, (2009) Quantitative structure activity relationship for inhibition of human organic cation/carnitine transporter Diao et al., Mol Pharm, 7: 2120-2131, (2010)
  • 15. Features in DILI + Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010 Avoid Long aliphatic chains Phenols Ketones Diols  -methyl styrene Conjugated structures Cyclohexenones Amides ?
  • 16. Features in DILI - Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
  • 17. Results Fingerprints with high Bayesian scores that are present in many DILI compounds appeared to be reactive in nature, Could cause time-dependent inhibition of cytochromes P450 or be precursors for metabolites that are reactive and may covalently bind to proteins. Why are long aliphatic chains important for DILI generally hydrophobic and perhaps enabling increased accumulation? may be hydroxylated and then form other metabolites that are in turn reactive? Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
  • 18. Test set analysis compounds of most interest well known hepatotoxic drugs (U.S. Food and Drug Administration Guidance for Industry “Drug-Induced Liver Injury: Premarketing Clinical Evaluation,” 2009), plus their less hepatotoxic comparators, if clinically available. Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
  • 19. Training vs test set PCA Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010 Yellow = test Blue = training retinyl palmitate
  • 20. Compare to newer drugs Extracted small molecule drugs from 2006 to 2010 from the Prous Integrity database Structure validation resulted in a set of 77 molecules (mean molecular weight 427.05 ± 280.31, range 94.11–1994.09) These molecules were distributed throughout the combined training and test sets (N = 532), representative of overlap These combined analyses suggest that the test and training sets used for the DILI model are representative of current medicinal chemistry efforts. Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
  • 21. Fingolimod (Gilenya) for MS (EMEA and FDA) Paliperidone for schizophrenia Pirfenidone for Idiopathic pulmonary fibrosis Roflumilast for pulmonary disease Predictions for newly approved EMEA compounds Can we get DILI experimental data for these?
  • 22. SMARTS FIlters Smartsfilter kindly provided by Dr. Jeremy Yang (University of New Mexico, Albuquerque, NM, http://pasilla.health.unm.edu/tomcat/biocomp/smartsfilter). Substructure Alerts used to filter libraries – remove reactive groups etc. Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
  • 23. SMARTS Filters vs Rule of 5 Ekins and Freundlich, Pharm Res, In press 2011 Correlation between the number of SMARTS filter failures and the number of Lipinski violations for different types of rules sets with FDA drug set from CDD (N = 2804) Suggests # of Lipinski violations may also be an indicator of undesirable chemical features that result in reactivity
  • 24. Open source tools for modeling Open source descriptors CDK and C5.0 algorithm ~60,000 molecules with P-gp efflux data from Pfizer MDR <2.5 (low risk) (N = 14,175) MDR > 2.5 (high risk) (N = 10,820) Test set MDR <2.5 (N = 10,441) > 2.5 (N = 7972) Could facilitate model sharing? DILI dataset - another example Gupta RR, et al., Drug Metab Dispos, 38: 2083-2090, 2010 $ $$$$$$
  • 25. Open source software for molecular descriptors and algorithms Spend only a fraction of the money on QSAR Selectively share your models with collaborators and control access Have someone else host the models / predictions The next opportunities for crowdsourcing… Models Inside company Collaborators Current investments >$1M/yr >$10-100’s M/yr Databases, servers Commercial Descriptors Algorithms In house data generation Data
  • 26. Pfizer Merck GSK Novartis Lilly BMS Could combining models give greater coverage of ADME/ Tox chemistry space and improve predictions? Lundbeck Allergan Bayer AZ Roche BI Merk KGaA Expanding computational model coverage of chemical space
  • 27. More collaborations, integrating models into scientific social networks Drug Disc Today, 14: 261-270, 2009
  • 28. Conclusions First large-scale testing of DILI machine learning models Concordance lower than with in vitro model Statistics similar to Structural alerts from Pfizer paper SMARTS can be used to filter libraries Machine learning models better than SMARTS Could use models to filter compounds for further testing in vitro Use published knowledge to predict DILI Combinations of models Combine datasets – create models with Open descriptors and algorithms – equivalent to commercial?? Make models widely available Collaborations to cover wider chemical space Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
  • 29. Acknowledgments Merck Jim Xu Royal Society of Chemistry Antony J. Williams Pfizer Rishi Gupta Eric Gifford Ted Liston Chris Waller Accelrys CDD [email_address] http://www.slideshare.net/ekinssean http://www.collabchem.com/ http://www.collaborations.com/CHEMISTRY.HTM

Editor's Notes

  • #5: CDD Experienced Team Innovates and Executes Barry Bunin, PhD (Pres. &amp; Cofounder as first Eli Lilly EIR) Libraria (CEO, Pres.-CSO), Arris Pharmaceuticals (Sr. Scientist), Genentech, UC Berkeley (Ellman), Columbia University, author. Moses Hohman, PhD (Director Software Engineering) Northwestern Assoc. Director of Bioinformatics, Thoughtworks, Inc., U of Chicago (PhD), Harvard ( magna cum laude, Physics) Sylvia Ernst, PhD (Director Community Growth &amp; Sales) Left 800-lb Gorillas: Accelrys-Scitegic, MDL-Elsevier-Beilstein Peter Cohan (BOD &amp; Overall Sales Strategy) Symyx (VP Bus Dev &amp; President-Discovery Tools), MDL (VP Customer Marketing), www.secondderivative.com, author. Omidyar Network, Founders Fund, &amp; Lilly (BOD observers) WSGR (Corporate Counsel), Rina Accountancy (GAAP compliance) Partners: Hub Consortium Members, ChemAxon, DNDi, MMV, Sandler Center… CDD SAB: Christopher Lipinski PhD, James McKerrow, MD PhD, David Roos PhD, Adam Renslo PhD, Wes Van Voorhis, MD PhD