0% found this document useful (0 votes)

16 views

QSAR Statistical Concepts

Qsar

Uploaded by

Gunjan Nautiyal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views

QSAR Statistical Concepts

Qsar

Uploaded by

Gunjan Nautiyal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

S.

Moro

Statistical concepts in QSAR.

Computational chemistry represents molecular structures as a numerical models and

simulates their behavior with the equations of quantum and classical physics. Available
programs enable scientists to easily generate and present molecular data including
geometries, energies and associated properties (electronic, spectroscopic and bulk). The
usual paradigm for displaying and manipulating these data is a table in which compounds
are defined by individual rows and molecular properties (or descriptors) are defined by the
associated columns. A QSAR attempts to find consistent relationships between the variations
in the values of molecular properties and the biological activity for a series of compounds so
that these "rules" can be used to evaluate new chemical entities.

A QSAR generally takes the form of a linear equation:

Biological Activity = Const + (c1×P1) + (c2×P2) + (c3×P3) + ...

where the parameters P1 through Pn are computed for each molecule in the series and the
coefficients c1 through cn are calculated by fitting variations in the parameters and the
biological activity. Since these relationships are generally discovered through the
application of statistical techniques, a brief introduction to the principles behind the
derivation of a QSAR follows.

The work reported from The Sandoz Institute for Medical Research on the development of
novel analgesic agents can be used as an example of a simple QSAR. In this study,
vanillylamides and vanillylthioureas related to capsaicin were prepared and their activity
was tested in an in vitro assay which measured 45Ca2+ influx into dorsal root ganglia neurons.
The data, which was reported as the EC50 (µM), is shown in Table 1 (note that compound 6f
is the most active of the series).

TABLE 1: Capsaicin Analogs Activity Data

1
S. Moro

In the absence of additional information, the only way to derive a best "guess" for the
activity of 6i is to calculate the average of the values for the current compounds in the
series. The average, 7.24, provides a guess for the value of compound 8 but, how good is this
guess? The graphical presentation of the data points is shown in Graph 1.

GRAPH 1: Capsaicin Analogs Activity Data.

The standard deviation of the data, s, shows how far the activity values are spread about
their average. This value provides an indication of the quality of the guess by showing the
amount of variability inherent in the data. The standard deviation is calculated as shown
below.

Rather than relying on this limited analysis, one would like to develop an understanding of
the factors that influence activity within this series and use this understanding to predict
activity for new compounds. In order to accomplish this objective, one needs:

• binding data measured with sufficient precision to distinguish between compounds;

• a set of parameters which can be easily obtained and which are likely to be related
to receptor affinity;
• a method for detecting a relationship between the parameters and binding data
(the QSAR) and
• a method for validating the QSAR.

The QSAR equation is a linear model which relates variations in biological activity to
variations in the values of computed (or measured) properties for a series of molecules. For
the method to work

2
S. Moro

efficiently, the compounds selected to describe the "chemical space" of the experiments
(the training set) should be diverse. In many synthesis campaigns, compounds are prepared
which are structurally similar to the lead structure. Not surprisingly, the activity values for this
series of compounds will frequently span a limited range as well. In these cases, additional
compounds must be made and tested to fill out the training set.
The quality of any QSAR will only be as good as the quality of the data which is used to
derive the model. Dose-response curves need to be smooth, contain enough points to
assure accuracy and should span two or more orders of magnitude. Multiple readings for a
given observation should be reproducible and have relatively smaller errors. The issue being
addressed is the signal-to-noise ratio.
The variation of the readings obtained by repeatedly testing the same compound should be
much smaller than the variation over the series. In cases where the data collected from
biological experiments do not follow these guidelines, other methods of data analysis should
be utilized since the QSAR models derived from the data will be questionable.
Once biological data has been collected, it is often found that the data is expressed in
terms which cannot be used in a QSAR analysis. Since QSAR is based on the relationship of
free energy to equilibrium constants, the data for a QSAR study must be expressed in terms
of the free energy changes that occur during the biological response. When examining the
potency of a drug (the dosage required to produce a biological effect), the change in free
energy can be calculated to be proportional to the inverse logarithm of the concentration
of the compound.

∆G0 = - 2.3RTlogK = log 1/[S]

Further, since biological data are generally found to be skewed, the log transformation
moves the data to a nearly normal distribution. Thus, when measuring responses under
equilibrium conditions, the most frequent transformation used is to express concentration
values (such as IC50, EC50, etc.) as log[C] or log 1/[C]. The transformed data for the
capsaicin agonists are shown in Table 2.

TABLE 2: Capsaicin Analogs Transformed Data

3
S. Moro

GRAPH 2: Capsaicin Analogs Transformed Data

Given the transformed data, our best guess for the activity of 6i is still the average of the
data set (or 0.40). As before, the error associated with this guess is calculated as the square
root of the average of the squares of the deviations from the average.

This is an example data set intended to show the general approach; real data sets would
have many more compounds and descriptors. Since the purpose of a QSAR is to highlight
relationships between activity and structural features, we would like to find one or more
structural features which relate these molecules and their associated activity. Additionally,
we would like to find a parameter that works consistently for all of the molecules in the series.

There are several potential classes of parameters used in QSAR studies. Substituent constants
and other physico-chemical parameters (such as Hammett sigma constants) measure the
electronic effects of a group on the molecule. Fragment counts are used to enumerate the
presence of specific substructures. Other parameters can include topological descriptors
and values derived from quantum chemical calculations.

The selection of parameters is an important first step in any QSAR study. If the association
between the parameter(s) selected and activity is strong, then activity predictions will be
possible. If there is only weak association, knowing the value of the parameter(s) will not help
in predicting activity. Thus, for a given study, parameters should be selected which are
relevant to the activity for the series of molecules under investigation and these parameters
should have values which are obtained in a consistent manner.

The Sandoz group divided their analysis of capsaicin analogs into three regions: the A-region
which was occupied by an aromatic ring; the B-region which was defined by an amide
bond; and the C-region which was occupied by a hydrophobic side-chain (See figure in
Table 1). The hypothesis for the C-region assumed that a small, hydrophobic substituent

4
S. Moro

would increase activity. Given this assumption, the parameters selected to best define this
characteristic were molar refractivity (size) and , the hydrophobic substituent constant.
These values are given in Table 3.

TABLE 3: Capsaicin Analogs Parameter Values

The data above can be analyzed for relationships by two means: graphically and
statistically. The most visual approach to a problem with a limited number of variables is
graphical. In this case, a plot of activity versus either molar refractivity or hydrophobicity
gives some insight into the relationship between the parameters and activity. The plots
derived by the Sandoz group are reproduced in Graph 3.

GRAPH 3: Capsaicin Analogs Parameter Values

5
S. Moro

Does the graph provide insight into the activity for compound 6i? Does knowing the value
for either the hydrophobicity or molar refractivity parameters for this compound provide a
good estimate for activity?

Since this is a simple example where only two values are examined, the answers to these
questions are a qualified yes. In more complex situations however, where multiple
parameters are correlated to activity, statistics is used to derive an equation which relates
activity to the parameter set. The linear equation which defines the best model for this set of
data is

Log EC50 = 0.764 - (0.817)π

How much confidence should we place in this model? The first step to answering this
question is to determine how well the equation predicts activities for known compounds in
the series. The equation above estimates the average value for the EC50 based on the value
for; because assays vary, it is not surprising that individual values will differ from the regression
estimate. The difference between the calculated values and the actual (or measured)
values for each compound is termed the residual from the model. The calculated values for
activity and their residuals (or the errors of the estimate for individual values) are shown in
Table 4.

TABLE 4: Capsaicin Analogs Calculated Values

The residuals are one way to quantify the error in the estimate for individual values
calculated by the regression equation for this data set. The standard error for the residuals is
calculated by taking the root-mean-square of the residuals (in this calculation, the
denominator shown as decremented by two to reflect the estimation of two parameters).

6
S. Moro

In order to be an improved model, the standard deviation of the residuals calculated from
the model should be smaller than the standard deviation of the original data. The standard
error about the mean was previously calculated to be 0.76 whereas the standard error from
the QSAR model is 0.28. Clearly, the use of linear regression has improved the accuracy of
our analysis. The plot of measured values versus calculated is shown in Graph 4 with a 45°
line.

GRAPH 4 Capsaicin Analogs Predicted Versus Actual EC50 Values

There are several assumptions inherent in deriving a QSAR model for a series of compounds.
First, it is assumed that parameters can be calculated (or measured in some cases) more
accurately and cheaply than activity can be measured. Second, it is assumed that
deviations from the best fit line follow a normal (Gaussian) distribution. Finally, it is assumed
that any variation in the line described by the QSAR equation is independent of the
magnitude of both the activity and the parameters. Given these assumptions, the quality of
the model can be gauged using a variety of techniques.

Variation in the data is quantified by the correlation coefficient, r, which measures how
closely the observed data tracks the fitted regression line. Errors in either the model or in the
data will lead to a bad fit. This indicator of fit to the regression line is calculated as:

where the “Regression Variance” is defined as the “Original Variance” minus the Variance
around the regression line. The Original Variance is the sum-of-the-squares distances of the
original data from the mean. This can be viewed graphically as shown in Graph 5.

The calculation is carried out as follows:

Original Variance = (1.07 - 0.40)2 + (0.09 - 0.40)2 + ...

Original Variance = 3.49

7
S. Moro

Variance around the line = (0.28)2 + (- 0.12)2 + (- 0.36)2 + ...

Variance around the line = 0.40

Regression Variance = Original Variance - Variance around the line

Regression Variance = 3.49 - 0.40 = 3.09

r2 = Regression Variance/Original Variance

r2 = 3.09/3.49
r2 = 0.89

Possible values reported for r2 fall between 0 and 1. An r2 of 0 means that there is no
relationship between activity and the parameter(s) selected for the study. An r2 of 1 means
there is perfect correlation. The interpretation of the r2 value for the capsaicin analogs is that
89% of the variation in the value of the Log EC50 is explained by variation in the value of , the
hydrophobicity parameter.

GRAPH 5 Capsaicin Analogs Derivation of r2 values

While the fit of the data to the regression line is excellent, how can one decide if this
correlation is based purely on chance? The higher the value for r2 the less likely that the
relationship is due to chance. If many explanatory variables are used in a regression

8
S. Moro

equation, it is possible to get a good fit to the data due to the flexibility of the fitting process;
a line will fit two points perfectly, a quadratic curve will fit three, multiple linear regression will
fit the observed data if there are enough explanatory variables2. Given the assumption that
the data has a Gaussian distribution, the F statistic below assesses the statistical significance
of the regression equation.

The F statistic is calculated from r2 and the number of data points (or degrees of freedom) in
the data set. The F ratio for the capsaicin analogs is calculated as:

This value often appears as standard output from statistical programs or it can be checked
in statistical tables to determine the significance of the regression equation. In this case, the
probability that there is no relationship between activity and the value is less than 1%
(p=0.01).

We have found that hydrophobicity values correlate well with biological activity. Does the
addition of a size parameter (MR) improve our model? In order to analyze a relationship
which is possibly influenced by several variables (or properties), it is useful to assess the
contribution of each variable. π and MR appear to be somewhat correlated in this data set
so the order of fitting can influence how much the second variable helps the first. Multiple
linear regression is used to determine the relative importance of multiple variables to the
overall fit of the data.

Multiple linear regression attempts to maximize the fit of the data to a regression equation
(minimize the squared deviations from the regression equation) for the biological activity
(maximize the r2 value) by adjusting each of the available parameters up or down.
Regression programs often approach this task in a stepwise fashion. That is, successive
regression equations will be derived in which parameters will be either added or removed
until the r2 and s values are optimized. The magnitude of the coefficients derived in this
manner indicate the relative contribution of the associated parameter to biological activity.

There are two important caveats in applying multiple regression analysis. The first is based on
the fact that, given enough parameters any data set can be fitted to a regression line. The
consequence of this is that regression analysis generally requires significantly more
compounds than parameters; a useful rule of thumb is three to six times the number of
parameters under consideration. The difficulty is that regression analysis is most effective for
interpolation and it is extrapolation that is most useful in a synthesis campaign (i.e., the
region of experimental space described by the regression analysis has been explained, but
projecting to a new, unanalyzed region can be problematic).

Using multiple regression for the capsaicin analogs, one can derive the following equation
which relates hydrophobicity and molar refractivity to biological activity.

Log EC50 = 0.762 - (0.819)π + (0.011)MR

s = 0.313, r2 = 0.888
To judge the importance of a regression term, three items need to be considered.

9
S. Moro

1. Statistical significance of the regression coefficient.

2. The magnitude of the typical effect “bixi” (in this case, 0.011x25.36).
3. Any cross-correlation with other terms.

As more terms are added to multiple linear regression, r2 always gets larger. We recomputed
the previous calculations (r2 = 0.89) carrying three significant figures so that rounding does
not lead to confusion.

These results of this analysis indicate that, within this series, steric bulk is not an important
factor in activity. The influence of the hydrophobicity constant confirms the presence of a
hydrophobic binding site. Given the limited number of substituents in this analysis, it is unlikely
that more can be learned from further analysis.

This section has developed the fundamental mathematics of QSAR studies. Several authors
have published reviews of QSAR and have discussed various aspects of the methods. Each
of the examples to follow uses these techniques to derive information about the chemical
factors which are important for activity.

Halifax Bank Statement
No ratings yet
Halifax Bank Statement
4 pages
Marine Industries in MY
No ratings yet
Marine Industries in MY
3 pages
First Periodical Test in Science 10
No ratings yet
First Periodical Test in Science 10
8 pages
3rd IAHR 1975 Hanover-All
No ratings yet
3rd IAHR 1975 Hanover-All
618 pages
An Introduction To QSAR Methodology
No ratings yet
An Introduction To QSAR Methodology
24 pages
Unit 5.1
No ratings yet
Unit 5.1
137 pages
15 Chapter 6
No ratings yet
15 Chapter 6
26 pages
QSAR and Drug Design: C Omp Ounds + Biological Activ Ity
No ratings yet
QSAR and Drug Design: C Omp Ounds + Biological Activ Ity
32 pages
Quantitative Structure
No ratings yet
Quantitative Structure
4 pages
Edusar
No ratings yet
Edusar
4 pages
Introduction To QSAR Methodology
No ratings yet
Introduction To QSAR Methodology
24 pages
QSAR - Hansch Analysis and Related Approaches in Drug Design
No ratings yet
QSAR - Hansch Analysis and Related Approaches in Drug Design
39 pages
An Importance and Advancement of QSAR Parameters in Modern Drug Design: A Review
No ratings yet
An Importance and Advancement of QSAR Parameters in Modern Drug Design: A Review
9 pages
D Cent Qsar
No ratings yet
D Cent Qsar
10 pages
Ann Qsar
No ratings yet
Ann Qsar
9 pages
QSAR Lecture
No ratings yet
QSAR Lecture
75 pages
2 D QSAR
No ratings yet
2 D QSAR
75 pages
Practica-III-2 - Um Curso Laboratorial em Química Medicinal Introduzindo A Modelagem Molecular
No ratings yet
Practica-III-2 - Um Curso Laboratorial em Química Medicinal Introduzindo A Modelagem Molecular
24 pages
Medicinal Chemistry and The Molecular Operating Environment (MOE) : Application of QSAR and Molecular Docking To Drug Discovery
No ratings yet
Medicinal Chemistry and The Molecular Operating Environment (MOE) : Application of QSAR and Molecular Docking To Drug Discovery
18 pages
Drug Discovery
No ratings yet
Drug Discovery
11 pages
International Journal of Research and Development in Pharmacy and Life Sciences
No ratings yet
International Journal of Research and Development in Pharmacy and Life Sciences
9 pages
Chemrj 2017 02 03 170 181
No ratings yet
Chemrj 2017 02 03 170 181
12 pages
Chorghade Mukund For QSAR
No ratings yet
Chorghade Mukund For QSAR
3 pages
Approaches to drug design
No ratings yet
Approaches to drug design
26 pages
QSAR
No ratings yet
QSAR
23 pages
UNIT 3.docx
No ratings yet
UNIT 3.docx
9 pages
Qsar Application in Drug Design
No ratings yet
Qsar Application in Drug Design
13 pages
Stevens' Handbook of Experimental Psychology and Cognitive Neuroscience, Methodology
From Everand
Stevens' Handbook of Experimental Psychology and Cognitive Neuroscience, Methodology
Wiley
No ratings yet
Roy r2m
No ratings yet
Roy r2m
12 pages
Desain Obat: Lia Puspitasari, M.Si.,Apt
No ratings yet
Desain Obat: Lia Puspitasari, M.Si.,Apt
31 pages
Drug Discovery by Design - QSAR
No ratings yet
Drug Discovery by Design - QSAR
143 pages
Expt 6. QSAR
No ratings yet
Expt 6. QSAR
4 pages
Lead Optimization PDF
No ratings yet
Lead Optimization PDF
22 pages
A Dftbased Qsars Study of Benzimidazoles Drugs Derivatives
No ratings yet
A Dftbased Qsars Study of Benzimidazoles Drugs Derivatives
8 pages
Machine Learning. Supervised Learning Techniques and Tools: Nonlinear Models Exercises with R, SAS, Stata, Eviews and SPSS
From Everand
Machine Learning. Supervised Learning Techniques and Tools: Nonlinear Models Exercises with R, SAS, Stata, Eviews and SPSS
César Pérez López
No ratings yet
Medicinal Chemistry Unit 5
No ratings yet
Medicinal Chemistry Unit 5
98 pages
17 Mayuresh QSAR-A NOVEL TOOL IN DRUG DESIGN
No ratings yet
17 Mayuresh QSAR-A NOVEL TOOL IN DRUG DESIGN
14 pages
Exercises of Statistical Inference
From Everand
Exercises of Statistical Inference
Simone Malacrida
No ratings yet
Qsar 1
No ratings yet
Qsar 1
34 pages
MODULE 05
No ratings yet
MODULE 05
112 pages
QSAR Study and Molecular Design of Open-Chain Enaminones As Anticonvulsant Agents
No ratings yet
QSAR Study and Molecular Design of Open-Chain Enaminones As Anticonvulsant Agents
15 pages
Substantive Theory and Constructive Measures: A Collection of Chapters and Measurement Commentary on Causal Science
From Everand
Substantive Theory and Constructive Measures: A Collection of Chapters and Measurement Commentary on Causal Science
Mark Everett Stone
No ratings yet
Pharmaceutical Chapter 1
No ratings yet
Pharmaceutical Chapter 1
11 pages
Qsar by Hansch Analysis: Faculty of Pharmaceutical Sciences, Maharshi Dayanand University, Rohtak
No ratings yet
Qsar by Hansch Analysis: Faculty of Pharmaceutical Sciences, Maharshi Dayanand University, Rohtak
5 pages
Molecular Modeling
No ratings yet
Molecular Modeling
2 pages
Unit 2 QSAR
No ratings yet
Unit 2 QSAR
23 pages
Qsar Study of Rabbit Aortic Angiotensin II Antagonists Compounds Using Different Descriptors
No ratings yet
Qsar Study of Rabbit Aortic Angiotensin II Antagonists Compounds Using Different Descriptors
6 pages
Drug DESIGN Mod
No ratings yet
Drug DESIGN Mod
10 pages
QSAR
No ratings yet
QSAR
16 pages
Geary 5
No ratings yet
Geary 5
12 pages
Medicinal Chemistry 2
No ratings yet
Medicinal Chemistry 2
20 pages
Annual Drug Data Report Vol-1 1971
50% (2)
Annual Drug Data Report Vol-1 1971
228 pages
1-s2.0-S2468111324000409-main
No ratings yet
1-s2.0-S2468111324000409-main
35 pages
Fundamentals of Modern Mathematics: A Practical Review
From Everand
Fundamentals of Modern Mathematics: A Practical Review
David B. MacNeil
No ratings yet
Analytical Methods of Optimization
From Everand
Analytical Methods of Optimization
D. F. Lawden
No ratings yet
QSAR
No ratings yet
QSAR
25 pages
Radial Basis Networks: Fundamentals and Applications for The Activation Functions of Artificial Neural Networks
From Everand
Radial Basis Networks: Fundamentals and Applications for The Activation Functions of Artificial Neural Networks
Fouad Sabry
No ratings yet
Structure Activity Relationship
No ratings yet
Structure Activity Relationship
37 pages
Qsar New 1
No ratings yet
Qsar New 1
21 pages
Si V3
No ratings yet
Si V3
78 pages
Sulfanilamide
No ratings yet
Sulfanilamide
2 pages
Correlation and Regression: Six Sigma Thinking, #8
From Everand
Correlation and Regression: Six Sigma Thinking, #8
Sumeet Savant
5/5 (1)
Aromatic Acid
No ratings yet
Aromatic Acid
61 pages
Komputasi Pertemuan 4
No ratings yet
Komputasi Pertemuan 4
63 pages
Joints and Electrical Final Sub Circuits
No ratings yet
Joints and Electrical Final Sub Circuits
25 pages
iCFDR NGO Brochure
No ratings yet
iCFDR NGO Brochure
2 pages
KL_Contractor Procurement Capability Maturity Model (CMM)
No ratings yet
KL_Contractor Procurement Capability Maturity Model (CMM)
5 pages
Webauth TR
No ratings yet
Webauth TR
24 pages
Risk of Open Holes
No ratings yet
Risk of Open Holes
4 pages
Design Thinking Process Poster PDF
No ratings yet
Design Thinking Process Poster PDF
1 page
What Is Afforestation
No ratings yet
What Is Afforestation
6 pages
2 0 0 0 / 1 / 2 R C 3 8 - A I R F o R C e 1 / 2: Mountain Bike Hardware - Manual Parts List January 2003
No ratings yet
2 0 0 0 / 1 / 2 R C 3 8 - A I R F o R C e 1 / 2: Mountain Bike Hardware - Manual Parts List January 2003
1 page
Hydro Resources Contractors Group Corp. Vs NIA (441 SCRA 614)
No ratings yet
Hydro Resources Contractors Group Corp. Vs NIA (441 SCRA 614)
22 pages
Gravity Circuit Optimisation
No ratings yet
Gravity Circuit Optimisation
10 pages
TOPIC 4 DCC30103 - Contruction of Rigid Pavement
0% (1)
TOPIC 4 DCC30103 - Contruction of Rigid Pavement
23 pages
Task
No ratings yet
Task
44 pages
Web Programming Lab Manual 26 May
No ratings yet
Web Programming Lab Manual 26 May
26 pages
FEKO. Script Examples
No ratings yet
FEKO. Script Examples
182 pages
POM Unit 4
No ratings yet
POM Unit 4
16 pages
Prolongation Cost.
No ratings yet
Prolongation Cost.
3 pages
Unit 2
No ratings yet
Unit 2
51 pages
World Intellectual Property Organization
No ratings yet
World Intellectual Property Organization
11 pages
Basic Computer
93% (14)
Basic Computer
44 pages
TORRES de Lima Vs City of Manila
No ratings yet
TORRES de Lima Vs City of Manila
2 pages
Sales Confirmation: Alpha Trading S.P.A. Compagnie Tunisienne de Navigation
No ratings yet
Sales Confirmation: Alpha Trading S.P.A. Compagnie Tunisienne de Navigation
1 page
03 - Competitor Analysis and Interfirm Rivalry
No ratings yet
03 - Competitor Analysis and Interfirm Rivalry
36 pages
CSCI262/CSCI862 System Security Spring 2021 Assignment 3 (12 Marks, Worth 12%)
No ratings yet
CSCI262/CSCI862 System Security Spring 2021 Assignment 3 (12 Marks, Worth 12%)
3 pages
Study 14
No ratings yet
Study 14
18 pages
Spade Terminals (DIN 46340) : ZB10 ZB12 - ZB14
No ratings yet
Spade Terminals (DIN 46340) : ZB10 ZB12 - ZB14
1 page
PAC Meeting February 1 2011
No ratings yet
PAC Meeting February 1 2011
2 pages

QSAR Statistical Concepts

Uploaded by

QSAR Statistical Concepts

Uploaded by

S.

Statistical concepts in QSAR.

Computational chemistry represents molecular structures as a numerical models and

A QSAR generally takes the form of a linear equation:

Biological Activity = Const + (c1×P1) + (c2×P2) + (c3×P3) + ...

TABLE 1: Capsaicin Analogs Activity Data

GRAPH 1: Capsaicin Analogs Activity Data.

• binding data measured with sufficient precision to distinguish between compounds;

∆G0 = - 2.3RTlogK = log 1/[S]

TABLE 2: Capsaicin Analogs Transformed Data

GRAPH 2: Capsaicin Analogs Transformed Data

TABLE 3: Capsaicin Analogs Parameter Values

GRAPH 3: Capsaicin Analogs Parameter Values

Log EC50 = 0.764 - (0.817)π

TABLE 4: Capsaicin Analogs Calculated Values

GRAPH 4 Capsaicin Analogs Predicted Versus Actual EC50 Values

The calculation is carried out as follows:

Original Variance = (1.07 - 0.40)2 + (0.09 - 0.40)2 + ...

Variance around the line = (0.28)2 + (- 0.12)2 + (- 0.36)2 + ...

Regression Variance = Original Variance - Variance around the line

r2 = Regression Variance/Original Variance

GRAPH 5 Capsaicin Analogs Derivation of r2 values

Log EC50 = 0.762 - (0.819)π + (0.011)MR

1. Statistical significance of the regression coefficient.

You might also like