Database TCM
Database TCM
WANG JIFENG
2003
Founded 1905
BY
WANG JIFENG
MASTER OF SCIENCE
2003
Acknowledgment
Associate Professor Chen Yu Zong from Computational Science for his patient
research process.
Sincere gratitude is also expressed to Dr. Cai, Dr. Li, Xue Ying for their patient
Chanjuan and Lixia, who are labmates as well as friends, for being ever so willing to
share with me their valuable advice on projects, and for sharing with my joy and
I would like to thank Ms.Lucee, Ms Lindah, Ms. Hwee sim, Ms Elaine and Ms.Wei
Most of all, I am eternally grateful to my parents, my GF, for supporting me, and for
study.
Wang Jifeng
May 2003
Table of Contents
Content Page
List of Tables I
List of Figures II
Summary III
Chapter 1: Introduction 1
Studying TCM 18
2.1 Introduction 24
Prescription Formulation 36
3.1 Introduction 36
4.2.3 Scoring 78
Proteins 79
4.3 Results 84
4.4 Discussion 92
4.5 Conclusion 98
Chapter 5: Conclusions 99
References 101
List of Tables and Figures
Tables Page
Each Group 51
I
14. Sensitivity, Specificity and Overall Accuracy 66
Formulae) 69
Figures Pages
of Serenoa repens 83
II
Summary
Traditional Chinese medicine (TCM) has been used in the treatment of a variety of
Increasing effort is being made towards scientific proof, clinical evaluation and
contains the available information about all major aspects of TCM, including herbal
application.
have been widely employed in biology. Support Vector Machine (SVM), based on
statistical learning theory, is such a method that has been used in a wide range of real-
world problems such as text categorization, cancer diagnosis, glaucoma diagnosis, and
microarray gene expression data analysis. In this study, SVM is used to facilitate the
study of TCM formulae. The results indicate the capability of SVM in recognizing
non-effective formulae and it may provide some helpful hints for herbalist doctors to
several potentially effective formulae from the hundreds of randomly mixed formulae.
It is unclear whether these formulae have the therapeutic value. The method is
expected to facilitate the prescription of new and novel TCM formulae as well as the
III
validation of existing TCM formulae while more and more formulae are under
scientific studies.
number of active compounds have been isolated from these herbs and their clinical and
method, is employed to study the molecular mechanism of medicinal herbs. This study
provides the potential targets of a medicinal herb Serenoa repens in the treatment of
compounds in the extract. Besides these interactions, other bindings between particular
compounds and protein targets have not been proven by experiments. It provides a new
cost tool, this method may find application in systematic study of the molecular
IV
Chapter 1
Chapter 1: Introduction
Traditional Chinese Medicine (TCM) has been used for thousands of years [1-5].
At least it has a recorded history dating back over 2,000 years. Among numerous
legends about the origins of traditional Chinese medicine, stories about three legendary
Historians believe that Shen Nong and Fu Xi were early tribal leaders. Fu Xi is
regarded as a cultural hero who developed the trigrams of Yi Jing. Ancient books said
‘Fu Xi drew the eight trigrams, and created nine needles’. Shen Nong, the legendary
emperor who lived 5000 years ago, is regarded as the ‘Divine Farmer’ by the Chinese
people because of his attribution as the founder of herbal medicine. To test and analyze
the individual effects of different plant medicine, Shen Nong ingested them by himself.
It was said that Shen Nong tested over one hundred herbs including 70 toxic
substances in one day in order to find some drug to get rid of people's pain form
illness.
Huang Di Nei Jing (Yellow Emperor's Cannon of Internal Medicine) is the first
800 BC and 200 BC [6]. This book summarized and systematized the previous
1
Chapter 1
experience of treatment and theories of medicine, such as the meridian theory, as well
and acupuncture etc. It was regarded as the foundation for the theories of Chinese
medicine.
During the Zhou dynasty, the theory of TCM was developing very fast. The most
Yin and Yang [7], the five elements, the pathogenic factors of external environment as a
The basic theories of acupuncture were established and stone needles became
obsolete, being replaced by metal needles. At the time of the Spring and Autumn
Warring States Period, one of the most important issues in the development of TCM
was the usage of the pulse for diagnosis. Bian Que [8], a very famous doctor/physician
was the first man in the world to use this technique. He was reputed to be an excellent
paediatrics for the treatment of all kinds of illnesses. Bian Que also recorded his
experiences in the book Nan Jing (The Classic of Difficult Issues), which developed
and explained the fundamental and difficult parts of Huang Di Nei Jing.
During this Period another famous classic book Shen Nong Ben Cao Jing (Shen
Nong's Classic of Materia Medica) [9] was written. This book recorded and described
the characters of about 365 herbs and was regarded as the first pharmacopoeia of
2
Chapter 1
The Han dynasty, which lasted almost four and a half centuries, was a period of a
thriving Chinese culture. TCM was well developed as well. Zhang Zhongjing (150-219
A.D.), one of the most famous herbalist doctors, was reputed for his remarkable
medical skill and his well known medical masterpiece Shang Han Lun (Treatise on
Febrile Diseases) [10], which was used as a standard reference work for traditional
Chinese medicine, including moxibustion, needling and herbal medicine. So far his
theory and prescriptions are still of great practical value. In this period there also was
A.D). He invented the use of anaesthesia called Ma Fei San [11] to reduce the pain of
patients who was suffering surgery. He also furthered the knowledge of anatomy. He
was the first person who used narcotic drug in the world and his skill in this field was
Li Shizhen was considered to be China's greatest scholar in TCM and made great
contribution to the progress of TCM. He spent forty years to collect and taste herbal
medicines and wrote down in his most well-known Chinese herbal book Ben Cao
Gang Mu (Herbal Systematics) [12,13], which contained 1892 different herbs (with
1110 drawings), and was divided into 6 sections, 52 scrolls and 60 different categories
and also included 11,096 prescriptions, for treating hundreds of illnesses, ranging from
the common cold to drunkenness and food poisoning. Others such as Wang Shuhe,
Huang Fumi, Ge Hong, Chao Yuanfang, Sun Simiao and Wa ng Weiyi also wrote
3
Chapter 1
Indeed, TCM has a long history, and in what is regarded as a breakthrough, the
which can be effectively treated with acupuncture. Today, there are many hospitals in
China, Japan, and the other Asia countries that practice TCM exclusively, and others
that combine Eastern and Western healing methods. One of the main reasons that TCM
is still used may be their relatively low side effects, compared with western medicine.
Another reason may be that it is used as a last resort, when Western medicine is too
toxic or unable to provide the expected benefit [14]. In recent years, the effectiveness
of TCM has been gaining popularity in US and European countries, such as Germany,
France and UK [2,15]. However, there are also negative attitude toward TCM. Some
westerners think that TCM is unscientific in its understanding of the human body and
the nature of disease and its treatment. They believe that the lack of quality control and
the absence of scientific and clinical proof of their effectiveness will impede the
As the most important parts of TCM, medicinal herbs and the prescriptions of
multiple herbs, which will be discussed in the next chapter, were given much attention
in the development of TCM [4,5]. In TCM, people think that the actions of herbs in
treatment of diseases is through rectifying the balance of Yin or Yang, and then
consequently helping the body restore its normal physiological functions. Different
herbs have different characters and functions. That is why different herbs are used
when curing different diseases. To understand the mechanism of them and use them
4
Chapter 1
properly, it is necessary to study and explain their characters. Ancient experts had,
from the TCM viewpoint, summarized those characters, which include drugs'
Properties and Flavors, Meridians of herbs and Toxicity property, etc. Based on the
theories of Yin and Yang, Viscera, Channels and Collaterals, and treatment principles of
traditional Chinese medicine, the information has been summed up throughout a long
Each herb has its own properties and flavors. Generally, there are four typical types
of properties, that is, cold, hot, warm or cool [19]. These properties are experientially
summarized according to the actions of the herbs on the human body. Herbs that cure
or reduce heat syndrome (Yang syndrome) have a cold or cool property, whereas herbs
that cure or reduce cold syndrome (Yin syndrome) have hot or warm property. Cold or
cool properties are quite different from warm or hot. People think that cold and cool
are similar, and so are warm and hot. The difference is merely the action abilities. Cold
is relatively ‘stronger’ than cool and hot is ‘stronger’ than warm. Cool- or cold
-natured herbs are thought to have the effects of clearing heat, purging fire, removing
toxic substances, and nourishing Yin, so they are usually used to cure heat syndromes.
On the contrary, warm and hot –natured herbs are believed to have the effects of
dispersing cold, warming up the interior, supporting Yang, and treating collapse, and
are therefore used to treat cold syndromes. In addition to the four properties mentioned
above, there is the fifth property: mild. A mild- natured herb can be used for either hot
or cold syndromes.
5
Chapter 1
Flavor of herbs is given partly by their tastes. Sometimes they are given according
to the actions of herbs rather than tastes. Therefore, the flavors of some herbs are often
different from their true tastes. There are a total of seven flavors, including pungent,
sweet, sour, bitter, salty, tasteless and astringent [19]. The first five ones are the basic.
Herbs with different flavors usually show different pharmacological and therapeutic
actions, while the same flavor comes out the similar effects. According to Yin and Yang
theory, herbs with pungent, sweet or tasteless flavor have the attribution of Yang and
the ones with sour, bitter or salty flavor, of Yin. The effects of pungent herbs are to
disperse exopathogens from the body and promote the circulation of the vital energy
and blood. Therefore, these herbs, such as Herba Ephedrae (Ma Huang), Radix
Aucklandiae (Mu Xiang), are often used in treatment of superficial and mild illnesses.
The effects of sweet herbs are to nourish, replenish, or enrich the function of the
organs, to normalize the function of the stomach and spleen, to harmonize the
properties of different herbs and to relieve spasm and pain, etc. They are usually
Radix Codonopsis pilosulae (Dang Shen), Radix Rehmanniae Preparata (Shu Di) and
Radix Glycyrrhizae (Gan Cao). The effects of sour herbs are to induce astringency and
arrest discharge. They are often used to treat sweating. For example, Fructus
Schisandrae (Wu Wei Zi), a very important adaptogen can be used to regulate body
functions and increase the organism’s ability to deal with stress. Herbs with bitter
flavor have the effects of clearing heat, purging fire, sending down the adverse flow of
Qi to treat cough and vomiting, relaxing the bowels, eliminating dampness, etc. Such
6
Chapter 1
herbs are mostly used for syndromes of pathogenic fire, cough with dyspnea, vomiting,
constipation due to heat of excess type, damp- heat syndrome, or cold-damp syndrome
and other syndromes. Herbs with salty flavor have the effects of relieving constipation
by purgation, and softening and resolving hard mass. They are used in the treatment of
dry stool and constipation, scrofula, goiter and mass in the abdomen. Two examples
are Concha Arcae (Wa Leng Zi) and Natrii Sulfasl (Mang Xiao). The effects of
tasteless herbs are to excrete dampness and induc e diuresis. Therefore these herbs,
such as Polyporus umbellatus (Zhu Ling) and Poria (Fu Ling), are often used for
edema and dysuria. Astringent herbs have similar actions as those sour herbs, such as
Os Draconis (Long Gu), Concha Ostreae (Mu Li) and Halloysitum Rubrum (Chi Shi
Zhi).
Property Effects
Note: Cool and cold herbs do overlap, as do hot and warm herbs.
7
Chapter 1
Herbs that possess the same flavors and properties generally have similar effects.
But if only one property or flavor is the same, their actions may be quite different. For
example, both Rhizoma Coptidis (Huang Lian) and Radix Rehmanniae Preparata
(Sheng Di Huang) are cold, however the former is bitter and the latter is sweet. It has
been well known about the effects of these two herbs. Rhizoma Coptidis has the effects
of clearing heat and drying dampness and is therefore used for damp-heat syndrome,
while Radix Rehmanniae Preparata has the effects of clearing heat and nourishing Yin
and is used for the condition of consumption of Yin due to febrile diseases. Table 1 and
Table 2 give the four typical properties and five typical flavors, respectively, and the
corresponding effects.
Flavor Effect
8
Chapter 1
According to Viscera, Channels and Collaterals theory, the symptoms can reflect
the organs that are not in good conditions. So ancient herbalist doctors speculated that
a herb might selectively act upon a particular part of the body and this part depended
on the corresponding symptoms that the herb can relieve. These parts of the body are
named meridians. There are twelve types of merid ians [19], including Lung, Bladder,
Spleen, Large Intestine, Stomach, Small Intestine, Liver, Pericardium, Heart, Kidney,
Gallbladder and San Jiao. The Lung is the most delicate and most exterior of all the
accumulation of Lung heat. The Bladder transforms and excretes fluids from the body. It
Cold-Damp and Damp-Heat. The Spleen is the primary source of nourishment of the
body because it governs digestion and the production of Qi. It is most susceptible to the
evil of Dampness. The Large Intestine is used to receive food from the Small Intestine,
absorb fluids and excrete feces. When excessively exposed to cold, External Cold can
invade it resulting in abdominal pain and diarrhea. The Stomach has the function to
digest and transform food to make it available to the Spleen. The state of the Stomach is
governed by the Hot-Cold nature of ingested food in rela tionship to the patient’s
constitution and the environment. The Small Intestine receives the transformation
products of food and drink from the Stomach and separate the pure from the impure. It is
also affected by the heat or coldness of ingested food and drink. The liver is the central
organ of the body, which is primarily responsible for the storage of Blood and the
9
Chapter 1
smooth distribution of Qi throughout the body. Exterior Wind can interfere with the
smooth flow of Qi and stir up the Blood stored in the Liver to exacerbate Internal Liver
Wind and cause skin rashes of sudden onset that move around the body, often seen in
viral exanthema, drug rashes and hives. The Pericardium is membranous sac filled with
serous fluid that encloses the heart and the roots of the aorta and other large blood
vessels. Disease can be caused by cold, heat and Liver Qi stagnation. The heart controls
the blood vessels and regulates the flow of blood in the body. The kidney represents
the most important energetic organ in the body with regard to the root of energy and
the will to live, develop and reproduce. It is the foundation of Yin and Yang in the body.
Fear, anxiety, shock and prolonged stress are the major emotional issues that deplete
Kidney energy. The Gallbladder is used to store and secrete bile. The Excessive
ingestion of greasy or fatty foods leads to Dampness that becomes lodged in the
Gallbladder. The emotions of anger, repressed resentment and frustration cause Heat
and Liver and Gallbladder Fire, and when mixed with Dampness, give Damp-heat.
Climatic changes of Heat and Dampness from the exterior will induce elements of
Damp-Heat in the Gallbladder. San Jiao refers to Shang Jiao, Zhong Jiao and Xia Jiao,
which are the pathway through which Qi and Jing Ye ascend or descend. Disease can
By acting on one or more particular meridians, medicinal herbs can regulate the
body to a balance status. For example, Fructus Zizyphi Jujubae (Da Zao) can tonify Qi
in the spleen and stomach. It is indicated for poor appetite and loose stool due to
weakness of the spleen and stomach. Judged by the above indications and analysis, we
10
Chapter 1
say that the herb enters the meridians of the spleen and stomach. Herba Ephedrae (Ma
Huang) can promote sweating, soothe asthma and benefit urination. It is used for fever,
chills and absence of sweating due to invasion by exogenous pathogenic wind and cold,
dysuria, edema and so on. So people believe that the herbs can enter the lung and
bladder meridians.
The meridian theory studies the physiological function and pathological change on
the meridians and the related zangfu organs. The essential functions of the meridian
system are to transport Qi and blood, to maintain conductivity and to resist invasion of
exogenous patho genic factors. The meridian system distributes to all parts of the body.
The endless circulation of Qi and blood in the meridians is responsible for the
maintenance of life and the variety of functions which support it. The meridian theory
has been the guiding principle for the clinical practice in the realms of TCM,
meridian theory with the theories of the zangfu organs and the etiology of TCM, one
can thoroughly explain both the physiological activities and pathological change s,
which take place in the body. In this manner, a theoretical basis for the principle of
established.
medicinal herbs were weakly toxic or even non-toxic. However, some herbs are toxic
11
Chapter 1
and should also be used carefully. According the difference of their toxicities, they are
classified to three classes: toxic, extremely toxic and slightly toxic [19]. Improper use
of the toxic herbs may lead to adverse effects, so prescription of toxic herbs should be
careful according to the patient’s age and the situation of the disease, etc. Nontoxic
herbs are moderate in nature and usually do not have any side effects. For example,
Fructus Zizyphi Jujubae (Da Zao) and Poria (Fu Ling) are nontoxic herbs, while Radix
Aconiti lateralis (Fu Zi) and Semen Strychni (Ma Qian Zi) are toxic.
Some toxic herbs are effective because of the other effects on the patient, so
when used in combination of other herbs in a prescription, the toxicities are expected
the contrary, the medical effects of some toxic herbs, on the critical or obstinate
Two or more herbs are often combined in order to increase or promote their
complex clinical situations and to alter their actions. Different combinations can cause
diverse therapeutic effects. Using only one herb in a formula is called “Single herb
12
Chapter 1
mutual antagonism” [20]. The details for combining herbs are classified as follows.
1. Single herb formula. The whole formula is composed of only one herb. This
formula is used when the disease is not very serious. For example, Formula ‘Du Yi Wei
Wan’, which only contains Radix Lamiophlomidis rotatae (Du Yi Wei), is used in
2. Mutual reinforcement. Two or more herbs with similar properties are combined
to reinforce their therapeutic actions. For example, Radix et Rhizoma Rhei (Da Huang)
is combined with Natrii Sulfasl (Mang Xiao) to reinforce the function of purging
downward; Gypsum Fibrosum (Shi Gao) and Rhizoma Anemarrhenae (Zhi Mu) are
3. Mutual assistance. Two or more herbs in which one or more is the principle herb
and the others play a subsidiary role are combined. For example, Radix Astragali
(Huang Qi) is combined with Poria (Fu Ling) to replenishe Qi, strengthen the spleen
and promote urination; Gypsum Fibrosum (Shi Gao) is combined with Radix
4. Mutual restraint. In the formula, the toxicity or side effects of one herb can be
reduced or eliminated by the addition of another. For example, the toxicity of Rhizoma
(Sheng Jiang).
5. Mutual counteraction. It means that one herb in the formula can counteract or
13
Chapter 1
restrain the other herb’s toxicities and side effects. In fact, mutual restraint and mutual
counteraction refer to the same effect of herb from two different aspects. The former is
focused on how to restrain the toxicity of one given herb, while the latter pays
6. Mutual suppression. In the formula, one of the herbs weakens or suppresses the
action of the others. For example, Semen Raphani (Lai Fu Zi), combined with Radix
Ginseng (Ren Shen), weakens the function of the latter in replenishing Qi.
7. Mutual antagonism. When two herbs that have no side effects are combined
together, severe side effects may result. Traditionally, there existed “eighteen
When two herbs are combined together in one formula, they may give rise to
interaction with each other. They may be mutual reinforcement, mutual assistance,
conditions of the patient, and their characters and functions. “Mutual reinforcement”
and “mutual assistance” can enhance their effects and therefore should be used as
much as possible. “Mutual restraint” and “mutual count eraction” can reduce or
eliminate toxicities and side effects of herbs and therefore can be considered when
using poisonous herbs. “Mutual suppression” and “mutual antagonism” can weaken
efficacy of herbs or make them lose their efficacy or even give rise to toxicities and
14
Chapter 1
The side effects of TCM formula can be reduced by adding one or two herbal
ingredients. But sometimes it would be harmful to the patients [20]. So the prescription
1. Contra-syndromes. Each herb or each class of herbs has its own intended
functions and effects. Diseases or syndromes other than these intended indications are
used to induce diaphoresis and relieve asthma, and its indications are affection by
cough due to obstruction of the lung Qi, but in case of spontaneous sweating due to
exterior syndrome of deficiency type or cough due to lung deficiency, its use should be
prohibited.
specific herbs. According to the ancient books and literature of TCM, there are
eighteen pairs of incompatible medicinal herbs and nineteen pairs of mutual restraining
medicinal herbs.
These herbs are mainly with strong actions or toxicities, especially Fructus Crotonis
(Ba Dou), Semen Pharbitidis (Qian Niu Zi), Herba Cirsii japonici (Da Ji) and Rhizoma
Sparganii (San Leng). The other herbs that are pungent and hot should also be used
15
Chapter 1
with caution during pregnancy, such as Semen Persicae (Tao Ren), Flos Carthami
(Hong Hua), Radix et Rhizoma Rhei (Da Huang) and Radix Aconiti lateralis (Fu Zi).
4. Food taboo. Certain types of food may also influence the action of herbs or
induce unnecessary side effects. In general, it is advisable not to eat raw, cold, greasy,
strong smelling or spicy food while taking medicine. From the historical medical
Rehmanniae (Di Huang) and Radix Polygoni multiflori (He Shou Wu) are
contraindicated with onion, garlic and turnip; Herba Menthae (Bo He) is
contraindicated with turtle meat; Poria (Fu Ling) is contraindicated with vinegar; and
Each herb has its own functions and indications. The herbs should be selected
according to their properties, flavors, meridians and toxicity properties. For example,
Herba Ephedrae (Ma Huang) is pungent, slightly bitter and warm. It is used to treat
fever, chills and absence of sweating due to an invasion of exogenous wind and cold. It
above.
according to their effects [20]. Generally speaking, the dosage of the principal
component is bigger than the others. When this formula is used for the treatment of
patients with different conditio ns, the dosage of these herbs may be changed in
accordance with specific syndrome. In addition, the dosage of those herbs that are very
16
Chapter 1
poisoning.
the body. Illness or disease always represents a state of imbalance of Yin and Yang. We
could consider this concept as the need to find the most appropriate and stable chaotic
state, with a strong attractor. Over time, TCM has elucidated methods of restoring
balance to the body, thus curing disease. The two important methods are herbal therapy
method, herbal medicine is excellent at tonification, that is, increasing the energy of a
particular organ, and balancing the energy when there is excess in one part of the body
acupuncture and herbal medicines together – acupuncture for the acute problem, and
herbs to shore up the underlying deficiency that allowed the acute disease to manifest.
Ancient Chinese herbalist doctors have developed their own language and method
for diagnosis. The diagnosis is not directly associated with a disease, but the
description of a pattern that seems to be preeminent in the whole body, which includes
Looking, Smelling, Asking, Pulse diagnosis, etc [21]. The Chinese physical exam is
able to show a predominant pattern of imbalance in the patient, which, in the Chinese
physician’s eyes, is the disease to be treated. Many dissimilar western diagnoses have
17
Chapter 1
similar Chinese patterns of disharmony, and many similar Western diagnoses have
different Chinese patterns of disharmony. In Chinese medicine there have been periods
in which one theory of harmony and disharmony dominated over another. Thus, during
one period, the Five Element Theory of disharmony reigned supreme and during
another period, the Yin- Yang and Eight Principles Theory of disharmony reigned
treating the patient ’s explicit response to his or her inner disturbance, which is going to
1.4.2 Modern Experimental Approach and Clinical Trials for Studying TCM
From the point of view of modern medicine, the effects of herbal drugs result from
the interaction between herbal ingredients and organs of body [2,5]. It is necessary to
extract these chemicals and then analyze their physiological effects on humans or their
especially those in China and Japan. Many purified compounds are then studied or
Glycyrrhizae (Gan Cao) is one of the most extensively studies Chinese herbs [23-28].
The function of Radix Glycyrrhizae is to reinforce the function of the spleen and
replenish Qi, to remove heat and counteract toxicity, to dispel phlegm and relieve
treatment of weakness of the spleen and the stomach marked by lassitude and
weakness, cardiac palpitation and shortness of breath, cough with much phlegm,
18
Chapter 1
spasmodic pain in the epigastrium, abdomen and limbs, carbuncles and sores. It is
often used for reducing the toxic or drastic actions of other drugs. Its main ingredients
are Glycyrrhizin and glycyrrhetinic acid [25,26]. Experiments show that its extract
exerts a detoxifying effect [26,27]. Another experiment shows that glycyrrhizin also
has this effect [26,27]. In addition, Glycyrrhizin has anti- inflammatory effect similar to
hydrocortisone [28]. It was found that its decoction can inhibit intestinal smooth
muscles of rabbits in vitro, but glycyrrhizin and glycyrrhetinic acid have no such
effects [29]. Thus the effects must be due to the other compounds. Radix Ginseng (Ren
Shen) being under exploration [30,31], contains more than 200 compounds. The main
ingredients are ginsenosides, essential oils and polyacetylenes [30]. Radix Ginseng
traditionally has been used as a tonic and was thought to reinforce the vital energy, to
remedy collapse and restore the normal pulse, to benefit the spleen and lung, to
promote the production of body fluid, and to calm the nerves. Experiments have shown
that Radix Ginseng can act on the central nervous system and exert significant
hypertensive effects on acute circulatory failure after heavy blood loss and decrease
lymphocyte-blastogenesis rate [29]. These effects are related to the active compounds.
For example, ginsenoside Rg1 can raise blood pressure, while Rb1 can lower it
[31,32].
As indicated in the above examples, chemical and biological studies underlined the
19
Chapter 1
The chemical profiles of an extract should be matched against the results of biological
assays in order to assure lot-to-lot consistency of what are fairly crude extracts of herbs
with high natural variability [1]. Chemical analysis, biological assays and animal
modern biomedical research, they should also be necessary prerequisites for clinical
trials. Most of these TCM clinical trials have been undertaken in China, Taiwan, and
Japan. A number of commercial treatments based on TCM such as ‘Hu Xin Dan”,
“Puerarin injection’, ‘Shen Mai injection’ and others have been introduced into the
animal models and clinical trials, have been extensively used to study the effects and
molecular mechanism of medicinal herbs and formulae. More recently, as useful tools,
To control and evaluate the quality of medicinal herbs, the fingerprint analysis
technique is set up with the rapid development of instrument analyses and computer
networks and gray relational grade cluster [35-37]. To identify similar medicinal herbs,
formula into several different parts [39-41]. The results agreed well with the
20
Chapter 1
formulae.
fast-speed and low-cost tool for facilitating the study of molecular mechanism and
records, most scientists are not familiar with traditional Chinese medicine [1].
Moreover, because the TCM theory is very different from that of western medicine,
majority of these scientists show little interest in TCM and other traditional medicines.
21
Chapter 1
To attract their attention and help them better understand the merits of TCM, it is
necessary to provide the information of TCM first. Hence, a new database, Traditional
prescription formulae, constituent herbs, known herbal ingredients and their molecular
actions. The therapeutic effects, adverse reactions, clinical indications and applications
are provided at the levels of prescription, individual herb and herbal ingredient
flavors, sites of actions, toxicity level, and therapeutic class are also given.
review and establish reliable composite formulae of TCM [1]. To facilitate such a study,
one of the most efficient method in many real-world applications [43]. Recently SVM
have been applied to a number of biological problems [44], including gene expression
data analysis and protein classification, etc. Using this method, it is expected to find
22
Chapter 1
active compounds have been isolated from these herbs and clinical and therapeutic
Serenoa repens, which is widely used in treatment of BPH [45-49]. Although the effect
of some active compounds extracted from it has been proven by in vivo or in vitro
experiments, the whole picture is still unknown. The multiple ingredients make the
process more difficult. Hopefully, the effects of the predicted therapeutic and toxicity
23
Chapter 2
2.1 Introduction
Traditional Chinese medicine (TCM) has been used in the treatment of a variety of
therapeutic actions and modulation of other factors. One or more principal constituents
assist the effect of the principle ones. The rest serve s modulation roles such as
Because of growing interest in TCM therapeutics, increasing effort has been made
towards scientific proof, clinical evaluation and molecular study of TCM [1,2,42].
Such an effort can be facilitated by making available information about all major
effects, clinical indication and application. A few specialized databases have appeared
which provide information about different aspects of TCM. For instance, The TCMD
24
Chapter 2
provides information about Chinese medicinal plants and constituent chemicals [51],
general information about the history, theory, diagnostics, and examp les of TCM
formulae.
about all major aspects of TCM. Moreover, in the existing databases, information
ingredients and their molecular actions. The therapeutic effects, adverse reactions,
medicinal herbs such as tastes, flavors, sites of actions, toxicity level, and therapeutic
In this work, Oracle 9i is selected as the platform for our database projects and data
future analyzing tasks. Oracle 9i is a typical database management software, which has
the most variety of modules and development tools, including modules for data mining
25
Chapter 2
and online analytical process, which is essential for high-end data analysis purpose.
For many years, it leads the way in indexing and query optimization technology while
it is not worse than its competitors in other aspects. Also, Oracle 9i is a fully
object-oriented database, which conforms to the world trend toward object, oriented
programming (OOP). Oracle has kept the biggest market share for years. According to
can create a set of tables (also known as relations) to reflect the inter-relationship
between the data stored in the database. The relational approach needs the support of a
create and maintain a relational database. They are designed to control data redundancy,
to restrict unauthorized access, to provide persistent storage for program objects and
data structures, to permit inference and actions using rules, to provide multiple user
Efforts are made to collect as many of these known formulae, herbs, herbal
26
Chapter 2
research into TCM and TCM herbal constituents are leading to the discovery of new
The database currently contains entries for 1343 formulae, 1325 medicinal herbs,
4468 herbal ingredients and description about molecular actions of 502 herbal
ingredients. Each entry can be retrieved via the name, alternative name, function or
clinical manifestation at the levels of prescription formula, herb and herbal ingredient
respectively. This database will be continually updated as new data are found. And the
friendly interface will be upgraded as well to meet the user’s needs adequately.
The TCMID database web interface is shown in Figure 1. Users can easily search
the database from three starting points, namely, Formula, Herb and Compound.
‘Formula’ starting point generates a new interface through which the database can be
accessed by keyword search such as Formula Chinese Names, Formula English Names,
generates another interface to access the herbal information through keyword search
such as Herb Latin Names, Herb Chinese Names, Herbal Functions, Herbal Clinical
Manifestations, etc. ‘Compound’ starting point generates the third interface to access
the chemical information through keyword search such as Compound Name, CAS No,
27
Chapter 2
etc. Searches involving any combination of these search or selection fields are also
supported.
The full text search is case insensitive and wildcards supported. In a query, a user
can specify full name or any part of the name in a text field. Wild character of ‘%’ and
‘_‘ is allowed in text field. Here, ‘_’ represents any one character and ‘%’ represents a
string of characters of any length. For example, input of ‘Ma Huang’ in the Formula
Chinese Name field finds entries containing ‘Ma Huang’ in their name, such as Ma
Huang Tang or She Gan Ma Huang Tang. On the other hand, input of ‘Ma% Tang’
finds all the entries with ‘Ma’ and ‘Tang’ in their names, such as Ma Huang Tang.
The typical query results are illustrated in Figure 2, 3 and 4. Figure 2 gives
details about the formula, including Chinese Name, Common Name, Literature
28
Chapter 2
Description, Usage and Dosage, Storage, Precaution and Modifications. From this
page, the herbal information can be obtained by clicking each individual herbal
component.
29
Chapter 2
Usage &
1-1.5 g.
Dosage
30
Chapter 2
Figure 3 gives the details about the herb, including Chinese Name, Latin Name,
English Name, Plant Source, Properties and Flavors, Meridians, Toxicity, Chemical
Literature Reference. The chemical ingredient can link to the corresponding page
containing detail chemical information. They include Chemical Name, CAS No,
molecular weight, molecular formula, downloadable structure file and so on (Figure 4).
31
Chapter 2
TCM formulae, herbs and compounds and additional knowledge them will be added.
submission interface is included in the database (Figure 5). Submitted information will
This database currently contains 1343 entries of formulae, 1325 entries of herbs
found from the literature along with 4468 entries of compounds which are extracted
from the herbs. Among these entries of formulae, 1190 entries are relatively complete.
Their herbal components, functions, clinical manifestations are collected into the
32
Chapter 2
database. However, the others lack one or more detail features, at least the information
According to their therapeutic effects, the formulae are classified into 17 Classes,
formulae for clearing heat, formulae to dispel cold, tonic formulae, astringent formulae,
blood formulae, formulae for treating wind, formulae for treating dryness, formulae for
dispelling dampness, formulae to expel phlegm, digestive formulae, and formulae for
dispelling parasitic worms. For example, formula Xin Jia Xiang Ru Yin is used to treat
Among these entries of herbs, 1090 entries are relatively complete. Their functions,
clinical manifestations are collected into the database. One or more detail features
According to their therapeutic effects, the herbs are classified into 23 Classes,
activation and stasis removal, food digestion, for calming liver and containing wind,
for dissolving dampness by flavors, for dissolving phlegm, stopping cough, and
dissolving rottenness and growing new muscles, for promoting diuresis and
penetrating dampness, for relieving exterior syndrome, for tonifying weakness, for
33
Chapter 2
warming interior, heat clearance, mind opening, purging, regulation of Qi, spirit
and vomiting promotion. In addition, each class is divided into several sub-classes.
These classifications will facilitate the researchers in collecting and comparing the
herbs which have the similar effects. For example, Semen Lablab album (Bai Bian Dou)
is used in treatment of weakness of the spleen and stomach with loss of appetite and
loose bowels, excessive leukorrhea, vomiting, diarrhea, distress in the chest and
Class ‘for tonifying weakness’ and the sub-class ‘for tonifying Qi’.
After further investigation, we found that some herb entries are the same as or very
similar with the other ones. For example, the database contains ‘Ba Yue Zha’ and ‘Yu
Zhi Zi’. These are two different Chinese name of the same herb. A link is provided in
each entry to that of the same herb under different name. Thus, user can get sufficient
information about the herb even if he only knows one name of this herb. Most of the
compounds have the downloadable structure files, which can facilitate the studies of
ingredients of herbs in the literature and books, which is a result of collective and
persistent effort over the years. It provides a new platform to facilitate mechanistic
34
Chapter 2
study and clinical evaluation of TCM. In contrast to traditional literature and books,
the database has the following advantages: reduced data redundanc y, data consistency,
easier use of data and less storage for larger information. In addition, new information
35
Chapter 3
Formulation
3.1 Introduction
[1,2,16,50]. Each component in a formula has its particular functions. Moreover, they
are mixed together in proper amounts in accordance with specific principles for
combining herbs. When giving a prescription, herbalist doctor must take into account
the characters of each herbal component in the formula, as well as the stage of the
disease and the condition of the patient. According to the different functions of these
ingredients, they are classified to four categories [15], in order of importance, Jun,
Jun (King) are one or two herbal ingredients that give the formula its basic
characteristic actions. They may not be the greatest percentage of the formula, but the
most important clinical herbal components. They play the leading role in the treatment
36
Chapter 3
of the disease. Chen (Minister) are the ingredients that strengthen the Jun’s action
aiming at the principle symptoms. They enhance, augment or broaden the effect of the
Jun. They also play important role in relieving the secondary symptoms. Zuo
(Assistant) are these parts that have different effects in different formula. Firstly, they
can help the Jun and Chen components in their activities and can also address
secondary symptoms, especially the less important symptoms. Secondly, they will
counteract or attenuate any side effects of the first two combinations. Thirdly, they will
modify the energy of the formula closer to neutral, when more potential Jun and Chen
components are used in treatment of serious diseases. Shi (ambassador) are these parts
enable all of the components to be well absorbed, transported and then reach the right
place where they will take effects, such as liver, heart and stomach. They may also be
The components in a formula are in close cooperation with one another when
exerting their due effects. Theoretically the formulae that have prove n effective are
precisely formulated in accordance with their definite aim in treatment the disease. For
Ephedrae (Ma Huang) (9g), Ramulus Cinnamomi (Gui zhi) (6g), Semen Armeniacae
amarum (Ku Xing Ren) (9g), Radix Glycyrrhizae Preparata (Zhi Gan Cao) (6g), has
the main function to promote sweating and release the exterior. In this condition, the
lung is invaded by cold; so it is necessary to use herbs with pungent flavor and warm
property to expel pathogenic factors from the lung. Therefore, Herba Ephedrae is used
as the Jun ingredient to induce perspiration for dispelling pathogenic wind and cold
37
Chapter 3
from the body and to relieve asthmatic cough by ventilating the lung and smoothing
the flow of vital energy; Gui zhi is the Chen ingredient that aids Herba Ephedrae in
diaphoresis, warms the channels and stimulates menstrual discharge, reinforces yang,
relieves palpitation, and promotes the descending of Qi; Semen Armeniacae amarum
serves as the Zuo ingredient to relieve cough and asthma by ventilating the lung, and to
relax bowels ; and Radix Glycyrrhizae Preparata serves as the Shi ingredient to
harmonize the action of all the other ingredients and direct them to their proper
channels, to reinforce Qi and promote blood circulation. Each formula must contain a
Jun ingredient. However, some formulae may lack one or more of the other categories.
For example, Ren Shen Tang (Decoction of Ginseng) has only one ingredient: Radix
Ginseng (Ren Shen). Its function is to reinforce the vital energy, to remedy collapse
and restore the normal pulse, to benefit the spleen and lung, to promote the production
of body fluid, and to calm the nerves. In some formulae, the Jun or Chen ingredient
may also serve as the Zuo or Shi ingredient. To prescribe an effective formula, the
The composition of a formula is not static. Patients suffering from the same disease
may be given different variations of the same formula. Even the same person, during
different disease stage, is given modified formula. Factors such as the disease
condition, the constitution and age of the patient, the season, and the geographical
38
Chapter 3
doctor of Qing Dynasty, Xu Lingtai, said in his book [63] that “before trying to use any
old handed-down formula, you must first determine whether all the symptoms and
signs of the patient you are treating are consistent with those described in that
prescription and whether every ingredient is good for the case. If not you must make
the proper modifications or, if it is impossible to do so, choose another one.” From
what he said, we can see that the formula components are sometimes modified. The
Jun ingredient must remain the same in the modified formula, but other ingredients are
added or deleted according to the signs and symptoms present and the stage of the
disease. For example, Formula Ke Xue Fang has the effects of clearing fire and
resolving phlegmand. But if the condition is associated with severe deficiency of Yin
and blood, Radix Ophiopogonis (Mai Men Dong), Radix Glehniae (Sha Shen) and
Radix Trichosanthis (Tian Hua Fen) should be added. If the condition is associated
with severe cough, Radix Asteris (Zi Wan), Flos Farfarae (Kuan Dong Hua) and
Semen Armeniacae (Xing Ren) should be added. If the condition is associated with
Cough with profuse blood, Bai Bu should be added. Another example is Formula Huo
Xiang Zheng Qi San, which is to release the exterior and resolve dampness. If the
condition is associated with Indigestion, Massa Fermentata medicinalis (Shen Qu) and
Semen Raphani (Lai Fu Zi) should be added. While if the condition is no chill, fever or
headache, Radix Angelicae dahuricae (Bai Zhi) and Folium Perillae (Zi Su Ye) should
be deleted. The examples show that it is not always necessary to follow the set formula.
understand that their relative amounts are also varied according to the patient’s
39
Chapter 3
have different effects. Besides, different forms of the same prescription may have
different level of intensities or actions. The major forms of the prescriptions include
decoctions, powders, pills, capsules, and honey boluses. Usually in bolus form, the
action is slow. But when a decoction of the same ingredients is used instead, the action
becomes quicker [20]. Therefore the decoction form is more appropriate than the bolus
principles, are often considered as the major means of treating diseases. Since ancient
times, TCM physicians have summarized their experiences in this field. Many
effective prescriptions have thus been tested and documented. However, people were
disappointed to find that there was no direct and reliable approach to prescribe new
formula toward particular disease. What they can do and are doing is to try to modify
and improve the existed formulae. Moreover, how the formula may be modified is also
depended on their special experiences and the constitution of the patients. Fortunately,
scientists have made progress in developing alternative approach to facilitate the study
Weiwei et al divided the herbs in one formula into several different parts according to
their characters [39-41]. These characters were quantified to generate the feature
vectors that can be recognized by computer. From the first chapter, we notice that
40
Chapter 3
different herbs have different characters and functions. That is why different herbs are
used for treatment of different diseases. To understand the mechanism of herbs and use
them properly, it is necessary to explore their characters. Ancient experts have, from
the TCM viewpoint, summarized those basic characters, which include drugs'
Properties and Flavors, Meridians and Toxicity, etc. Based on the theories of Yin and
medicine have been developed and summed up throughout a long history of medical
practice. From the ancient Chinese medicine books, we can also find that the herbs
with similar characters have similar effects, while those with different characters have
quite different effects. In Su’s studies [40,41], they analyze several formulae that have
proven effective, including Wan Shi Niu Huang Qing Xin Wan, Xiao Chai Hu Tang,
Hua Tuo Zai Zhao Wan and Nao De Sheng Pian. The results show that the
Su’s studies, introduce in the previous chapter, provide us with a method through
which the characters of herbs can be quantified. We have noticed that a TCM
Support Vector Machine (SVM), a machine learning method, for the first time, to
41
Chapter 3
SVM is one kind of learning machine based on statistical learning theory. The
basic idea of applying SVM to pattern classification is as follows: First, the input
vectors are mapped into a feature space, either linearly or non- linearly, which is
relevant with the selection of the kernel function. And then a hyperplane of the vectors
in the feature space is constructed which separates two classes (this can be extended to
multi-class). SVM training always seeks a global optimized solution and avoids
over- fitting, so it has the ability to deal with a large number of features. A complete
description to the theory of SVMs for pattern recognition is in Vapnik’s book [43].
SVMs have been employed in a wide range of real-world problems such as text
image classification and object detection [72-77], sub-storm and flood stage
gene expression data analysis [85-87]. It has been shown that SVM is consistently
When used for classification, SVMs separate a given known set of {+1, -1} labeled
training data via a hyperplane that is maximally distant from the positive samples and
negative samples (known as Optimal Separating Hyperplane, OSH), then ‘plot’ the test
negative according to the OSH. Figure 6 illustrates this process. The solid lines show
42
Chapter 3
two possible hyperplanes, each of which correctly separates the training data into two
classes. The two dashed lines parallel to the separating hyperplane show the
boundaries in which one can move the separating hyperplane without misclassification.
For most of real- world problems, seemingly not to be linearly separable, SVMs can
handle non- linear hypotheses with any Kernel function, which automatically realizes a
nonlinear mapping onto a feature space. The optimal separating hyperplane found by
the SVM in feature space corresponds to a nonlinear decision boundary in the input
space.
Let the training data of two separable classes with n samples be represented by
G G G G
( x1 , y1 ), ( x 2 , y 2 ), " , ( x n , y n ), i = 1,2," , n, where xi ∈ R N is an N dimensional space,
43
Chapter 3
G
and yi ∈ {−1,+1} is the class index. Given a weight vector w and bias b (Figure
7), the two classes can be separated by two margins parallel to the hyperplane:
G G
wT • xi + b ≥ 1, yi = +1 (1)
G G
wT • xi + b ≤ −1, yi = −1 (2)
G
where w = ( w1, w2 ,", wn )T is a vector of n ele ments. Inequalities (1), (2) can be
G G
yi ( wT • xi + b ) ≥ 1, i = 1,2,", n (3)
M
w
Hyperplane
Margin γ K K
w T ⋅ x + b = +1
−b K K
w
K wT ⋅ x + b = 0
K K
wT ⋅ x + b = −1
Figure 7. Definition of Hyperplane and Margin. The circular dots and square dots
44
Chapter 3
M H M
H”
H’
training data.
OM
OSH
OM
45
Chapter 3
G G
wT • xi + b = 0 (4)
G G G G
G xT • w xT • w
γ ( w, b) = Gmin G − { max
G G (5)
{ x | y = +1} || w || x | y = −1} || w ||
The OSH can be found by maximizing the above distance or by minimizing the norm
G
of w under inequality constraints (3), and
G 2
γ max = γ ( w0 , b0 ) = G (6)
w0
The saddle point of the following Lagrangean gives solutions to the above
optimization problem:
G GT G n GT G
L( w, b,α ) = 12 w • w − ∑ α i[ yi ( w • xi + b) − 1] (7)
i =1
G
respect to w and b vanishes, which gives the following conditions by the calculation
46
Chapter 3
∂L ∂L
of G = 0 and = 0:
∂w wG =wG 0 ∂b b =b 0
G n
G
w0 = ∑ α i yi xi (8)
i =1
∑α y i i =0 (9)
i =1
n n n
G G
L(α ) = ∑ α i − 12 ∑∑α iα j yi y j ( xiT • x j ) (10)
i =1 i =1 j =1
n
under the constrains ∑α y i i = 0 and α i ≥ 0, i = 1,2,", n.
i =1
The points situated at the two optimal margins have non-zero coefficients α i
among the solutions to Eq.(10), and are called Support Vectors (SV). The bias b0 can
be calculated as follows:
G G G G
b0 = − 12 ( G min w0T • xi + Gmax wT0 • xi ) (11)
{ xi | y i = +1} { x i | yi = −1}
After determination of support vectors and bias, the decision function that
n
GT G GT G
f ( x ) = sign[ ∑α i yi xi • x + b0 ] = sign[∑ α i yi xi • x + b0 ] (12)
i =1 SV
As real- world problems are usually nonlinear, the following approach has been
47
Chapter 3
G
introduced into SVM to deal with these problems. The original training data x in the
input space X is projected into a high-dimensional feature space F via a Mercer kernel
operator K [67], and the OSH is constructed in this feature space. Such a procedure is
G G
f ( x ) = sign[ ∑α y K ( x , x ) + b ]
i∈{ SV }
i i i 0 (13)
∞
G G G G
K ( x, y) = ∑α m φ ( x ) T • φ ( y ), αm ≥ 0
m =1
G G G G G G G G
∫∫ K ( x , y ) g ( x ) g ( y ) dxdy > 0, ∫g ( x )dx < ∞
2
(14)
G G G G
K ( x , y ) = φ ( x )T • φ ( y ) (15)
n n n
G G n
L(α ) = ∑α
i =1
i − 1
2 ∑∑α α
i =1 j =1
i j yi y j K ( xi , x j ) − λ ∑α i yi
i =1
(16)
n
subject to ∑α y i i = 0 and α i ≥ 0, i = 1,2,", n.
i =1
G G
f ( x ) = sign[ ∑α y K ( x , x ) + b ]
i∈{ SV }
i i i 0 (17)
where
48
Chapter 3
G G G G
b0 = − 12 { Gmin (
{ x i | yi = +1}
∑α
j ∈{ SV }
j y j K ( xi , x j )) + Gmax (
{ xi | y i = −1}
∑α
j ∈{ SV }
j y j K ( xi , x j ))} (18)
X F φ(x)
X X
φ(o) φ(x)
X O
φ φ(x)
O
O φ(o) φ(x)
O
O φ(o)
X X φ(o) φ(x)
φ(o)
project the training data nonlinearly into a higher-dimensional feature space via φ .
kernels or tensor products of kernels [90]. In this work, Gaussian RBF is chosen as the
kernel function.
49
Chapter 3
In this study, 644 TCM formulae (from TCMID database, Chapter 2) are used to
test the feasibility of this method. Based on the number of herbs in a formula, these
formulae are classified to 7 groups, from Group 1 to Group 7, in which the number of
samples. 1000 negative samples, which contain the same number of herbs with the
positive samples in the same group, are randomly generated. These negative samples
compiled complexes of multiple herbs. The total numbers of samples in each group are
given in Table 3. Both positive and negative formulae in each group are further
randomly divided into two parts with even number of entries, respectively (If the
number of the positive formulae is an odd number (2n+1), they are divided into two
parts with n and n+1 formulae, respectively.). One part is used as the training set,
while the other as the testing set. Thus, from Group 1 to Group 7, the training sets
contain 42, 41, 46, 42, 50, 47 and 55 positive samples, respectively and 500 negative
samples which are randomly selected. The testing sets contain 42, 40, 46, 42, 50, 46
50
Chapter 3
must construct the vectors of each herb, and then, construct the feature vectors of the
formula according to some rules that can compile the vectors of all herbs. In this study,
the feature vectors are simply the concatenation of the vectors of all herbs. I will
Formula
51
Chapter 3
* If the herb has mild property, the value of each element should be added 0.25.
** If the herb has tasteless flavor, the value of this element should be added 0.5
*** If the herb has astringent flavor, the value of this element should be added 0.5.
52
Chapter 3
Since the characters of a herb are essential to its effects, the feature vectors should
be constructed according to these characters. The characters are quantified and then the
values of them are used to compose the feature vector. The vector contains 22 elements,
which are the quantified basic characters of herbal properties, flavors, toxicity property
and meridians that the herb can enter. Table 4. gives probable value for each element of
the feature vector and also the reason for the choice of the value.
As we have known from previous chapters, each herb has its own properties and
flavors. There are four typical types of properties, that is, cold, hot, warm or cool.
These four characters are used as four elements of feature vector. If the herb has one of
these properties, a value ‘1’ is assigned to the element. Otherwise, a value ‘0’ is
assigned to this element. The choice of value ‘1’ is to simplify the construction of the
vector.
There is also another property: mild. If the herb has this property, the value of the
elements, which represent Cold, Hot, Warm and Cool, add a value 0.25. It is because
this property does not belong to any of these four typical ones. The value 0.25 is
chosen mainly according to our experiences. So are the other values, which are chosen
to describe the elements in the following paragraphs. The discussion of the choice of
the value is given in Chapter 3.6. Some herbs are extremely cold and some extremely
hot, the value ‘1.2’ is assigned to the elements, which represent cold and hot,
respectively. To those herbs with minor cold, minor warm or minor cool, a value ‘0.8’
53
Chapter 3
1 Cold 0
2 Hot 0
3 Warm 1
4 Cool 0
5 Pungent 1
6 Sweet 0
7 Sour 0
8 Bitter 0.8
9 Salty 0
10 Toxic 0
11 Lung 1
12 Bladder 1
13 Spleen 0
14 Large Intestine 0
15 Stomach 0
16 Small Intestine 0
17 Liver 0
18 Cardiovascular 0
19 Heart 0
20 Kidney 0
21 Gallbladder 0
22 San Jiao 0
54
Chapter 3
There are five major types of herbal flavors, including pungent, sweet, sour, bitter
and salty. These characters are also used as elements of the feature vector. If the herb
has one of these flavors, a value ‘1’ is assigned to this element. If the flavor is minor
pungent, minor sweet, minor sour, minor bitter or minor salty, a value ‘0.8’ is assigned
to the corresponding element. There are also another two different flavors: tasteless
and astringent. For tasteless, a value ‘0.5’ is added to the element ‘sweet’ and for
symptoms can reflect the organs that are not in good conditions. So ancient Chinese
herbalist doctors speculated that a given herb may selectively act upon a particular part
of the body and this part depends on the corresponding symptoms that the herb can
relieve. So the vector should also reflect the properties of meridians that herbs affect.
There are a total of 12 meridians, including Lung, Bladder, Spleen, Large Intestine,
Stomach, Small Intestine, Liver, Pericardium, Heart, Kidney, Gallbladder and San Jiao.
The quantified characters of these meridians are also used as elements of the feature
vector. If one herb can enter a meridian, then a value ‘1’ is assigned to the element that
represents for the meridian. ‘0’ is assigned to the other elements, which are not related
with this herb. Table 5. gives the feature vector of Herba Ephedrae (Ma Huang), a
useful herb which has warm properties, pungent and slightly bitter flavors, and can
enter Lung and Bladder meridians. The related information can be found in TCMID
database. So does the information of the other herbs, on which the feature vectors are
built.
After building the feature vector of each herb in a formula, we can construct that of
55
Chapter 3
the formula. In this study, I connect the elements of all herbs’ vector together to
generate a bigger vector. For example, the formula, which contains 4 herbs, has the
feature vector with 88 elements, while the vector of a formula (contains 5 herbs) has
110 elements. Because of the difference of feature vector, we separated the formulae
quantity of True Positives (TP), True Negatives (TN), False Positives (FP), and False
Sensitivity=TP/(TP+FN) (19)
Specificity=TN/(TN+FP) (20)
TP + TN
Q= (21)
TP + TN + FP + FN
are also useful in assessing the prediction accuracy [93]. To estimate performance, we used
three objective indices: accuracy, specificity and sensitivity. The sensitivity as defined
defined above is the probability of correctly predicting a negative example. All these
quantities are used in the evaluation of SVM classification of formulae in this work.
56
Chapter 3
The SVM program can automatically rearrange the training and testing sets.
During the training process, the program can calculate the minimum number of the
positive samples or negative samples in the training set, which is necessary to reach an
optimal training result. If the number of the positive samples or negative samples in
the training set is higher than the minimum, the rest samples are automatically
transferred to the testing set. On the contrary, if the number is lower than the minimum,
some samples in the testing set are taken into the training set and then re-start the
training process until an optimal training result is achieved. Tables 6 to12 give the
training and testing sets of positive formulae of each groups after the optimal training
results are achieved and Table 13 summarizes the number of positive and negative
samples in the training set and testing set of each group after the training process. The
results show that the number of the positive samples in the training sets of these groups
increase (see Table 3). This is perhaps because the numbers of positive samples in the
original training set is not enough to cover sufficiently diverse range of positive
samples. So a number of positive samples in initial testing set are put into the training
set. On the other hand, except Group 1, in the other six groups, the number of the
negative samples in the training set decrease from 500 to a certain number between
210 and 290. The decrease of the number of negative samples in the training set
57
Chapter 3
Table 6. List of Positive Formulae in the Training and Testing Set of Group 1
Bai Tou Weng Tang Huang Teng Nan Chang Sheng Ma Huang Qi Tang
Bao Chi San Tang Shi Zao Tang
Bei Xie Fen Qing Yin Huo Xue Tong Mai Pian Si Jun Zi Tang
Bing Lang Chen Qi Tang Ji Jiao Li Huang Wan Si Miao Yong An Tang
Bing Peng San Jia Wei Dang Gui Bu Xue Si Ni San
Cai Shi Tui Re Tang Tang Si Sheng Wan
Cang Er Zi San Jian Shen Fang Si Wei Huang Qi Xuan Fu
Da Cheng Qi Tang Jiu Fen San Pian
Da Jian Zhong Tang Ling Gui Zhu Gan Tang Si Wei Tu Mu Xiang San
Da Xian Xiong Wan Lou Xing Chen Qi Tang Si Wu Tang
Dao Chi San Ma Huang Tang Tao Hua San
Ding Xiang Shi Di Tang Ma Huang Xing Ren Yi Yi Tong Xie Yao Fang
Ding Zhi Wan Gan Cao Tang Wei Jing Tang
Fu Zi Xie Xin Tang Ma Xin Shi Wei Tang Wu Long Jian Ji
Gan Cao Gan Jiang Ling Ma Xing Gan Shi Tang Wu Zhu Yu Tang
Zhu Tang Meng Shi Gun Tan Wan Xi Jiao Di Haung Tang
Gan Mao Tui Re Chong Ji Mu Fang Ji Tang Xiang Su San
Gan Mao Tui Re Ke Li Qian Yin Tang Jiang Xiao Er Qi Ying Wan
Gui Zhi Gan Cao Long Gu Qian Zheng San Xiao Zhi Shuan
Mu Li Tan Qing Bing San Xie Bai San
Gun Tan Wan Qiong Yu Gao Xin Shen Yin
He Zi Pi San Re Yan Ning Ke Li Xuan Bai Cheng Qi Tang
Hong Yao Tie Gao San Ao Tang Yang Xue Run Chang Jian
Huai Hua San San Liang Ban Yao Jiu Yi Nian Jin
Huang Lian Jiang Tang San San Ye Tang Yu Quan Wan
Huang Lian Jie Du Tang Shen Fu Tang Zhi Ke Pi Pa Tang Jiang
Huang Qin Tang Sheng Ma Ge Gen Tang
Testing Set
58
Chapter 3
Table 7. List of Positive Formulae in the Training and Testing Set of Group 2
Testing Set
Tao He Cheng Qi Tang Gui Zhi Jia Gui Tang Huang Qi Gui Zhi Wu Wu
Bai Hu Jia Ren Shen Tang Ling Gan Wu Wei Jiang Tang
Bao Yuan Tang Xin Tang Kang Bai Huo He Ji
Fu Fang Da Qing Ye He Ji
59
Chapter 3
Table 8. List of Positive Formulae in the Training and Testing Set of Group 3
Testing set
60
Chapter 3
Table 9. List of Positive Formulae in the Training and Testing Set of Group 4
Ba Zhen Yi Mu Wan Jia Jian Dang Gui Si Ni Shan Zha Xiao Mi Tang
Ban Long Wan Tang Shang Dong E Jiao Gao
Ban Xia Xie Xin Tang Jia Wei Gui Zhi Fu Ling She Xiang Zhen Tong Gao
Bao He Wan Wan She Yang Tang
Bu Fei Shen Ji Tang Jia Wei Zhi Zhuo Gu Ben Shen Jin Dan Jiao Nang
Bu Yang Huan Wu Tang Tang Sheng Ge Er Chong Tang
Cai Hu Qing Gan Yin Jiang Ya Ping Pian Shun Jing Tang
Chai Hu Shu Gan San Jie Biao Tui Re Fang Tian Tai Wu Yao San
Chong Cao Bu Tian Jing Jin Lian Mao Gong Yin Tiao Gan Tang
Da Qing Long Tang Ku Shen Tang Wan Ying Gao
Da Yuan Yin Lan Di Tang Wei Kang Ling Jiao Nang
Dang Gui Liu Huang Tang Li Pi Su Fei Tang Xiao Chai Hu Tang
Dang Gui Si Ni Tang Lian Po Yin Xiao Er Yi Niao Fang
Di Yu Wan Liu Yi Bing Zhu San Ma Xiao Yao Wan
Fei Er Wan You Hu Xing Jun San
Fu Fang Chan Yi Yin Luo Han Guo Zhi Ke Chong Xuan Fu Dai Zhe Tang
Fu Fang Lei Gong Teng Ji Yang He Tang
Jian Ji Ma Zi Ren Wan Yang Wei Li Qi Tang
Gan Cao Xie Xin Tang Mi Zhen Tang Yi Qi Huo Xue Pian
Gan Ji San Mu Li Ze Xie San Yu Zhen San
Ge Gen Tang Qi Bao Mei Ran Ke Li Yun Qi San
Gong Wai Yun Fang Qi Wei Du Qi Wan Zhen Xin An Shen Tang
Hong Ling San Qi Wei Pu Tao San Zhi Sou San
Hua Gan Jian Qi Wei Zhi Shen Tang Zhi Zhi Tang
Huang Lian Tang Qian Bai Bi Yan Pian Zhu Ye Mai Dong Shi Gao
Huang Qin Hua Shi Tang Qiang Huo Sheng Shi Tang Tang
Huang Tu Tang Qin Lian Si Wu Tang Zhu Ye Shi Gao Tang
Ji Ming San Qing Li Tang Sha Shen Mai Dong Tang
Sang Xing Tang Qu Feng Zhi Tong Pian
Testing Set
Dang Gui Si Ni Tang Dang Gui Jian Zhong Tang Huang Qi Jian Zhong Tang
Huo Bu San Ren Tang Ji Xue Teng Gao
61
Chapter 3
Table 10. List of Positive Formulae in the Training and Testing Set of Group 5
Ba Wei Qing Xin Chen Xiang Kuo Qing Yin Si Chong Huo Xue Tang
San Li Gan Long Pian Snag Piao Xiao San
Ba Wei Tan Xiang San Li Ling Tang Su Wei Tang
Ba Zhen Wan Ling Yi Bai Jiang Tang Su Xiao Niu Huang Wan
Ban Xia Bai Zhu Tian Ma Mi Niao Xi Jie Shi Fang Tan Du Jian
Tang Nu Bao Tong Jing Zhi Ke Tang
Bei Ling San Qi Gong Tang Tong You Tang
Bi Xue Shen Shi Tang Qi Guan Yan Wai Tie Fang Tuo Li Ding Tong Tang
Bu Dai Wan Qi Ju Di Huang Wan Wen Dan Tang
Bu Zhong Yi Qi Wan Qi Li San Wu Cao Tang
Chan Su Wan Qi Shao Ji Cao Tang Wu Jia Qiang Xin Tang
Chuan Bei Qing Fei Tang Qiang Xin Ying Xian Qi Yin
Jiang Qing Gan He Jie Tang Xiang Sha Liu Jun Zi Wan
Chuan Xiong Cha Tiao Wan Qing Gu San Xiao Ban Tang
Er Long Zuo Ci Wan Qing Guo Wan Xiao Qing Long Tang
Fu Yuan Huo Xue Tang Qing Lin He Ji Xiao Yao San
He Che Da Zao Wan Quan Long Si Zi Tang Xie Qing Wan
Hua Yu Qing San Tang Ren Shen Ding Chuan Tang Xie Shi Tang
Hui Tian Zai Zao Wan Rou Gan Xi Feng Tong Luo Yang Yin Li Yan Tang
Huo Xue Run Zao Sheng Jin Tang Yang Yin Qing Fei Tang
San Ru Xiang Ding Tong Wan Yi Chan He Ji
Jia Jian Tuo Hua Jian San Bai Tang You Gui Yin
Jia Jian Wei Rui Tang San Qi Shang Yao Pian Zao Du Fu Jiang Tang
Jia Wei Ban Xia Hou Po Tang San Ren Tang Zhen Xin Di Tan Tang
Jia Wei Fu Zi Li Zhong Tang San Shen Tang Zhi Shi Dao Zhi Wan
Jia Wei Xiao Yao San Sang Ju Yin Zhi Shu He Wei Tang
Jian Wei Cha Shen Kang Ning Pian Zhi Xue Tang
Jiao Ai Tang Shen Qi Wan Zuo Gui Wan
Jie Nue Qi Bao Yin Shu Gan Chong Ji Kang Bao Kou Fu Ye
Ku Shen Zhi Yang Tang Kai Xiong Shun Qi Wan
Testing Set
62
Chapter 3
Table 11. List of Positive Formulae in the Training and Testing Set of Group 6
Testing set
Jia Jian Bu Zhong Yi Qi Jiu Wei Xiao Ding Yin Ma Xin Qin Ting Tang
Tang Ding Chuan Tang Man Gan Jiu Wei Tang
63
Chapter 3
Table 12. List of Positive Formulae in the Training and Testing Set of Group 7
64
Chapter 3
Table 13. Number of Samples in the Training and Testing sets after Calculation Using
SVM
Positive Negative
Positive Negative
TP FN TN FP
1 74 501 10 0 498 1
2 73 251 5 3 749 0
3 80 211 10 2 786 3
4 79 267 1 4 733 0
5 84 231 10 6 768 1
6 88 290 3 2 710 0
7 94 210 14 2 790 0
The prediction accuracy of SVM in classifying TCM formulae is given in Table 14.
The ove rall accuracy of SVM for classifying Group 1 was 99.8%; the sensitivity, 100%;
the specificity, 99.8%. The overall accuracy of SVM for classifying Group 2 was
99.6%; the sensitivity, 62.5%; the specificity, 100%. The overall accuracy of SVM for
classifying Group 3 was 99.4%; the sensitivity, 83.3%; the specificity, 99.6%. The
65
Chapter 3
overall accuracy of SVM for classifying Group 4 was 99.5%; the sensitivity, 20%; the
specificity, 100%. The overall accuracy of SVM for classifying Group 5 was 99.1%;
the sensitivity, 62.5%; the specificity, 99.9%. The overall accuracy of SVM for
classifying Group 6 was 99.7%; the sensitivity, 60%; the specificity, 100%. The overall
accuracy of SVM for classifying Group 7 was 99.7%; the sensitivity, 87.5%; the
specificity, 100%.
Group Number
TP/(TP+FN) TN/(TN+FP) (TP+TN)/(TP+TN+FN+FP)
Description of Sensitivity, Specificity, Overall Accuracy, TP, FN, TN and FP in Chapter 3.5
66
Chapter 3
The high overall accuracies (>99%) may reflect the reliability of this method. At
least, it has strong ability to recognize negative samples. The specificities of all 7
groups are bigger than 99.5%. For Group 2, 4, 6 and 7, all of the negative samples are
recognized. Thus this SVM system appears to be capable of minimizing the probability
may not truly reflect its ability to pick up “true positive ”, that is, the known effective
formulae. In this study, the values of overall accuracies largely depend on the number
of “true negative ” samples, which is much higher than “true positive” samples. That is
the reason why, in some groups, Sensitivity is much lower than Specificity and Q. In
other words, we can say that recognition ability of “true positive ” is different in
different groups, or, more seriously, the evaluation of a known effective formula may
be incorrect. Table 14 shows satisfactory results for Group 1, Group 3 and Group 7
(sensitivity > 83%). These groups have a common character, that is, in the testing set,
the positive samples (TP and FN) are relatively more than, at least not less than those
samples in the other groups. In Group 1, Group 3 and Group 7, the positive sample is
10, 12 and 16, respectively. While in Group 4, the recognition ability is particularly low,
with only 20% of the formulae correctly classified. The number of the positive testing
samples in this group is 5. Similarly, in Group 6, the number of the positive testing
formulae is also 5. For this group, the sensitivity is 60%. Considering the small number
of these positive testing samples, the results in these groups may be not reliable.
Several factors that limit this method are given in the next paragraphs.
67
Chapter 3
Several factors may affect the prediction accur acy. Firstly, the number of positive
samples is critical to the predicted results. As shown in Table 14, the lower values of
sensitivity than those of specificities may be the consequence of lack of positive data.
Secondly, the number of negative samples also needs careful consideration. This
number should be higher than, but not much higher than the minimum. In this study,
the number of the negative testing samples is too high, while that of the positive testing
samples is too low. Further studies should be carried out to reduce the number of the
former and to find more positive samples for testing. Moreover, SVM prediction can
vector. The selection of feature vector may be more important. This includes the
selection of the value of each element and the construction rule of formulae’s feature
vector from herb’ feature vector. Changing to a proper value of an element may
generate reasonable results. For example, when a herb is extremely cold, the element
that represents “cold” is assigned a value 1.2 (Table 4). This value can be changed to
1.1, 1.2 or whatever, and then a new feature vector is generated. Based on a set of these
new feature vectors, an optimal training result may be achieved. This will be carried
out in the future study. There may be a number of possible methods to construct the
feature vector of the formulae from the herb’s feature vector. In this study, I select a
simple one (see Chapter 3.4). This method has an obvious disadvantage, that is, it can
only train the formulae that contain the same number of herbs. Hence, I have to
separate the formulae into several groups and then train and test them respectively. The
other methods for constructing the feature vector sho uld be employed in the future
68
Chapter 3
study.
Table 15 gives the ineffective formulae that are incorrectly predicted as effective
formulae. One possible reason for the incorrect prediction is the inherent deficiency of
this method as described above. But we cannot rule out the possibility that these
non-effective formulae and it may provide some helpful hints for herbalist doctors to
69
Chapter 3
several potentially effective formulae from the hundreds of randomly mixed formulae.
These formulae never appear in previous literature and their effects have never been
studied before. It is unclear whether these formulae have the therapeutic value. The
method is expected to facilitate the prescription of new and novel TCM formulae as
well as the validation of existing TCM formulae while more and more formulae are
70
Chapter 4
INVDOCK
4.1 Introduction
prostatic hyperplasia (BPH) and associated lower urinary tract symptoms (LUTS) has
been growing steadily [45-49,94-96]. The most widely used phytotherapeutic agent is
the extract of Serenoa repens, one kind of dwarf palm plants [45-49,97-104]. Many
clinical trials have been carried out to evaluate the effect of the extract of this herb
Permixon, one commercial product of the extract of Serenoa repens, (11 randomized
clinical trials and 2 open label trials) [100], involving 2859 patients, reveals that in
men with symptomatic BPH, the extract yields a significant improvement in peak flow
castrated rats also show inhibition effects on the increase of prostate wet weight by the
extract of Serenoa repens [107]. When compared with commonly used drugs, an
extract shows nearly the same effects, causing parallel and statistically significant
71
Chapter 4
decreases in symptom scores and increases in maximal flow rates. These results
indicate the effects of the extracts of Serenoa repens on men with symptomatic BPH.
Its main mechanism of action is still unclear; however, possible activities have
regulating prostate cell growth. Studies demonstrate that the lipido-sterol extract of
androgens [111].
In one study, LSESr significantly inhibits proliferation and induces cell death in
both epithelium and stroma of BPH issue [112]. It also appears to affect the
b-FGF-induced proliferate of prostate cells [113] and decreases the amount of EGF in
BPH [105]. Another study demonstrates that Serenoa repens may block PRL-induced
prostate growth by inhibiting several steps of PRL receptor signal transductions [104].
It exerts an anti- inflammatory effect through the inhibition of the enzyme responsible
[105].
The mechanism of this herb is unknown. It is also unclear which of the compounds
in the extract of this herb account for its therapeutic effects. Many experiments assume
that the lipid and sterol in this herb account for the therapeutic effects
72
Chapter 4
extract: 1-monolaurin and 1-monomyristin, and finds that they show moderate
biological activities in the brine shrimp lethality test and against renal (A-498) and
pancreatic (PACA-2) human tumor cells [118]. A number of compounds have been
extracted from this herb. To explore the mechanisms and pharmacology of this herb,
these ingredients need to be examined first. And it is more important to explore how
they can collectively interact with various proteins and other molecules in the human
body. But such an investigation is hindered by the high cost and time spent on
synthesis and bioassay of natural products. Therefore alternative methods are needed
to understand and analyze molecular mecha nism of active ingredients at a higher speed
As one of the strategies for mechanistic study of natural products, the software
INVDOCK is developed to calculate the binding between the compounds and the
potential therapeutic and toxicity protein targets of compounds, Chen et al has tested
on a more diverse set of compounds [42,120]. The predicted potential therapeutic and
toxicity protein targets for a number of clinical drugs were found to be in fair
combined with that of proteomics and pharmacokinetic profile can facilitate the
predicted effects.
When this work was done, I did not have the 3D structure of 5 alpha-reductase and
73
Chapter 4
androgen receptor, so I did not evaluate the effects of the compounds on these two
targets. In this work, I focus on the mecha nism of this herb involved in other processes
to provide the experimentalists the most possible targets, which, I believe, will
facilitate them in the construction of experiments. 23 active compounds from this herb
are selected to search for their potential targets through INVDOCK. Because these
compounds are the main ingredients of Serenoa repens, the collective effects of them
may account for parts of the actions of this herb in the treatment of BPH.
spheres that fill up that cavity. Each cavity entry is derived from the corresponding
PDB entry by the following procedure: Ligands and water in a PDB protein structure
are first removed. The surface of this protein, as defined by Richards [122], is then
generated. The van der Waals surface of a solvent-accessible atom is generated with
the respective parameter from the AMBER force field [123]. The inward- facing
surface covering the interface of van der Waals surfaces is computed with a probe
sphere 1.4 Å in radius. The whole protein surface is then coated and filled with a
cluster of spheres by a method similar to that of Kuntz et al. Each sphere is checked for
the extent of its surrounding space covered by protein atoms. The surrounding space is
covered if more than 50% of the direction around the sphere is covered by protein
74
Chapter 4
atoms. The covered spheres are then divided into separate cavity groups on the basis of
in that group. This is to ensure that a cavity model sufficiently covers the cavity
surface region where a tail of a ligand might be located. In some ligand-protein PDB
structures, such as HIV-1 protease complex with its inhibitor, one or both ends of the
includes both the cavity and the groove to which it is connected. Because a groove
may also be a binding site, no attempt has been made to remove grooves from a cavity
model. A visual inspection of these and a large number of othe r proteins showed that
all the cavities and their adjacent grooves were reasonably represented.
3D structures of proteins from different species have been deposited in the PDB.
ligand with respect to one species or a few particular species. Therefore, cavity entries
can be divided into classes defined by disease-related species, sequence similarity, and
mammal. Additional information such as the known ligand-binding region and the
75
Chapter 4
extent of each sphere being buried inside a cavity is also included in a cavity entry to
every entry of the protein cavity database for potential protein targets of a small
of possible binding sites within each cavity is difficult. Hence, all parts of each cavity
are subject to docking. Docking to sites inside each cavity starts from known
ligand-binding sites, followed by more interior sections and then the remaining part.
To save central processing unit (CPU) time, the program proceeds to the next protein
entry when the first successful dock is obtained without an exhaustive search for the
optimum binding mode within each cavity. Although the optimum binding mode is no t
specifically sorted, the prioritized search strategy seems to dock a ligand to a site
structures studied. All cavity entries are subject to searching unless the related protein
Wang et al. The docking of a particular conformer to a cavity is as follows: First, the
ligand is aligned within the selected site by the position of each ligand atom being
matched with the center of the spheres. Because of the relatively low-resolution nature
76
Chapter 4
limited torsion space sampling of rotatable bonds in the ligand and those in the
side-chain of the receptor amino acid residues at the binding site. Each rotatable bond
minimization on all ligand and protein atoms at the binding site to further optimize the
In both torsion optimization and energy minimization, AMBER force fields [123]
are used for covalent-bond, bond-angle, torsion, and nonbonded van der Waals and
non-hydrogen atoms. To save computing time and avoid the difficulty in modeling
distance, is used to represent hydrogen-bond terms. This potential has been shown to
[126,127]. The published Morse potential parameters [126,127] are used in this work.
Protein flexibility is known to influence ligand binding, and thus methods such as
considered in INVDOCK along with additional energy minimization for all atoms at
the binding site. However, no attempt is made to explicitly sample the ensemble of
77
Chapter 4
submitted by different groups; some of them are associated with different binding
ligands or different mutants. Moreover, there are multiple conformations in most NMR
proteins.
4.2.3 Scoring
function, ?ELP , composed of the same hydrogen-bond and nonbonded terms as those
complexes shows that the computed ? ELP is generally less than ? ET hreshold = -αN
kcal/mol, where N is the number of ligand atoms and α is a constant (∼1.0). The exact
value of α can be determined by the fitting of the linear equation ?EThreshold = -αN to
the computed ?ELP for a large set of PDB structures. This statistically derived energy
value can be used empirically as a threshold for screening likely binders. A polynomial
energy threshold. ?ELP can be required to be less than ?EThreshold when successfully
binds to its receptor noncompetitively against natural ligands and, to some extent,
other drugs that bind to the same receptor site. This binding competitiveness may be
partially taken into consideration for cavities known to be ligand-bound in at least one
78
Chapter 4
PDB entry. Ligands in PDB structures are known binders. Therefore, PDB ligands
bound to the same receptor site as that of a docked molecule may thus be considered
computed ? ELP is not only evaluated against ? EThreshold but also compared to the
ligand-protein interaction energy of the corresponding PDB ligands that bind in the
same cavity in this or other relevant PDB entries. The ligand-protein interaction energy
for the relevant PDB structures is computed by the same energy functions as that for
the docked molecule. In addition to the condition that it be lower than ?Ethreshold, ?ELP
? ECompetitor when a potential target is selected. ?ECompetitor can be taken as the highest
factor β. For finding weak binders as well as strong binders, a factor β≤1 is introduced
to scale the ligand-protein interaction energy of PDB ligands. This is because a weak
binder may have a slightly higher interaction energy than that of a PDB binder. We
have not found experimental data to determine the value of β. Here, β is tentatively
study suggests that a value of 0.8 for β leads to reasonable results statistically.
literature and books [96,130-132]. The 3D structures of these ingredients are generated
through the following steps. First, the 2D structures are got from the Combined
79
Chapter 4
are converted to 3D structures through the software Weblab. The structures are given
in Figures 11 to 14.
contains 433 protein entries that have been reported to be the target of either clinical
(2)
80
Chapter 4
81
Chapter 4
82
Chapter 4
83
Chapter 4
4.3 Results
All the 23 ingredients, including eight fatty acids, eight ethyl esters of these fatty
acids, five sterols and two monoacylglycerides, are used to scan the protein library to
identify their potential targets. Based on the information, their contribution to the
therapeutic and toxic effects of Serenoa repens is evaluated (Table 16). These
compounds constitute more than 85% of all the ingredients in the extract of Serenoa
repens, which is used for treatment of BPH and is believed to contain the
unclear, theoretically, the therapeutic effects of this herb may be due to some of these
compounds.
From the computational results, each compound can bind to several protein targets,
ranging from one to seven. Taking into account of the overlap of the same targets by
different compounds, finally, 25 protein targets are obtained (Table 17 and Table 18),
each of which can be bound by seve ral different compounds. To a given protein target,
the binding energy of different compounds varies significantly. For example, the
results show that 20 compounds can bind to prostaglandin H2 synthase (the data is not
included in the thesis), but their binding energy is very different. The highest one is
that of stigmasterol and the lowest one is that of capric acid. The energy is 47.6
kcal/mol (Table 17) and 28.4 kcal/mol (this data does not include in the thesis),
respectively. Compared with the former, the latter is a very weak binder and may have
little contribution to the interaction with the target protein. Hence, in this study, I only
consider the interactions with sufficiently strong binding energy. When the binding
84
Chapter 4
energy of a compound to a given protein is 3 kcal/mol less than the maximum, the
interaction between the compound and the protein is neglected. The compounds that
can bind to the targets are given in Table 17 and Table 18.
The action of the herbal extract has been suggested to be through the following
effects. A number of proteins involved in these processes are in the list of our predicted
targets.
The anti- inflammatory effects are mainly through interference with prostaglandin
synthase, is found highly exp ressed in prostate cancer and BPH issue, suggesting an
important role in the pathogenesis of BPH. Studies find its expression can be decreased
by the extract of Serenoa repens [135]. One study suggests phytosterols to be the most
importance of inhibition of this enzyme [104]. From our study data, two phytosterols
(stigmasterol and cycloartenol) can bind to this enzyme. On the other hand,
beta-sitosterol, believed to be the most important ingredient, is found not to bind to this
85
Chapter 4
Table 17. Predicted Proteins related with BPH. [energy >= (Max-3) kcal/mol]
86
Chapter 4
demonstrate that the extract of this herb can alter bFGF growth factors- induced growth
and proliferation [113]. In their study, it is mentioned that lupenone and the
proliferation. The inhibition effect may be through reducing the available growth factor
receptor and growth factor. Our data show that one sterol (stigmasterol) can bind to
fibroblast growth factor receptor. Moreover, fibroblast growth factor (FGF) can be
bound by two fatty acid (lauric acid and capric acid) and monomyristin.
Another growth factor, stem cell factor, which is supposed to exert a broad range of
expressed in Prostate carcinoma cell lines: DU-145 and PC-3 lines [140]. Out data
inhibition of other proteins that are critical to the growth of cells may account for the
effect of this herb. Ornithine decarboxylase (ODC) is essential for polyamine synthesis
differentiation. It has been found that not only the expression level of ODC mRNA,
but also the activity of this enzyme is higher in BPH tissue than that of normal prostate
tissues [141]. Our data show that three fatty acids, stearic acid, oleic acid and myristic
87
Chapter 4
acid can bind to this enzyme. Another rate- limiting enzyme of the polyamine pathway,
palmitic acid, caprylic acid and monolaurin. Both of them are found to be useful
targets in the treatment of prostate cancer [142]. Protein kinase C (PKC) is involved in
the proliferative response of human cultured prostatic stromal cells [143]. Our data
show that betasitosterol, monolaurin, oleic acid and stearic acid can bind to this
enzyme.
One predicted protein target is nitric oxide synthase. This enzyme catalyzes the
production of NO, which is involved in several important pathways. One study finds
that inducible nitric oxide synthase is expressed in the prostate of all benign prostatic
[144]. Monolaurin and two sterols (lupeol and stigmasterol) can bind to this enzyme.
Vitamin A (retinol) and its derivatives, the retinoids, have been implicated as
the prostate. Very little is known about the physiological role of retinoids in the
prostate. Another study finds that the nuclear retinoic acid receptors are expressed in
BPH samples [145]. Inhibition of this receptor may neutralize the proliferate effect
caused by retinal. Two fatty acids (oleic acid and Caprylic acid) and one sterol
88
Chapter 4
implicated in the pathogenesis of BPH. A number of studies have shown that the
Serenoa repens extract takes effect through reducing the generation of DHT [109-111].
It is believed that part of the anti-androgenic effects of this herb is due to the inhibition
of SCHAD. Our data show that two fatty acids (oleic acid and lauric acid) can bind to
this enzyme.
In regard to the anti-estrogenic effects of Serenoa repens, it has been found that it
can competitively block the translocation of cytosolic estrogen receptors to the nucleus.
[96,106]. Two sterols (stigmasterol and betasitosterol) can bind to this receptor. One in
Our study suggests that effects of this herb on other targets may be involved in the
treatment of this disease. In the list of our predicted proteins, besides the targets that
have been mentioned previously, there are still some other targets. So, the next task of
our study is to analyze these proteins to make clear the relationship of these proteins
with BPH. Among these proteins, many proteins are found to be involved in cell
proliferating, cell cycle, tumor me tastasis and other process (Table 18). These proteins
had higher expression in BPH or prostate cancer tissue. Targeting toward them can at
89
Chapter 4
Table 18. Other predicted important proteins. [energy >= (Max-3) kcal/mol]
90
Chapter 4
developed toward CDK to induce growth arrest of the cells at G1 phase [149]. Studies
find that the extract of this herb can induce apoptosis and inhibit cell proliferation of
BPH tissues from patients [112]. In our data, four phytosterols (Beta-sitosterol,
The other two important molecules involved in the cell cycle, Cdc25 B-Type
Tyrosine Phosphatase [150,151] and CDC42 [152,153] may be the potential targets.
From our data, they are targets of caproic acid and monolaurin, respectively.
shown that its production is elevated in prostatic carcinoma in comparison with benign
prostatic hyperplasia and normal prostate [155]. Our data show two fatty acids (capric
acid and caprylic acid) can bind to it. Plasminogen activator inhibitor 1 (PAI-1) is
suggested by one study to be involved in the tumor growth through the anti-apoptotic
effect in the benign human breast epithelial cell line MCF-10A [162]. Our data show
that two phytosterols (cycloartenol and stigmasterol) can bind to this protein.
growth and induce cell cycle arrest using MDA-MB-231 human breast cancer cells to
evaluate the effects of phytosterols. Connolly JM’s study shows inhibition effects of
fatty acids on DU145 human prostate cancer cell growth in athymic nude mice [164].
91
Chapter 4
4.3.5 Anti-metastasis
tissue remodeling and thus in various disease processes including tumor invasion and
inhibitor design and drug development. In our predicted list, four types of MMPs are
MMP13. Their concentrations are significantly higher in PCa patients with metastases
[158]. MMP1 can be bound by beta-sitosterol, MMP3 by stigmasterol and cyc loartenol,
motile state when metastasis [159]. Caprylic acid and Caproic acid can bind to this
protein. Chymase is first found in mast cells and seems to be involved in the
drugs in cardiovascular and inflammatory diseases [160]. Monolaurin can bind to this
invasion and metastasis. Cathepsin B often presents with higher amounts in malignant
tumors than in normal tissues or benign tumors. Numerous clinical studies have
92
Chapter 4
clinical outcome for patients with diverse types of tumors [156]. Stigmasterol and
higher in tumours of prostate, lung and sigmoid, than in the healthy tissues. It supports
in the pathogenesis of hypertension [157]. Out data show that ethyloleate bind to it.
Some studies have demonstrated the inhibitory effects of fatty acids [165,166] and
experiments have been carried out to prove the relations hip between these proteins and
the pathogenesis of BPH, our study seems to indicate that they play an important role
in this process.
4.4 Discussion
The results show that most of these chemical ingredients of Serenoa repens can
bind to a number of targets, which has been proven by experiments. Moreover, our
compounds. The results show in Table 19 that a compound can only bind to one or a
few protein targets, which indicates the role of this herb in the treatment of BPH is due
The results indicate that not all compounds are helpful to the overall action of this
93
Chapter 4
ethylpalmitate and ethylstearate, may have little contribution to the expected actions of
this herb. These five compounds have their own target proteins. But compared with the
other compounds, they are relatively weak binders to these targets. For example, both
but the latter strongly docked into the protein. Hence the effects of ethylstearate can be
neglected.
94
Chapter 4
Continued
Compound Predicted Targets
Cycloartenol Prostaglandin H2 Synthase (Cyclooxygenase)
Cyclin- Dependent Kinase 2
Plasminogen Activator Inhibitor
Stromelysin-1 (Mmp 3)
Lupenone Cyclin- Dependent Kinase 2
Lupeol Nitric Oxide Synthase
Cyclin- Dependent Kinase 2
Stigmasterol Plasminogen Activator Inhibitor
Stromelysin-1 (Mmp 3)
Adp-Ribosylation Factor 6
Fgf Receptor 1
Retinoic Acid Receptor
Nitric Oxide Synthase
Prostaglandin H2 Synthase (Cyclooxygenase)
Estrogen Receptor
Cathepsin B
Ethylcaprylate Tumor Necrosis Factor Receptor
Ethyllaurate Tumor Necrosis Factor Receptor
Ethyloleate Prolyl Endopeptidase
Monolaurin Chymase
Gtp-Binding Protein (Cdc42)
S-Adenosylmethionine Decarboxyla
Nitric Oxide Synthase
Protein Kinase C
Monomyristin Fibroblast Growth Factor 1
Cathepsin B
Among the interaction between the predicted targets and the ingredients of Serenoa
repens that have been mentioned above, many have not been proven or implicated by
It is not expected that exhaustive experiments have been done to determine all
protein targets of a given compound, nor the multiple compounds of one herb. Because
95
Chapter 4
the protein expression, protein concentration and accessibility vary largely in different
cell system, on which many experimental studies of compound are based, observation
of the molecular events related to the ligand-protein interaction is also difficult. These
targets do not exist or have lower level in these cells. Although the binding energy
suggests that the interaction should be possible, but not exist in these cells. The
discovery of protein profiles can provide useful information to facilitate the study
these compounds by different cells or issues. Even if a large doze of one chemical drug
is taken, when it arrives at its protein target, its concentration may become very low
compared with the initial concentration. On the contrary, a chemical with low initial
concentration may easily arrive at its targets to take affect efficiently. In this regards,
when we select the compounds, we chose not only those with great amount in the
extract but also those with little amount. Besides, to determine the potentia l ligands of
one given protein, we take into account of the binding energy. If the binding energy of
a compound was 3 kcal/mol less than the maximum, the binding of the compound is
neglected. Considered that the binding ability is exponential with the binding energy,
that are key therapeutic targets for a variety of diseases. Moreover, some of the PDB
96
Chapter 4
structures may be of little relevance to binding study for a particular molecule. These
include entries containing an incomplete section or a chain, protein mutants that are
expected to provide a more diverse set of relevant structures. Knowledge from study of
In our study, 5 alpha reductase, which has been long suggested to be the most
in the proliferatio n of the prostate. Studies show that inhibition of this enzyme could be
a useful method to cure diseases related with the prostate. This problem can be
methods [169].
97
Chapter 4
4.5 Conclusions
Our study provides the potential targets of Serenoa repens in the treatment of BPH,
compounds in the extract. Besides these interactions, other predicted bindings between
particular compounds and protein targets have not been proven by experiments.
Moreover, from our data, other important targets may be involved in the pathogenesis
of BPH and can be targeted by Serenoa repens. Thus further studies need to be carried
out to evaluate these protein targets and the role of these compounds. In conclusion, as
a relatively fast-speed and low-cost tool, this software may find application in
medicinal plants.
98
Chapter 5
Chapter 5: Conclusions
about all aspects of TCM including prescription formulae, constituent herbs, known
herbal ingredients and their molecular actions. The therapeutic effects, adverse
reactions, clinical indications and applications are provided at the levels of prescription,
medicinal herbs such as tastes, flavors, site of actions, toxicity level, and therapeutic
Almost all negative formulae are correctly classified. Especially in Group 2,4,6 and 7,
all of the negative samples are correctly recognized. The study may also have some
helpful hints for herbalist doctors to determine the effectiveness of a TCM formula.
Except Group 4 (formulae contain 7 herbs), the sensitivities of the other groups are
more than 60%. Especially, this method has satisfactory performance for groups in
which the formulae contains 4,6,10 herbs, respectively. In these groups, the
sensitivities are more than 83%. In addition, several potential effective TCM formulae
are derived from hundreds of randomly mixed recipes. These formulae never appear in
previous literature and their effects have never been studied before. It is unclear
99
Chapter 5
whether these formulae have the therapeutic value. It is expected that the method can
be an alternative approach to study formulae while more and more formulae are under
scientific studies.
The other part of this work is focused on the molecular mechanism study of herbal
interaction between the chemical ingredients of a medicinal herb Serenoa repens and a
potential targets are obtained, each of which can be bound by several different
compounds. Among the list of the targets, a number of proteins have been proven by
experiments to be bound by the compounds. Moreover, many protein targets are found
to be involved in cell proliferating, cell cycle, tumor metastasis and other processes
and have higher expression in BPH or prostate cancer tissue, indicating the important
roles in the treatment of BPH. The computational results indicate the feasibility of
Serenoa repens.
100
References
References
5. Ling YK and Yan ZH. Traditional Chine se medicine. Shanghai Science and
7. Shiang E and Li FP. The Yin-Yang (cold-hot) theory of disease. JAMA. 1971,
217(8):1108.
9. Shang Z. Discussion on the date of appearance of the title Shen nong ben caojing
10. Ouyang B. Studies on Shang Han Lun in Ming Dynasty and its influence.
101
References
11. Song Z. Research on the archaic pronunciation of Ma Bi San for Ma Fei San, a
12. Liu ZZ. A course in English on Chinese materia medica. (15)Li Shi- Zhen--a great
13. Li SZ. Ben Cao Gang Mu. People's Medical Publishing House. 1982.
14. Chen JT. Review: drug therapy in Chinese traditional medicine. Journal of
15. Diamond WJ. The clinical practice of complementary, alternative, and western
16. Ang-Lee MK, Moss J and Yuan CS. Herbal medicines and perioperative care.
17. Eisenberg DM, Davis RB, Ettner SL, Appel S, Wilkey S, Van Rompay M and
Kessler RC. Trends in alternative medicine use in the United States, 1990-1997.
18. Kam PC and Liew S. Traditional Chinese herbal medicine and anaesthesia.
19. Qu J, Zhang S and Xie R. The Chinese Materia Medica. Shanghai University of
2000.
102
References
2000, 71(10):621-629.
23. Yokozawa T, Liu ZW and Chen CP. Protective effects of Glycyrrhizae radix
25. Akao T. Effects of glycyrrhizin and glycyrrhetic acid on the growth, glycyrrhizin
26. Sun XY, Huang LJ and Liu X, Chemical ingredients of Radix Glycyrrhizae.
27. Wang F and Su YH, Pharmacological action and clinical application of Radix
13(5):303-304.
2000, 44(9):799-804.
103
References
30. Banskota AH, Tezuka Y, Le Tran Q and Kadota S. Chemical constituents and
biological activities of Vietnamese medicinal plants. Curr Top Med Chem. 2003,
3(2):227-248.
31. Singh VK, George CX, Singh N, Agarwal SS and Gupta BM. Combined
47:234-236.
32. Singh VK, Agarwal SS and Gupta BM, Immunomodulatory activity of Panax
33. Tianjin Angel Power Allied Pharmaceutical Company. Hu Xing Dan, Tianjin,
China, 1996.
34. Chiatai Qingchunbao Pharmaceutical Co. Ltd. Sheng Mai Ingection. Hangzhou,
China, 1993.
36. Wang LX, Xiao HB, Liang XM and Bi KS. Vectorial angle method for evaluating
37. Qiao Y, Li P and Wang Y, Journal of Beijing Univ. of TCM (Chinese), 1998,
21(5):17-19.
104
References
20(11):588-590.
3:33-37.
42. Chen YZ and Ung CY. Computer automated prediction of potential therapeutic
43. Vapnik VN. Statistical Learning Theory. Wiley- interscience, New York, 1995.
44. Cai CZ, Han LY, Ji ZL, Chen X and Chen YZ. SVM-Prot: web-based support
45. Levin RM and Das AK. A scientific basis for the therapeutic effects of Pygeum
47. Wilt TJ, Ishani A, Rutks I and MacDonald R. Phytotherapy for benign prostatic
48. Gerber GS. Phytotherapy for benign prostatic hyperplasia. Curr Urol Rep. 2002,
3(4):285-291.
105
References
51. He M, Yan, X, Zhou J and Xie G. Traditional Chinese medicine database and
54. Zhao C and Xu YZ, Antitumor herbal medicines and their active compositions.
57. Zhang WG, Jia WC, Li SP, Zhang J, Ou YB and Xu XL. Chinese medicated diet.
106
References
60. Li DQ, Shi LH, Gao Y and Jiang JX. Prescription of traditional Chinese medicine.
64. de Vel O, Anderson A, Corney M and Mohay G. Mining E-mail content for
65. Kim KI, Jung K, Park SH and Kim HJ. Support vector machine-based text
66. Drucker H, Wu DH and Vapnik VN. Support vector machine for spam
67. Vapnik VN. The nature of statistical learning theory, 2nd Ed. Springer, New York,
1999.
68. Tong S and Koller D. Support vector machine active learning with applications to
69. Li ZY, Tang SW and Yan SC. Multi-class SVM classifier based on pairwise
107
References
70. Thubthong N and Kijsirikul B. Support vector machines for Thai phoneme
230:355-365.
72. Ben-Yacoub S, Abdeljaoued Y and Mayoraz E. Fusion of face and speech data
73. Wu CY, Liu C and Zhou J. Eyeglasses verification by support vector machine.
74. Wang YJ, Chua CS and Ho YK. Facial feature detection and face recognition
75. Hsieh JW and Huang YS. Multiple-person tracking system for content analysis.
76. Papageorgiou C and Poggio T. A trainable system for object detection. Inter J
77. Karlsen RE, Gorsich DJ and Gerhart GR. Target classification via support vector
78. Zhao Q, Principe JC, Brennan VL, Xu DX and Wang Z. Synthetic aperture radar
79. Gavrishchaka VV and Ganguli SB. Support vector machine as an efficient tool
108
References
80. Liong SY, Sivapragasam C. Flood stage forecasting with support vector machines.
81. Fritsche HA. Tumor markers and pattern recognition analysis: a new diagnostic
82. Bao L and Sun Z. Identifying genes related to drug anticancer mechanisms using
Golub TR. Multiclass cancer diagnosis using tumor gene expression signatures.
84. Chan K, Lee TW, Sample PA, Goldbaum MH, Weinreb RN and Sejnowski TJ,
85. Furey TS, Cristianini N, Duffy N, Bednarski DW, Schummer M and Haussler D.
86. Pavlidis P, Weston J, Cai JS and Noble WS. Learning gene functional
87. Brown MPS, Grundy WN, Lin D, Cristianini N, Sugnet CW, Furey TS, Ares M Jr
109
References
by using support vector machines. Proc Natl Acad Sci USA. 2000, 97:262-267.
89. Cai YD, Liu XJ, Xu XB and Chou KC. Support Vector Machines for Predicting
90. Wacher VJ, Salphati L and Benet LZ. Active secretion and enterocytic drug
metabolism barriers to drug absorption. Adv Drug Del Rev. 1996, 20:99-112.
91. Ding CH and Dubchak I. Multi-class protein fold recognition using support
92. Baldi P, Brunak S, Chauvin Y, Anderson CAF and Nielsen H. Assessing the
2000, 16:412-424.
93. Roulston JE. Screening with Tumor Markers. Mol Biotechnol. 2002, 20:153-162.
95. Shoskes DA. Phytotherapy and other alternative forms of care for the patient
97. Wilt T, Ishani A and Mac Donald R. Serenoa repens for benign prostatic
3:CD001423.
110
References
98. Debruyne F, Koch G, Boyle P, Da Silva FC, Gillenwater JG, Hamdy FC, Perrin P,
99. Koch E. Extracts from fruits of saw palmetto (Sabal serrulata) and roots of
101. Wilt TJ, Ishani A, Stark G, MacDonald R, Lau J and Mulrow C. Saw palmetto
102. Gerber GS, Zagaja GP, Bales GT, Chodak GW and Contreras BA. Saw palmetto
103. Plosker GL and Brogden RN. Serenoa repens (Permixon). A review of its
111
References
Tavani M and D'Eramo G. Plant extracts in BPH. Minerva Urol Nefrol. 1993,
45(4):143-149.
37(2):77-83.
Bazan NG, Authie D and Braquet P. Effect of Serenoa repens extract (Permixon)
108. Carraro JC, Raynaud JP, Koch G, Chisholm GD, Di Silverio F, Teillac P, Da Silva
109. Bayne CW, Donnelly F, Ross M and Habib FK. Serenoa repens (Permixon): a
112
References
110. Delos S, Iehle C, Martin PM and Raynaud JP. Inhibition of the activity of 'basic'
20(1):515-519.
Lefrere Belda MA, Abbou CC, Raynaud JP and Chopin DK. Induction of
45(3):255-266.
Effect of the lipidosterolic extract of Serenoa repens (Permixon) and its major
114. Vacher P, Prevarskaya N, Skryma R, Audy MC, Vacher AM, Odessa MF and
115. Paubert-Braquet M, Mencia Huerta JM, Cousse H and Braquet P. Effect of the
113
References
57(3):299-304.
119. Chen YZ and Zhi DG. Ligand-protein inverse docking and its potential use in the
43(2):217-226.
120. Chen YZ and Ung CY. Prediction of potential toxicity and side effect protein
121. Kuntz ID, Blaney JM, Oatley SJ, Langridge R and Ferrin TE. A geometric
122. Richards FM. Areas, volumes, packing and protein structure. Annu Rev Biophys
114
References
123. Cornell WD, Cieplak P, Bayly CI, Gould IR, Merz KM Jr, Ferguson DM,
Spellmeyer DC, Fox T, Caldwell JW and Kollman PA. A second generation force
field for the simulation of proteins and nucleic acids. J Am Chem Soc. 1995,
117:5179-5197.
124. Wang J, Kollman PA and Kuntz ID. Flexible ligand docking: a multistep strategy
calculations for NH3-NH3 and NH3-NH4+. Int J Quantum Chem Symp. 1974,
1:49-53.
126. Chen YZ and Prohofsky EW. The role of a minor groove spine of hydration in
127. Chen YZ and Prohofsky EW. Premelting base pair opening probability and drug
130. Ernst E. The risk-benefit profile of commonly used herbal therapies: Ginkgo, St.
John’s Wort, Ginseng, Echinacea, Saw Palmetto, and Kava. Ann Intern Med.
115
References
2000. 136:42-53.
131. Palin MF, Faguy M, LeHoux JG and Pelletier G. Inhibitory effects of Serenoa
132. Cristoni A, Pierro FD and Bombardelli E. Botanical derivatives for the prostate.
133. Chen X, Ji ZL and Chen YZ. TTD: Therapeutic Target Database. Nucleic Acids
134. Dvorkin L and Song KY. Herbs for Benign prostatic hyperplasia. Annals of
135. Goldmann WH, Sharma AL, Currier SJ, Johnston PD, Rana A and Sharma CP.
Saw palmetto berry extract inhibits cell growth and Cox-2 expression in prostatic
136. Boget S, Cereser C, Parvaz P, Leriche A and Revol A. Fibroblast growth factor
137. Deshmukh N, Scotson J, Dodson AR, Smith PH, Ke Y and Foster CS.
80(6):869-874.
growth factor receptor signaling in prostate cancer cell survival. J Natl Cancer
116
References
139. Simak R, Capodieci P, Cohen DW, Fair WR, Scher H, Melamed J, Drobnjak M,
kit- ligand in benign and malignant prostatic tissues. Histol Histopathol. 2000,
15(2):365-374.
145. Pasquali D, Thaller C and Eichele G. Abnormal level of retinoic acid in prostate
117
References
146. He XY, Merz G, Yang YZ, Pullakart R, Mehta P, Schulz H and Yang SY. Function
147. Mohan RR, Challa A, Gupta S, Bostwick DG, Ahmad N, Agarwal R, Marengo
148. Rosenblum ER, Stauber RE, Van Thiel DH, Campbell IM and Gavaler JS.
149. Carnero A. Targeting the cell cycle for cancer therapy. Br J Cancer. 2002,
87(2):129-133.
152. Aznar S and Lacal JC. Rho signals to cell growth and apoptosis. Cancer Lett.
2001, 165(1):1-10.
153. Erickson JW and Cerione RA. Multiple roles for Cdc42 in cell regulation. Curr
118
References
154. Murant SJ, Handley J, Stower M, Reid N, Cussenot O and Maitland NJ.
155. Arenas MI, Romo E, Royuela M, Fraile B and Paniagua R. E-, N- and P-cadherin,
oligopeptidase in human peripheral tissues and body fluids. Eur J Clin Chem Clin
158. Jung K, Nowak L, Lein M, Priem F, Schnorr D and Loening SA. Matrix
159. Turner CE and Brown MC. Cell motility: ARNO and ARF6 at the cutting edge.
9:947-954.
119
References
162. Kwaan HC, Wang J, Svoboda K and Declerck PJ. Plasminogen activator
163. Awad AB, Williams H and Fink CS. Phytosterols reduce in vitro metastatic
40(2):157-164.
164. Connolly JM, Coleman M and Rose DP. Effects of dietary fatty acids on DU145
human prostate cancer cell growth in athymic nude mice. Nutr Cancer. 1997,
29(2):114-119.
Hioki K and Tsubura A. Dietary effects of fatty acids on growth and metastasis of
KPL-1 human breast cancer cells in vivo and in vitro. Anticancer Res. 1998,
18(3A):1621-1627.
Effects of fatty acids on liver metastasis of ACL-15 rat colon cancer cells. Nutr
167. Awad AB, Fink CS, Williams H and Kim U. In vitro and in vivo (SCID mice)
168. Awad AB, Downie A, Fink CS and Kim U. Dietary phytosterol inhibits the
120
References
169. Sali A. 100,000 protein structures for the biologist. Nat Struct Biol. 1998,
5(12):1029-1032.
121