Basics of Bioinformatics Lecture Notes of the Graduate Summer School on Bioinformatics of China scribd download
Basics of Bioinformatics Lecture Notes of the Graduate Summer School on Bioinformatics of China scribd download
Visit the link below to download the full version of this book:
https://medipdf.com/product/basics-of-bioinformatics-lecture-notes-of-the-gradua
te-summer-school-on-bioinformatics-of-china/
Basics of Bioinformatics
Lecture Notes of the Graduate Summer
School on Bioinformatics of China
123
Editors
Rui Jiang Michael Q. Zhang
Xuegong Zhang Department of Molecular and Cell Biology
Department of Automation The University of Texas at Dallas
Tsinghua University Richardson, TX, USA
Beijing
China, People’s Republic Tsinghua National Laboratory
for Information Science and Technology
Tsinghua University
Beijing, China, People’s Republic
This ambitious volume is the result of the successful 2007 Graduate Summer
School on Bioinformatics of China held at Tsinghua University. It is remarkable
for its range of topics as well as the depth of coverage. Bioinformatics draws on
many subjects for analysis of the data generated by the biological sciences and
biotechnology. This foreword will describe briefly each of the 12 chapters and close
with additional general comments about the field. Many of the chapters overlap and
include useful introductions to concepts such as gene or Bayesian methods. This is
a valuable aspect of the volume allowing a student various angles of approach to a
new topic.
Chapter 1, “Basics for Bioinformatics,” defines bioinformatics as “the storage,
manipulation and interpretation of biological data especially data of nucleic acids
and amino acids, and studies molecular rules and systems that govern or affect
the structure, function and evolution of various forms of life from computational
approaches.” Thus, the first subject they turn to is molecular biology, a subject that
has had an enormous development in the last decades and shows no signs of slowing
down. Without a basic knowledge of biology, the bioinformatics student is greatly
handicapped. From basic biology the authors turn to biotechnology, in particular,
methods for DNA sequencing, microarrays, and proteomics. DNA sequencing is
undergoing a revolution. The mass of data collected in a decade of the Human
Genome Project from 1990 to 2001 can be generated in 1 day in 2010. This is
changing the science of biology at the same time. A 1,000 genome project became
a 10,000 genome project 2 years later, and one expects another zero any time now.
Chromatin Immunoprecipitation or ChIP allows access to DNA bound by proteins
and thus to a large number of important biological processes. Another topic under
the umbrella of biological sciences is genetics, the study of heredity and inherited
characteristics (phenotypes). Heredity is encoded in DNA and thus is closely related
to the goals of bioinformatics. This whole area of genetics beginning with Mendel’s
laws deserves careful attention, and genetics is a key aspect of the so-called genetic
mapping and other techniques where the chromosomal locations of disease genes
are sought.
v
vi Foreword
concepts, problems and challenges deals with the area of knowledge discovery and
data mining that has emerged as an important research direction for extracting useful
information from vast repositories of data of various types. The basic concepts,
problems and challenges are first briefly discussed. Some of the major data mining
tasks like classification, clustering and association rule mining are then described in
some detail. This is followed by a description of some tools that are frequently used
for data mining. Two case examples of supervised and unsupervised classification
for satellite image analysis are presented. Finally an extensive bibliography is
provided.”
The valuable chapter on Applied Bioinformatics Tools (Chap. 9) provides a step-
by-step description of the application tools used in the course and data sources as
well as a list of the problems. It should be strongly emphasized that no one learns
this material without actually having hands-on experience with the derivations and
the applications. This is not a subject for contemplation only!
Protein structure and function is a vast and critically important topic. In this
collection it is covered by Chap. 10, “Foundations for the Study of Structure and
Function of Proteins.” There the detailed structure of amino acids is presented
with their role in the various levels of protein structure (including amino acid
sequence, secondary structure, tertiary structure, and spatial arrangements of the
subunits). The geometry of the polypeptide chain is key to these studies as are the
forces causing the three-dimensional structures (including electrostatic and van der
Waals forces). Secondary structural units are classified into ’-helix, “-sheets, and
“-turns. Structural motifs and folds are described. Protein structure prediction is an
active field, and various approaches are described including homology modeling
and machine learning.
Systems biology is a recently described approach to combining system-wide data
of biology in order to gain a global understanding of a biological system, such as
a bacterial cell. The science is far from succeeding in this endeavor in general,
let alone having powerful techniques to understand the biology of multicellular
organisms. It is a grand challenge goal at this time. The fascinating chapter on
Computational Systems Biology Approaches for Deciphering Traditional Chinese
Medicine (Chap. 11) seeks to apply the computational systems biology (CSB)
approach to traditional Chinese medicine (TCM). The chapter sets up parallel
concepts between CSB and CTM. In Sect. 11.3.2 the main focus is “on a CSB-based
case study for TCM ZHENG—a systems biology approach with the combination
of computational analysis and animal experiment to investigate Cold ZHENG and
Hot ZHENG in the context of the neuro-endocrine-immune (NEI) system.” With
increasing emphasis on the so-called nontraditional medicine, these studies have
great potential to unlock new understandings for both CSB and TCM.
Finally I close with a few remarks about this general area. Biology is a major
science for our new century; perhaps it will be the major science of the twenty-
first century. However, if someone is not excited by biology, then they should find a
subject that does excite them. I have almost continuously found the new discoveries
such as introns or microRNA absolutely amazing. It is such a young science when
such profound wonders keep showing up. Clearly no one analysis subject can
Foreword ix
solve all the problems arising in modern computational molecular biology. Statistics
alone, computer science alone, experimental molecular biology alone, none of these
are sufficient in isolation. Protein structure studies require an entire additional set
of tools such as classical mechanics. And as systems biology comes into play,
systems of differential equations and scientific computing will surely be important.
None of us can learn everything, but everyone working in this area needs a set of
well-understood tools. We all learn new techniques as we proceed, learning things
required to solve the problems. This requires people who evolve with the subject.
This is exciting, but I admit it is hard work too. Bioinformatics will evolve as it
confronts new data created by the latest biotechnology and biological sciences.
xi
xii Contents