Cheminformatics: Journal of Proteomics & Bioinformatics January 2010
Cheminformatics: Journal of Proteomics & Bioinformatics January 2010
net/publication/46279984
Cheminformatics
CITATIONS READS
6 498
2 authors, including:
Nutan Prakash
Virani Science College
10 PUBLICATIONS 62 CITATIONS
SEE PROFILE
All content following this page was uploaded by Nutan Prakash on 06 June 2014.
Cheminformatics
Nutan Prakash* and Dinta A. Gareja
Department of Biotechnology, Shree M . & N. Virani Science College, India
Abstract
Cheminformatics is the application of computational methods to chemical problems, with particular emphasis on
the manipulation of structural information. Cheminformatics is a relatively new field of information technology that
focuses on the collection, storage, analysis, and manipulation of chemical data. The chemical data of interest typically
includes information on small molecule formulas, structures, properties, spectra, and activities (biological or industrial).
Cheminformatics originally emerged as a vehicle to help the drug discovery and development process; however
Cheminformatics now plays an increasingly important role in many areas of biology, chemistry, and biochemistry.
Cheminformatics can also be applied to data analysis for various industries like paper and pulp, dyes and such applied
industries.
Environmental
effects & Hazards Spectroscopy
Analysis and *Corresponding author: Nutan Prakash, Department of Biotechnology, Shree M .
Modeling
& N. Virani Science College, India, E-mail: [email protected]
Chenical and Received July 31, 2010; Accepted August 31, 2010; Published August 31,
Chemical Environment
physical reference Information 2010
Data Systems
Citation: Prakash N, Gareja DA (2010) Cheminformatics. J Proteomics Bioinform
Pharmocology 3: 249-252. doi:10.4172/jpb.1000147
Toxicology
Regulations
Copyright: © 2010 Prakash N, et al. This is an open-access article distributed
under the terms of the Creative Commons Attribution License, which permits
Figure 1: unrestricted use, distribution, and reproduction in any medium, provided the
original author and source are credited.
J Proteomics Bioinform
lish ing
Pub G
CS r
ou
OM
J Proteomics Bioinform
lish ing
Pub G
CS r
ou
OM
solve the chemical problems. So, Cheminformatics is the mixing rest with the user’s decision. In parallel, another area that is gaining
of those information resources to transform data into information greater importance is the development of filtering procedures
and information into knowledge for the intended purpose of which identifies molecules that exhibit some sort of undesirable
making better decisions faster in the area of drug lead identification characteristic (toxicity, high reactivity etc.)
and organization. Cheminformatics is the use of computer and
Without a proper knowledge base, lead optimization is a search
informational techniques, applied to a range of problems in the
in the vast darkness of chemistry space. It may lead to the wrong
field of chemistry. Also known as Cheminformatics and chemical
direction in the drug discovery program. Establishing a proper
informatics, these techniques are used in pharmaceutical companies
database with complete test results may lead to organizational
in the process of Drug Discovery.
success in drug discovery developments (Figure 8).
Current Status Combinatorial chemistry has opened up new strategies for a
Recent advances in virtual screening track computational more comprehensive parallel approach to sweeping and searching
capability and as the processing power of computers improves, so do during lead optimization, which has necessitated the development of
screening speed and complexity. Parameters such a structure, function suitable and new library design principles.
or chemical space allow for a nearly limitless array of screening
options. The use of screening data for development decision making
Recent Development in Cheminformatics
is predicated from the management and interpretation of the data. The technological developments in combinatorial synthesis and
Extraction of information from the data is the vital link between HTS have brought about an increase, by several orders of magnitude,
theoretical design and the drug candidate. Finally, it is the integration in the volumes of data that need to be processed in drug discovery
of iterative results from computation to activity that drives the cycle programmes. This explosion of both structural and bioactivity
forward. Library chemistry and high-throughput screening require data has further hastened the need to integrate two areas of
the greater use of chemo informatics to increase their effectiveness. chemical computation that had previously developed, on the whole,
However to identify types of procedure which yield the best result separately. Chemical information techniques have been developed
and address factors such as cost, availability and synthetic feasibility for the storage and retrieval of information from databases of
chemical articles and chemical structures, both corporate and public.
The computer processing required for such system is relatively
Identify Select Design Screen Final Entry
simple in nature, although extremely impressive in terms of the data
Disease Target Primary
Screen
Identify
“HIT”
Selection
of best leads
Into
Development
volumes involved (hundreds of thousands or millions of molecules).
Conversely, molecular modeling techniques have traditionally been
used for the detailed analysis of datasets that contain a few tens, or at
1,00,000
Compunds
5 Compunds most a few hundreds, of molecules, with the aim of using knowledge
about their conformations and energies, inter alia, to predict their
biological activities. Extending these methods for analyzing SARs
Figure 7: Drug discovery funnel. to data volumes typical of those routinely handled in chemical
information systems, is a data mining challenge that is now being
faced by most drug discovery organizations. Third, there is no doubt
Drug
that informatics is an idea whose time has come, or is coming, in an
Identify Leads Optimize Candidates increasingly wide range of disciplines. Bioinformatics is, of course,
now a widely recognized discipline, the establishment of which has
been driven mainly by the data explosion resulting from the Human
Effective
85% Genome and related sequencing projects. Medical informatics and
100% “Chemo”
Filter health informatics are also well established and references are
starting to appear to, for example, educational informatics and neural
INPUT Survival Rate
science informatics.
Figure 8: Need for effective chemo informatics filter. Grand Challenges for Cheminformatics
There are three “grand challenge” areas. They should be an
important focus for cheminformatics.
Overcoming stalld drug discovery
After the impressive successes in drug discovery toward the
end of the last century, productivity in the pharmaceutical industry
has declined as expenses have gone up. Cheminformatics can
help by enabling fast, cheap virtual experiments to prioritize real
experiments. As more drug discovery research is carried out in
academia, institutes and small companies, and solutions will require
pieces from cheminformatics, bioinformatics and other disciplines,
cheminformatics knowledge and tools should be made as widely
Figure 9: available as possible.
J Proteomics Bioinform
lish ing
Pub G
CS r
ou
OM
References
1. Augen J (2002) The evolving role of information technology in the drug
discovery process. Drug Discov Today 7: 315-323.
2. Barrett RW, Dower WJ, Fodor SPA, Gallop MA, Gordon EM (1994) Applications
of Combinatorial Technologies to Drug Discovery. 1. Background and Peptide
Combinatorial Libraries. J Med Chem 37: 1233-1251.
3. Brown FK (1998) Chemo informatics: what is it and how does it impact drug
discovery. Annu Rep Med Chem 33: 375-384.
4. Crippen CA, Parks GM, Topliss JG (1998) The measurement of molecular
diversity by receptor site interaction simulation. J Comput Aided Mol Des 12:
pp441-449.
the biggest challenges for mankind this century. Fundamental to this 8. Hall DG, Manku S, Wang F (2001) Solution- and Solid-Phase Strategies for
the Design, Synthesis, and Screening of Libraries Based on Natural Product
will be finding chemicals which are less polluting or less toxic to the Templates: A Comprehensive Survey. J Comb Chem 3: 125-150.
environment, or improving chemical use to minimize environmental
9. Hecht P (January 2002) High-throughput screening: beating the odds with
impact (e.g. in petrochemicals). Cheminformatics already has much informatics-driven chemistry. Curr Drug Discov 21-24.
to offer through computational toxicology and predictive modeling. 10. Karthikeyan M, Krishnan S (2002) Cheminformatics: A tool for modern drug
discovery. International Journal of Information Technology and Management 1:
Uuderstanding life from a chemical perspective 69-82.
Chemicals are being found to be increasingly important in cellular 11. Ludlow RF, Otto S (2008) Systems chemistry. Chem Soc Rev 37: 101-108.
functions, for example through small molecule modulators and 12. Moore M, Nisbet LJ (1997) Will natural products remain an important source of
epigenetic. This has led to fields such as chemical biology, and more drug research for the future? Curr Opin Biotechnol 8: pp708-712.
recently systems chemistry (Ludlow and Otto, 2008) and systems 13. Oprea TI, Tropsha A, Faulon JL, Rintoul MD (2007) Systems chemical. Nat
Chem Biol 3: 447-50.
chemical biology (Oprea et al., 2007), which seek to understand
biological systems from a chemistry perspective. Integration of 14. PubChem [http://pubchem.ncbi.nlm.nih.gov/].
Cheminformatics and bioinformatics methods will be key to this. 15. Wild DJ (2009) Grand Challenges for Cheminformatics. J Cheminform 1: 1.
J Proteomics Bioinform
lishing
Pub Gr
CS
ou
OM