Protein structure prediction and modeling

Uploaded by

Israa M. Shamkh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

Protein structure prediction and modeling

Uploaded by

Israa M. Shamkh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

HOMOLOGY

MODELLING
Homology modelling
INTRODUCTION:

Homology modeling, also known as comparative modeling of protein is the
technique which allows to construct an unknown atomic-resolution model of the
"target" protein from:
1. Its amino acid sequence and
2.An experimental 3D structure of a related homologous protein (the
"template").
 Prediction of the three dimensional structure of a given protein sequence i.e. target
protein from the amino acid sequence of a homologous (template) protein for
which an X-ray or NMR structure is available based on an alignment to one or more
known protein structures.
 If similarity between the target sequence and the template sequence is
detected, structural similarity can be assumed.
 In general, 30% sequence identity is required to generate an useful model.
Sequence similarity & structural similarity
As long as the length of two sequences and the percentage of identical residues fall in the
region marked as “safe” the two sequences are practically guaranteed to adopt a similar
structure.
Homology modeling concept
Structure prediction by homology modelling
An example
 To know the structure of sequence A (150 amino acids long), 1ST of all compare sequence A
to all the sequences of known structures stored in the PDB (using, for example, BLAST), if
a sequence B (300 amino acids long) containing a region of 150 amino acids that match
sequence A with 50% identical residues.
 As this match (alignment) clearly falls in the safe zone(50%) , we can simply take the
known structure of sequence B (the template), cut out the fragment corresponding to the
aligned region, mutate those amino acids that differ between sequences A and B, and
finally arrive at our model for structure A. Structure A is called the target and is of course
not known at the time of modeling.
HISTORY
 The first homology modelling studies were done using wire and plastic models of bonds and
atoms as early as the 1960’s. The models were constructed by taking the coordinates of a
known protein structure and modified by hand for those amino acids that did not match the
structure.
 In 1969 David Phillips, Brown and co-workers published the first paper regarding
homology modelling. They modelled -lactalbumin based on the structure of hen- egg
white lysozyme. The sequence identity between these two proteins was 39%.
Steps of homology modelling
Protein Sequence
1. Template recognition
and initial alignment
Sequence alignment Database Searches
2. Alignment correction
3. Backbone generation Good Secondary structure
Structure prediction
4. Loop modeling homologue?

5. Side-chain modeling Improve alignment

using secondary
structure prediction
6.Model optimization
7.Model Validation Homology modelling
Minimisation

Three dimensional
Check model structure
1.Template recognition and initial alignment
 Template recognition & selection involves searching the PDB for homologous
proteins with determined structures. The search can be performed using simple
sequence alignment programs such as BLAST or FASTA as the percentage identity
between the Target sequence and a possible template is high enough in the safe
zone, to be detected with these programs.

 To obtain a list of hits-the modeling templates and corresponding alignments the

program compares the query sequence to all the sequences of known structures in
the PDB using mainly two matrices:

1. A residue exchange matrix

2. An alignment matrix .
2. Alignment correction

Sometimes it may be difficult to align two sequences in a region where the
percentage sequence identity is very low. One can then use other sequences from
homologous proteins to find a solution.

 For ex: To align the sequence LTLTLTLT with YAYAYAYAY which is nearly
impossible, then only a third sequence, TYTYTYTYT, that aligns easily to both of
them can solve the issue.


2 is correct, because it leads to a small gap, compared to a huge hole
associated with alignment 1.
3.Backbone generation
 When the alignment is correct, the backbone of the target can be created.
 The coordinates of the template-backbone are copied to the target.
 When the residues are identical, the side-chain coordinates are also copied.
4.LOOP MODELLING
 After the sequence alignment, there are often regions created by insertions and
deletions that lead to gaps in alignment. These gaps are modeled by loop
modeling, which is less accurate. Currently, two main techniques are used to
approach the problem:
 The database searching method - this involves finding loops from known protein
structures and superimposing them onto the two stem regions (main chains mostly)
of the target protein. Some specialized programs like FREAD and CODA can be
used.
 The ab initio method - this generates many random loops and searches for one
that has reasonably low energy and φ and ψ angles in the allowable regions in the
Ramachandran plot.
The red loop is modeled with the green
residues as anchor residues. The insertion of
2 residues results in a longer loop.
5.Side-chain modeling
 This is important in evaluating protein–ligand interactions at active sites and
protein–protein interactions at the contact interface.
 A side chain can be built by searching every possible conformation for every torsion
angle of the side chain to select the one that has the lowest interaction energy with
neighboring atoms.
 A rotamer library can also be used, which has all the favorable side chain torsion
angles extracted from known protein crystal structures.
6: model optimization
 energy minimization procedure on the entire model, by adjusting the relative
position of the atoms so that the overall conformation of the molecule has the
lowest possible energy potential. The goal is to relieve steric collisions without
altering the overall structure.
 Optimization can also be done by Molecular Dynamic Simulation which moves the
atoms toward a global minimum by applying various stimulation conditions
(heating, cooling, considering water molecules) thus having a better chance at
finding the true structure.
 Energy = Stretching Energy +Bending Energy +Torsion Energy +Non- Bonded
Interaction Energy
7.Model validation

 Every homology model contains errors. Two main reasons are:

1. The percentage sequence identity between template and target. If it is greater
than 90%, the accuracy of the model can be compared to crystallographically
determined structures & if less than 30% large error occurs
2. The number of errors in templates
 The final model has to be evaluated for checking the φ–ψ angles, chirality,
bond lengths, close contacts and also the stereo chemical properties.
Modeling Programs like Modeller, SWISS MODEL, Schrodinger, 3D-
JIGSAW.

A successful model depends on template selection, algorithm used and the
validation of the model.
Advantages
 It can find the location of alpha carbons of key residues inside the folded
protein.
 It can help to guide the mutagenesis experiments, or hypothesize structure-
function relationships.
 The positions of conserved regions of the protein surface can help identify
putative active sites, binding pockets and ligands.

Disadvantages
 Homology models are unable to predict conformations of insertions or
deletions, or side chain positions with a high level of accuracy.
 Homology models are not useful in modeling and ligand docking studies
necessary for the drug designing and development process. However, it may be
helpful for the same, if the sequence identity with the template is greater than
70%.
Ramachandran plot

 In a polypeptide the main chain (N-Calpha) and

(Calpha-C bonds) relatively are free to rotate.
These rotations are represented by the torsion
angles phi (φ) and psi(ψ ), respectively.

 A Ramachandran plot (or a [φ,ψ] plot), originally

developed in 1963 by G. N. Ramachandran, C.
Ramakrishnan, and V. Sasisekharan,is a way to
visualize backbone dihedral angles ψ against φ
of amino acid residues in protein structure.
Ramachandran plot
A Ramachandran plot can be used:
 One is to show in theory which values, or
conformations, of the ψ and φ angles are
possible for an amino-acid residue in a
protein.
 second is to show the empirical distribution
of datapoints observed in a single structure
in usage for structure validation, or else in
a database of many structures.

Python for Chemistry: An introduction to Python algorithms, Simulations, and Programing for Chemistry (English Edition)
From Everand
Python for Chemistry: An introduction to Python algorithms, Simulations, and Programing for Chemistry (English Edition)
Dr. M. Kanagasabapathy
5/5 (1)
Protein Modelling
No ratings yet
Protein Modelling
53 pages
Homolgy Modeling
No ratings yet
Homolgy Modeling
19 pages
Homology Modeling, Also Known As Comparative Modeling of
No ratings yet
Homology Modeling, Also Known As Comparative Modeling of
19 pages
2. Protein Structure Prediction
No ratings yet
2. Protein Structure Prediction
34 pages
Dr. Qudsia Yousafi
No ratings yet
Dr. Qudsia Yousafi
30 pages
Protein Modeling in Biochemistry
No ratings yet
Protein Modeling in Biochemistry
29 pages
Homology modeling
No ratings yet
Homology modeling
5 pages
Pre-Assessment Questions
No ratings yet
Pre-Assessment Questions
18 pages
Homology Modeling
No ratings yet
Homology Modeling
22 pages
Homology Modeling: Ref: Structural Bioinformatics, P.E Bourne Molecular Modeling, Folkers
No ratings yet
Homology Modeling: Ref: Structural Bioinformatics, P.E Bourne Molecular Modeling, Folkers
16 pages
Experiment-7(HOMOLOGY MODELING)
No ratings yet
Experiment-7(HOMOLOGY MODELING)
12 pages
Protein Structure Prediction Using Homology Modeling
No ratings yet
Protein Structure Prediction Using Homology Modeling
11 pages
Protein Structure Prediction.pptx
No ratings yet
Protein Structure Prediction.pptx
23 pages
Protein Modeling
No ratings yet
Protein Modeling
17 pages
Tertiary Structure Prediction Methods: Any Given Protein Sequence
No ratings yet
Tertiary Structure Prediction Methods: Any Given Protein Sequence
29 pages
Bioinformatics Notes - 17Bt54: Module - 4
No ratings yet
Bioinformatics Notes - 17Bt54: Module - 4
48 pages
Genome Sequencing Projects: Increase in The Number of Protein Sequences
No ratings yet
Genome Sequencing Projects: Increase in The Number of Protein Sequences
27 pages
Homology modeling
No ratings yet
Homology modeling
2 pages
Homology Modelling
No ratings yet
Homology Modelling
29 pages
Document (2) (14)
No ratings yet
Document (2) (14)
3 pages
Structural bioinformatics
No ratings yet
Structural bioinformatics
23 pages
Protein Structure Prediction
No ratings yet
Protein Structure Prediction
17 pages
Methods in Molecular Biology Volume Vol. 857
No ratings yet
Methods in Molecular Biology Volume Vol. 857
432 pages
Homo Logy
No ratings yet
Homo Logy
8 pages
Workshop Protein Modeling PDF
No ratings yet
Workshop Protein Modeling PDF
54 pages
Protein Modelling: (Building 3D Models of Proteins)
No ratings yet
Protein Modelling: (Building 3D Models of Proteins)
19 pages
3-D Structure of Proteins: Laws of Physics Theory of Evolution
No ratings yet
3-D Structure of Proteins: Laws of Physics Theory of Evolution
9 pages
Lec6-Protein Structure Prediction
No ratings yet
Lec6-Protein Structure Prediction
16 pages
BIF101 - II - Spring 2024
No ratings yet
BIF101 - II - Spring 2024
8 pages
7 HomologyModelling 12oct2020
No ratings yet
7 HomologyModelling 12oct2020
8 pages
Protein Side Chain Correction
No ratings yet
Protein Side Chain Correction
28 pages
3rdunitii
No ratings yet
3rdunitii
12 pages
Sanchez CurrOpinStructBiol 1997
No ratings yet
Sanchez CurrOpinStructBiol 1997
9 pages
Protein Modelling
No ratings yet
Protein Modelling
15 pages
Homology Modeling of Proteins Using Multiple Models and Consensus Sequence Alignment 1st Edition by Jahnavi Prasad, Michael Silberstein, Carlos Camacho, Sandor Vajda 9783540200765 pdf download
100% (2)
Homology Modeling of Proteins Using Multiple Models and Consensus Sequence Alignment 1st Edition by Jahnavi Prasad, Michael Silberstein, Carlos Camacho, Sandor Vajda 9783540200765 pdf download
41 pages
Structural Bioinformatics and Protein Structure Prediction (1)
No ratings yet
Structural Bioinformatics and Protein Structure Prediction (1)
14 pages
Homology Modeling of Proteins Using Multiple Models and Consensus Sequence Alignment 1st Edition by Jahnavi Prasad, Michael Silberstein, Carlos Camacho, Sandor Vajda 9783540200765 download
100% (2)
Homology Modeling of Proteins Using Multiple Models and Consensus Sequence Alignment 1st Edition by Jahnavi Prasad, Michael Silberstein, Carlos Camacho, Sandor Vajda 9783540200765 download
52 pages
Protein Tertiaty Structure Prediction
No ratings yet
Protein Tertiaty Structure Prediction
12 pages
Bif401 Solved Final Papers 2017
No ratings yet
Bif401 Solved Final Papers 2017
8 pages
Homology Modeling of Proteins Using Multiple Models and Consensus Sequence Alignment 1st Edition by Jahnavi Prasad, Michael Silberstein, Carlos Camacho, Sandor Vajda 9783540200765download
No ratings yet
Homology Modeling of Proteins Using Multiple Models and Consensus Sequence Alignment 1st Edition by Jahnavi Prasad, Michael Silberstein, Carlos Camacho, Sandor Vajda 9783540200765download
42 pages
modelling.ppt
No ratings yet
modelling.ppt
32 pages
Protein Structure Determination: Goal
No ratings yet
Protein Structure Determination: Goal
8 pages
Protein Structure Modeling
No ratings yet
Protein Structure Modeling
21 pages
Lecture 13- Protein 3 D Structure
No ratings yet
Lecture 13- Protein 3 D Structure
20 pages
Homology Modeling Tutorial
No ratings yet
Homology Modeling Tutorial
11 pages
Unit 3
No ratings yet
Unit 3
9 pages
Protein Tertiary Structures: Prediction From Amino Acid Sequences
No ratings yet
Protein Tertiary Structures: Prediction From Amino Acid Sequences
7 pages
Homology Modelling Notes PDF
No ratings yet
Homology Modelling Notes PDF
30 pages
Protein Modeling: Protein Structure Prediction Other Topics
No ratings yet
Protein Modeling: Protein Structure Prediction Other Topics
76 pages
Homology Model Prediction
No ratings yet
Homology Model Prediction
1 page
Protein Structure
No ratings yet
Protein Structure
52 pages
13 Application of Homology Modeling
No ratings yet
13 Application of Homology Modeling
7 pages
Thesis On Homology Modeling
100% (3)
Thesis On Homology Modeling
6 pages
TR_20211112_许锦波_基于深度学习的蛋白质结构预测
No ratings yet
TR_20211112_许锦波_基于深度学习的蛋白质结构预测
47 pages
Bio Chap Notes
No ratings yet
Bio Chap Notes
26 pages
Protein Modelling
No ratings yet
Protein Modelling
20 pages
Protein Structure Modelling
No ratings yet
Protein Structure Modelling
3 pages
De Novo Protein Design
No ratings yet
De Novo Protein Design
6 pages
Tools For Analyzing Comparative Protein Structure
No ratings yet
Tools For Analyzing Comparative Protein Structure
7 pages
Psychology BPS Textbooks in Psychology 1st Edition Miles Hewstone download pdf
100% (5)
Psychology BPS Textbooks in Psychology 1st Edition Miles Hewstone download pdf
61 pages
Embedded Sample Paper
0% (3)
Embedded Sample Paper
16 pages
LIFTING RING Safety Procedures
No ratings yet
LIFTING RING Safety Procedures
3 pages
(Ebook) Teaching and Learning History: Understanding the Past 11-18 by Alison Kitson and Chris Husbands with Susan Steward ISBN 0335238203 pdf download
100% (1)
(Ebook) Teaching and Learning History: Understanding the Past 11-18 by Alison Kitson and Chris Husbands with Susan Steward ISBN 0335238203 pdf download
50 pages
Year 10 - Number and Algebra - Pre Victorian Curriculum Assessment - 9.5 11 - Sample
No ratings yet
Year 10 - Number and Algebra - Pre Victorian Curriculum Assessment - 9.5 11 - Sample
12 pages
Milan Nayek: Contact Details
No ratings yet
Milan Nayek: Contact Details
2 pages
AUTOSAR CP SWS CommunicationStackTypes
No ratings yet
AUTOSAR CP SWS CommunicationStackTypes
26 pages
Applications of The Coanda Effect Ocr
No ratings yet
Applications of The Coanda Effect Ocr
9 pages
Tetric N-Ceram Bulk Fill and Bluephase N: Special
No ratings yet
Tetric N-Ceram Bulk Fill and Bluephase N: Special
24 pages
Concrete For Starters
No ratings yet
Concrete For Starters
38 pages
Soalan Sainsf1
No ratings yet
Soalan Sainsf1
13 pages
Chapter 8 - Mental Health and Well Being in Middle and Late Adolescence
100% (12)
Chapter 8 - Mental Health and Well Being in Middle and Late Adolescence
45 pages
Experiment 9 Iot
No ratings yet
Experiment 9 Iot
5 pages
793F Plano Hidraulico
No ratings yet
793F Plano Hidraulico
10 pages
LAP Amity Noida
No ratings yet
LAP Amity Noida
19 pages
Lect#5
No ratings yet
Lect#5
21 pages
CRT Picture Tube
No ratings yet
CRT Picture Tube
5 pages
Ayurveda Secret Marma Therapy Massage - Self Healing Course - Udemy
No ratings yet
Ayurveda Secret Marma Therapy Massage - Self Healing Course - Udemy
1 page
Java Ee Reference Sheet
No ratings yet
Java Ee Reference Sheet
2 pages
C Sec cp1
No ratings yet
C Sec cp1
212 pages
All Assets.2020 07 29
No ratings yet
All Assets.2020 07 29
1,938 pages
Low Noise, Cascadable Silicon Bipolar MMIC Amplifier: Technical Data
No ratings yet
Low Noise, Cascadable Silicon Bipolar MMIC Amplifier: Technical Data
4 pages
20th Century Assignment#1
No ratings yet
20th Century Assignment#1
12 pages
FC One Sheets ADSD 11182021
No ratings yet
FC One Sheets ADSD 11182021
2 pages
FC13 Bundle Pusher 1 EBM To LIFT
No ratings yet
FC13 Bundle Pusher 1 EBM To LIFT
11 pages
Geotech Engineering For CC Combine
No ratings yet
Geotech Engineering For CC Combine
21 pages
Xed Q
No ratings yet
Xed Q
3 pages
UFO FILES Black Box Ufo Secrets
No ratings yet
UFO FILES Black Box Ufo Secrets
10 pages
Three-Close-Reads Student PDF Version (1)
No ratings yet
Three-Close-Reads Student PDF Version (1)
5 pages
Designing An ESP Course For Secretary Student
No ratings yet
Designing An ESP Course For Secretary Student
10 pages

Protein structure prediction and modeling

Uploaded by

Protein structure prediction and modeling

Uploaded by

HOMOLOGY

5. Side-chain modeling Improve alignment

 To obtain a list of hits-the modeling templates and corresponding alignments the

1. A residue exchange matrix

 Every homology model contains errors. Two main reasons are:

 In a polypeptide the main chain (N-Calpha) and

 A Ramachandran plot (or a [φ,ψ] plot), originally

You might also like