Ab Initio Protein Structure Prediction

Ab initio protein structure prediction uses computational methods to predict a protein's 3D structure from its amino acid sequence. It involves generating multiple structural decoys through conformational searching guided by an energy function. The native structure is selected from the decoys based on having the lowest energy or highest compatibility with the amino acid sequence. Key factors for successful prediction include an accurate energy function, efficient search methods like Monte Carlo simulation and molecular dynamics, and effective model selection approaches like clustering or knowledge-based potentials.

Uploaded by

Vishnu Ajith

AB INITIO PROTEIN STRUCTURE PREDICTION
BY UTSAV KS
BIOINFORMATICS

PROTEIN STRUCTURE PREDICTION
• Protein structure prediction (PSP) is the prediction of the three-
dimensional structure of a protein from its amino acid sequence i.e. the
prediction of its tertiary structure from its primary structure.

AB INITIO MODELLING
• Ab initio modelling conducts a conformational search guided by a designed
energy function.
• This procedure usually generates a number of possible conformations
(structure decoys), from which the final models are selected.

Successful ab initio modelling depends on three factors:

• An accurate energy function, under which the native structure of a protein corresponds to
the most thermodynamically stable state compared with all possible decoy structures.
• An efficient search method that can quickly identify low-energy states through
conformational search.
• A selection procedure that picks native-like models from a pool of decoy structures.

ENERGY FUNCTIONS
• Energy functions are classified into two groups:
• Physics-based energy functions
• Knowledge-based energy functions

Physics-Based Energy Functions
• “In a strictly-defined physics-based ab initio method, interactions between
atoms should be based on quantum mechanics and the Coulomb potential
with only a few fundamental parameters such as the electron charge and
the Planck constant; all atoms should be described by their atom types
where only the number of electrons is relevant”
(Hagler et al. 1974; Weiner et al. 1984).

In practice, a compromise force field with a large number of selected atom types is used.
Within each atom type, the chemical and physical properties of the atoms are similar enough
to share parameters, which are calculated from crystal packing or quantum mechanical theory.

Well-known examples of such all-atom physics-based force fields include:
• AMBER
• CHARMM
• OPLS
• GROMOS96
These potentials contain terms for bond lengths, bond angles, torsion angles, van der
Waals interactions and electrostatic interactions.
The major differences between them lie in the selection of atom types and the interaction
parameters.
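
As a rough illustration of the term types listed above, the following Python sketch evaluates a harmonic bond term, a harmonic angle term, a periodic torsion term, a 12-6 Lennard-Jones term and a Coulomb term. The functional forms are the generic ones commonly associated with this family of force fields; the function names and every parameter value are assumptions made for illustration and are not taken from AMBER, CHARMM, OPLS or GROMOS96.

```python
import math

# Approximate Coulomb constant in kcal*Angstrom/(mol*e^2).
COULOMB_CONSTANT = 332.06


def bond_energy(r, r0, k):
    """Harmonic bond stretching: k * (r - r0)^2."""
    return k * (r - r0) ** 2


def angle_energy(theta, theta0, k):
    """Harmonic angle bending: k * (theta - theta0)^2, angles in radians."""
    return k * (theta - theta0) ** 2


def torsion_energy(phi, v_n, n, gamma):
    """Periodic torsion term: (v_n / 2) * (1 + cos(n * phi - gamma))."""
    return 0.5 * v_n * (1.0 + math.cos(n * phi - gamma))


def lennard_jones(r, epsilon, sigma):
    """12-6 van der Waals term: 4 * eps * ((sigma/r)^12 - (sigma/r)^6)."""
    sr6 = (sigma / r) ** 6
    return 4.0 * epsilon * (sr6 * sr6 - sr6)


def coulomb(r, q1, q2, dielectric=1.0):
    """Electrostatic term between two point charges (in units of e)."""
    return COULOMB_CONSTANT * q1 * q2 / (dielectric * r)


def total_energy(bonds, angles, torsions, pairs):
    """Sum all terms; each argument is a list of parameter tuples."""
    energy = sum(bond_energy(*b) for b in bonds)
    energy += sum(angle_energy(*a) for a in angles)
    energy += sum(torsion_energy(*t) for t in torsions)
    for r, epsilon, sigma, q1, q2 in pairs:
        energy += lennard_jones(r, epsilon, sigma) + coulomb(r, q1, q2)
    return energy
```

In a real force field, the difference between packages would show up in the parameter tables (one set of `r0`, `k`, `epsilon`, `sigma` and charges per atom type), not in the overall shape of this sum.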

Knowledge-Based Energy Function
• Refers to empirical energy terms derived from the statistics of solved
structures deposited in the PDB.
• Can be divided into two types:
• Generic, sequence-independent terms, such as hydrogen bonding and
the local backbone stiffness of a polypeptide chain.
• Amino acid or protein sequence-dependent terms, e.g. pairwise residue
contact potentials, distance-dependent atomic contact potentials and secondary
structure propensities.
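
The slides do not say how structure statistics become an energy. One common approach, assumed here, is the inverse Boltzmann relation E(a, b) = -kT ln(N_obs(a, b) / N_exp(a, b)): residue pairs seen in contact more often than expected by chance get a favourable (negative) energy. The sketch below applies it to invented contact counts; the function name `pair_potential`, the pseudocount and the counts themselves are hypothetical.

```python
import math

KT = 0.593  # approximate kT in kcal/mol near room temperature


def pair_potential(pair_counts, kt=KT, pseudocount=1.0):
    """Contact energies from observed counts via the inverse Boltzmann relation.

    pair_counts maps an unordered residue pair (a, b), with a <= b,
    to the number of times that contact was observed.
    """
    total = sum(pair_counts.values())
    # How often each residue type takes part in any contact.
    marginal = {}
    for (a, b), n in pair_counts.items():
        marginal[a] = marginal.get(a, 0) + n
        marginal[b] = marginal.get(b, 0) + n
    contact_ends = 2 * total
    potential = {}
    for (a, b), n in pair_counts.items():
        fa = marginal[a] / contact_ends
        fb = marginal[b] / contact_ends
        # Expected count if contacts formed independently of residue type.
        expected = total * fa * fb * (1 if a == b else 2)
        ratio = (n + pseudocount) / (expected + pseudocount)
        potential[(a, b)] = -kt * math.log(ratio)
    return potential


# Invented counts: leucine (L) and lysine (K) contacts in a toy data set.
toy_counts = {("L", "L"): 60, ("K", "L"): 20, ("K", "K"): 20}
toy_potential = pair_potential(toy_counts)
```

With these toy counts, L-L contacts are over-represented relative to chance and come out favourable, while K-L contacts are under-represented and come out unfavourable.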
Conformational Search Methods
• Successful ab initio modelling of protein structures depends on the availability of a
powerful conformational search method that can efficiently find the global minimum
energy structure for a given energy function with a complicated energy landscape.
• Types:
• Monte Carlo simulations
• Molecular dynamics
• Genetic algorithms
• Mathematical optimization

Monte Carlo Simulations
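• Its core idea is to use random samples of parameters or inputs to explore
the behavior of a complex system or process.
• In conformational search, this means proposing random changes to a structure
and accepting or rejecting each one according to the resulting change in energy.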
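
A minimal sketch of how random sampling can drive a conformational search, assuming the widely used Metropolis acceptance rule (which the slides do not name) and a made-up energy function over a handful of torsion angles. `toy_energy`, `metropolis` and all parameter values are hypothetical stand-ins, not part of any real prediction package.

```python
import math
import random


def toy_energy(angles):
    """Hypothetical energy: zero when every angle equals -1.0 rad."""
    return sum(1.0 - math.cos(a + 1.0) for a in angles)


def metropolis(energy, start, steps=5000, step_size=0.5, kt=0.3, seed=0):
    """Metropolis Monte Carlo search; returns the best conformation seen."""
    rng = random.Random(seed)
    current = list(start)
    e_current = energy(current)
    best, e_best = list(current), e_current
    for _ in range(steps):
        # Propose a random change to one randomly chosen angle.
        trial = list(current)
        i = rng.randrange(len(trial))
        trial[i] += rng.uniform(-step_size, step_size)
        e_trial = energy(trial)
        delta = e_trial - e_current
        # Always accept downhill moves; accept uphill moves with
        # probability exp(-delta / kT) so the search can escape local minima.
        if delta <= 0 or rng.random() < math.exp(-delta / kt):
            current, e_current = trial, e_trial
            if e_current < e_best:
                best, e_best = list(current), e_current
    return best, e_best
```

Calling `metropolis(toy_energy, [2.0] * 5)` starts far from the minimum and returns the lowest-energy set of angles visited. The occasional acceptance of uphill moves is what distinguishes this from a plain downhill walk.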

Molecular Dynamics
• MD simulation solves Newton's equations of motion at each step of atom
movement, which makes it probably the most faithful method for depicting, at the atomic
level, what occurs in proteins.
• The method is therefore most often used for the study of protein folding pathways.
• The long simulation time is one of the major issues of this method, since the
integration time step is usually on the order of femtoseconds (10^-15 s), while even
small proteins (fewer than 100 residues) fold on timescales of microseconds to
milliseconds in nature.
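
To make both points concrete, the sketch below integrates Newton's equations for a single particle on a harmonic spring (a stand-in for a protein, chosen only because it is easy to check) using the velocity Verlet scheme, and then works out how many time steps a folding simulation would need. The 1 fs step and 1 ms folding time are assumed round numbers; `velocity_verlet` and `spring_force` are hypothetical names.

```python
def spring_force(x, k=1.0):
    """Force on a particle attached to a harmonic spring: F = -k * x."""
    return -k * x


def velocity_verlet(x, v, force, mass, dt, steps):
    """Integrate Newton's equations of motion with the velocity Verlet scheme."""
    a = force(x) / mass
    for _ in range(steps):
        x += v * dt + 0.5 * a * dt * dt   # update position
        a_new = force(x) / mass           # force at the new position
        v += 0.5 * (a + a_new) * dt       # update velocity
        a = a_new
    return x, v


# One particle released at x = 1 with zero velocity, in arbitrary units.
x_end, v_end = velocity_verlet(1.0, 0.0, spring_force, mass=1.0, dt=0.01, steps=1000)
energy_end = 0.5 * v_end ** 2 + 0.5 * x_end ** 2  # starts at 0.5

# Cost estimate: steps needed to cover an assumed 1 ms folding time
# with an assumed 1 fs integration step.
TIMESTEP_S = 1e-15
FOLDING_TIME_S = 1e-3
steps_needed = FOLDING_TIME_S / TIMESTEP_S  # about 1e12 steps
```

Each of those roughly 10^12 steps requires evaluating the forces on every atom, which is why direct MD folding of even small proteins is so expensive.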

Genetic Algorithm
• The genetic algorithm is a method for solving problems that is based on natural
selection, the process that drives biological evolution.
• The genetic algorithm repeatedly modifies a population of individual solutions.
• At each step, the genetic algorithm selects individuals at random from the
current population to be parents and uses them to produce the children for the
next generation.
• Over successive generations, the population “evolves” toward an optimal
solution.
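
The loop described above can be sketched as follows, assuming a made-up energy function over torsion angles, truncation selection (parents drawn at random from the better half of the population), one-point crossover, Gaussian mutation and elitism. These operator choices and all names and parameter values are illustrative assumptions; real structure-prediction genetic algorithms use problem-specific operators.

```python
import math
import random


def toy_energy(angles):
    """Hypothetical energy: zero when every angle equals -1.0 rad."""
    return sum(1.0 - math.cos(a + 1.0) for a in angles)


def genetic_search(energy, n_angles=6, pop_size=30, generations=60,
                   mutation_rate=0.2, seed=0):
    """Evolve a population of angle vectors toward low energy."""
    rng = random.Random(seed)
    population = [[rng.uniform(-math.pi, math.pi) for _ in range(n_angles)]
                  for _ in range(pop_size)]
    history = []  # best energy at the start of each generation
    for _ in range(generations):
        population.sort(key=energy)
        history.append(energy(population[0]))
        parents = population[:pop_size // 2]   # keep the fitter half
        children = [list(population[0])]       # elitism: carry over the best
        while len(children) < pop_size:
            mother, father = rng.sample(parents, 2)
            cut = rng.randrange(1, n_angles)   # one-point crossover
            child = mother[:cut] + father[cut:]
            for i in range(n_angles):
                if rng.random() < mutation_rate:
                    child[i] += rng.gauss(0.0, 0.3)  # random mutation
            children.append(child)
        population = children
    return min(population, key=energy), history
```

Because the best individual is copied unchanged into the next generation, the best energy recorded in `history` never gets worse from one generation to the next.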
Mathematical Optimization
• Mathematical optimization is the selection of the best element (with regard
to some criterion) from a set of available alternatives.
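
As one small example of a deterministic optimizer, the sketch below uses steepest descent to minimise a made-up energy over three torsion angles. Steepest descent is chosen here only for brevity and is not named in the slides; it finds the nearest local minimum, so on a rugged protein energy landscape it would need to be combined with a global strategy. `toy_energy`, `toy_gradient` and the step size are hypothetical.

```python
import math


def toy_energy(angles):
    """Hypothetical energy: zero when every angle equals -1.0 rad."""
    return sum(1.0 - math.cos(a + 1.0) for a in angles)


def toy_gradient(angles):
    """Analytic gradient of toy_energy with respect to each angle."""
    return [math.sin(a + 1.0) for a in angles]


def steepest_descent(start, gradient, step=0.1, iterations=500):
    """Repeatedly move a small step against the gradient."""
    x = list(start)
    for _ in range(iterations):
        g = gradient(x)
        x = [xi - step * gi for xi, gi in zip(x, g)]
    return x


minimum = steepest_descent([2.0, 0.5, -2.5], toy_gradient)
```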

Model Selection
• The selection of protein models has emerged as a field in its own right, addressed by
Model Quality Assessment Programs (MQAPs).
• Model selection approaches can be classified into two types:
• energy-based
• free-energy-based

Physics-Based Energy Function
• Selects the decoy with the lowest energy.

Knowledge-Based Energy Function
• Sippl developed a pairwise residue-distance-based potential using the
statistics of known PDB structures (Sippl 1990); its newest version is
PROSA II (Sippl 1993; Wiederstein and Sippl 2007).
• A variety of knowledge-based potentials have since been proposed, including
atomic interaction potentials, solvation potentials, hydrogen bond potentials
and torsion angle potentials.

Sequence-Structure Compatibility Function
• The best models are not selected purely on the basis of energy functions.
• Instead, they are selected according to how compatible the target sequence is
with each model structure.
• The earliest and still successful example is that of Luthy et al. (1992), who
used threading scores to evaluate structures.
• Colovos and Yeates (1993) later used a quadratic error function to describe the
non-covalently bonded interactions among CC, CN, CO, NN, NO and OO atom pairs,
where near-native structures have fewer errors than other decoys.
Clustering of Decoy Structures
• Cluster analysis, or clustering, groups a set of objects so that objects in the
same group (a cluster) are more similar, in some sense, to each other than to those
in other groups.
• The cluster-centre conformation of the largest cluster is generally considered closer
to the native structure than the majority of decoys.
• In the work by Shortle et al. (1998), this held for all 12 cases tested: the
cluster-centre structures ranked among the top 1-5% of decoys closest to their
native structures.
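
A minimal sketch of picking the centre of the largest cluster, assuming a simple neighbour-counting scheme: the decoy with the most other decoys within an RMSD cutoff is taken as the centre. This is not necessarily the procedure used by Shortle et al. The RMSD here assumes the decoys are already superimposed, and the coordinates, cutoff and function names are invented for illustration.

```python
import math


def rmsd(a, b):
    """RMSD between two equal-length lists of (x, y, z) points.

    Assumes the two structures are already superimposed.
    """
    total = sum((p - q) ** 2 for pa, pb in zip(a, b) for p, q in zip(pa, pb))
    return math.sqrt(total / len(a))


def largest_cluster_centre(decoys, cutoff):
    """Return (index of centre decoy, indices of its cluster members)."""
    best_index, best_members = -1, []
    for i, centre in enumerate(decoys):
        members = [j for j, other in enumerate(decoys)
                   if rmsd(centre, other) <= cutoff]
        if len(members) > len(best_members):
            best_index, best_members = i, members
    return best_index, best_members


# Four toy two-atom "decoys": three similar ones and one outlier.
toy_decoys = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.1, 0.0, 0.0), (1.1, 0.0, 0.0)],
    [(0.2, 0.0, 0.0), (1.2, 0.0, 0.0)],
    [(5.0, 5.0, 5.0), (6.0, 5.0, 5.0)],
]
centre, members = largest_cluster_centre(toy_decoys, cutoff=0.15)
```

Here the middle decoy (index 1) lies within the cutoff of both of its neighbours, so it is chosen as the centre of a three-member cluster, and the outlier is left on its own.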