Protein Structure Prediction

Uploaded by

Vignesh Vignesh


SRI RAMACHANDRA

INSTITUTE OF HIGHER EDUCATION AND RESEARCH

(Deemed to be University)

Protein structure prediction

Dr. UDHAYA LAVINYA B.

ASST. PROFESSOR,
DEPT. OF BMS, SRIHER (DU)
Introduction
• Proteins differ from one another primarily in their sequence of amino acids
• This results in different spatial shape and structure and therefore different biological
functionalities in cells
• It is much easier to obtain protein sequences than to obtain their structures
• The UniProt/TrEMBL database currently contains more than 85 million protein
sequences
• On the structure side, X-ray crystallography and NMR spectroscopy are currently the
two major experimental techniques for protein structure determination
• Both are time- and labor-intensive and have their own technical limitations for
different protein targets
• As of April 2017, the PDB held ~120,000 protein structures, which is still < 0.2% of
the protein sequences in UniProt
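As a quick check of the coverage figure, the ratio can be computed directly from the two counts quoted in this section. This is only a sketch: the variable and function names are illustrative, and the counts are the rounded values given in the bullets, not exact database statistics.

```python
# Rounded counts quoted in the bullets above (not exact database statistics).
uniprot_sequences = 85_000_000  # UniProt/TrEMBL protein sequences
pdb_structures = 120_000        # PDB structures as of April 2017


def structure_coverage(n_structures: int, n_sequences: int) -> float:
    """Return the percentage of sequences that have a solved structure."""
    return 100.0 * n_structures / n_sequences


coverage = structure_coverage(pdb_structures, uniprot_sequences)
print(f"Structural coverage: {coverage:.2f}%")  # prints "Structural coverage: 0.14%"
```

With these rounded inputs the coverage comes out at about 0.14%, consistent with the "< 0.2%" stated above.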
Secondary structure prediction
• Protein secondary structure refers to the local conformation of a protein's polypeptide backbone
• There are two regular secondary structure states, α-helix (H) and β-strand (E), and one irregular secondary
structure type, the coil region (C)
• Kabsch and Sander developed a secondary structure assignment method, the Dictionary of Secondary Structure of Proteins
(DSSP)
• It automatically assigns secondary structure into eight states (H, E, B, T, S, L, G, and I) according to hydrogen-
bonding patterns
• These eight states are often further simplified into three states of helix, sheet and coil
• The most widely used convention is that G, H and I are designated as helix; B and E as sheet; and all other
states as coil
• Most commonly, the secondary structure prediction problem is formulated as follows: given a protein
sequence with amino acids, predict whether each amino acid is in the α-helix (H), β-strand (E), or coil region
(C)
• Protein secondary structure prediction is usually evaluated by Q3 accuracy: the percentage of residues
whose three-state secondary structure (H, E or C) is predicted correctly
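The eight-to-three state reduction and the Q3 measure described above can be sketched as follows. This is a minimal illustration, not a DSSP implementation: the function names and the example strings are hypothetical, and the mapping follows the convention stated in the bullets (G, H, I as helix; B, E as strand; everything else as coil).

```python
# Convention from the bullets above: G, H, I -> helix (H); B, E -> strand (E);
# every other DSSP state -> coil (C).
HELIX_STATES = set("GHI")
STRAND_STATES = set("BE")


def reduce_to_three_states(dssp_states: str) -> str:
    """Map an eight-state DSSP string to the three-state alphabet H/E/C."""
    reduced = []
    for state in dssp_states:
        if state in HELIX_STATES:
            reduced.append("H")
        elif state in STRAND_STATES:
            reduced.append("E")
        else:
            reduced.append("C")
    return "".join(reduced)


def q3_accuracy(predicted: str, observed: str) -> float:
    """Percentage of residues whose three-state label is predicted correctly."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed strings must have equal length")
    if not observed:
        raise ValueError("sequences must not be empty")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)


# Hypothetical example strings, made up for illustration only.
observed_8 = "LHHHHGGGTTEEEEBSL"
observed_3 = reduce_to_three_states(observed_8)
predicted_3 = "CHHHHHHCCCEEEECCC"
q3 = q3_accuracy(predicted_3, observed_3)
print(observed_3, f"Q3 = {q3:.1f}%")
```

Here 15 of the 17 residues match, so Q3 is about 88.2%. Note that Q3 weights every residue equally, so a predictor can score well on Q3 while still misplacing the ends of helices and strands.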
Secondary structure prediction
• Many statistical approaches and machine learning approaches have been developed to predict
secondary structure.
• One of the first approaches for predicting protein secondary structure, the Chou-Fasman method, uses a
combination of statistical and heuristic rules.
• The GOR method formalizes the secondary structure prediction problem within an information-theoretic
framework.
• The position-specific scoring matrix (PSSM) generated by PSI-BLAST captures evolutionary information and has
produced the most significant improvements in protein secondary structure prediction
• Many machine learning methods have been developed to predict protein secondary structure
• They exhibit good performance by exploiting evolutionary information, as well as statistical information
about amino acid subsequences
• For example, many neural network (NN), hidden Markov model (HMM), support vector machine (SVM) and
K-nearest neighbor methods have had substantial success, and Q3 accuracy has reached
80%.
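To make the PSSM idea concrete, the sketch below builds a toy log-odds matrix from a small, ungapped alignment. It is a simplified illustration and not PSI-BLAST's actual procedure: it assumes a uniform background frequency of 1/20 and a flat pseudocount, whereas PSI-BLAST uses sequence weighting and data-derived background and pseudocount values. The function name and the alignment are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def toy_pssm(alignment, pseudocount=1.0):
    """Return one {residue: log2-odds score} dict per alignment column.

    Assumptions (not PSI-BLAST's real scheme): uniform background
    frequency of 1/20 and the same pseudocount for every residue.
    """
    if not alignment:
        raise ValueError("alignment must contain at least one sequence")
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")

    background = 1.0 / len(AMINO_ACIDS)
    pssm = []
    for col in range(length):
        # Count standard residues only; gaps and unknown symbols are ignored.
        counts = Counter(seq[col] for seq in alignment if seq[col] in AMINO_ACIDS)
        total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
        scores = {
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        }
        pssm.append(scores)
    return pssm


# Hypothetical alignment of four homologous fragments.
alignment = ["MKVLA", "MRVLS", "MKILA", "MKVMA"]
pssm = toy_pssm(alignment)
print(f"Column 1 score for M: {pssm[0]['M']:.2f}")
print(f"Column 1 score for A: {pssm[0]['A']:.2f}")
```

A residue that is conserved in a column (M in the first column here) receives a positive score, while residues never observed there receive a negative one. Vectors of this kind, one per sequence position, are what profile-based predictors take as input.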
Secondary structure prediction
• The prediction accuracy has been continuously improved over the years,
especially by
• using hybrid or ensemble methods and
• incorporating evolutionary information in the form of profiles extracted from alignments of
multiple homologous sequences
• The highest Q3 accuracy without relying on structure templates is now at 82–84%
• DeepCNF is a deep learning extension of conditional neural fields (CNF), which
integrate conditional random fields and shallow neural networks.
• The overall performance of DeepCNF is significantly better than other state-of-the-art
methods, breaking the long-standing ~80% accuracy barrier.
• Recently SPIDER3 improved the prediction of protein secondary structure by
capturing non-local interactions using long short-term memory bidirectional
recurrent neural networks.
Frequently used tools
• PSRSM
• Protein Secondary Structure Prediction based on Data Partition and Semi-Random Subspace Method
• This method partitions the training dataset based on protein sequence length and employs a semi-random subspace technique to
train multiple classifiers. It combines predictions using a majority vote rule, achieving high accuracy across various datasets.
• Reported Q3 accuracy ranges from 85% to 86.38% on different datasets, outperforming many existing methods
• PSSpred
• Neural network-based tool that utilizes multiple sequence alignments gathered through PSI-BLAST.
• It trains separate neural networks for secondary structure prediction using amino acid frequency data.
• The final prediction is a combination of results from seven different neural network predictors
• JPred
• This server uses multiple neural networks trained on PSI-BLAST and HMMER profiles to predict both secondary structure and
solvent accessibility.
• Input Formats: Accepts sequences in various formats, including FASTA, and allows batch submissions for multiple sequences
• PSIPRED
• It uses two feed-forward neural networks to analyze the profiles produced by PSI-BLAST for secondary structure prediction
• It remains one of the most reliable tools in the field
• RaptorX-SS8
• Utilizes conditional neural fields to predict both three-state and eight-state secondary structures from protein sequences
• It is recognized for its effectiveness in structure prediction tasks
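Several of the tools listed above combine the outputs of multiple classifiers. The voting step alone can be sketched as a per-residue majority vote over three-state predictions. This is an illustrative sketch, not the implementation used by PSRSM or any other server: the function name, the tie-breaking rule (the earliest classifier among the tied states wins) and the example predictions are all assumptions.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine equal-length three-state predictions residue by residue.

    Ties are broken in favor of the state proposed by the earliest
    classifier in the list (an arbitrary choice made for this sketch).
    """
    if not predictions:
        raise ValueError("at least one prediction is required")
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all predictions must have the same length")

    consensus = []
    for states in zip(*predictions):
        counts = Counter(states)
        top = max(counts.values())
        # Walk the classifiers in order so ties resolve deterministically.
        winner = next(s for s in states if counts[s] == top)
        consensus.append(winner)
    return "".join(consensus)


# Hypothetical outputs of three classifiers for the same 11-residue sequence.
classifier_outputs = [
    "CHHHHCCEEEC",
    "CHHHCCCEEEC",
    "CCHHHCEEEEC",
]
consensus = majority_vote(classifier_outputs)
print(consensus)  # prints "CHHHHCCEEEC"
```

An odd number of classifiers reduces ties, but with three possible states they can still occur, which is why the tie-breaking rule needs to be stated explicitly.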
Tertiary structure prediction
• Three-dimensional arrangement of all the atoms in a single polypeptide chain
• Crucial for the protein's functionality
• Formed through
• various interactions among the side chains (R groups) of the amino acids that make up the protein and
• interactions between these side chains and the backbone of the polypeptide
Anfinsen’s dogma
Methods
• Similar sequences from the same evolutionary family often adopt similar protein structures
• This forms the foundation of homology modeling
• Homology modeling is the most accurate way to predict a protein structure: it uses a homologous structure in the PDB as a template
• With the rapid growth of the PDB, an increasing proportion of target proteins can be predicted via
homology modeling
• When no structure with obvious sequence similarity to the target protein can be found in the PDB, it may still be
possible to find proteins with structural similarity to the target protein
• The method used to identify such template structures in the PDB is called threading or fold recognition
• It matches the target sequence against homologous and distantly homologous structures using a scoring algorithm
and takes the best matches as structural templates
• The basic premise of threading is that protein structure is highly conserved in evolution and that the
number of unique structural folds in nature is limited
• Both homology modeling (based on sequence comparison) and threading (based on fold recognition)
can be called template-based structure prediction methods
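The choice between homology modeling and threading described above can be sketched as a decision based on sequence identity to the best available template. This is a minimal illustration under stated assumptions: the alignments are taken as given (no alignment algorithm is implemented), the 30% cutoff is a commonly quoted rule of thumb and does not come from this document, and the template identifiers and sequences are hypothetical.

```python
def percent_identity(aligned_target: str, aligned_template: str) -> float:
    """Percent identity over columns where neither sequence has a gap ('-')."""
    if len(aligned_target) != len(aligned_template):
        raise ValueError("aligned sequences must have the same length")
    pairs = [
        (a, b)
        for a, b in zip(aligned_target, aligned_template)
        if a != "-" and b != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(a == b for a, b in pairs)
    return 100.0 * matches / len(pairs)


def best_template(alignments):
    """Return (template_id, identity) for the highest-identity template."""
    scored = {
        template_id: percent_identity(target, template)
        for template_id, (target, template) in alignments.items()
    }
    best_id = max(scored, key=scored.get)
    return best_id, scored[best_id]


def choose_strategy(identity: float, threshold: float = 30.0) -> str:
    """Pick a modeling route; the 30% default is an assumed rule of thumb."""
    return "homology modeling" if identity >= threshold else "threading"


# Hypothetical pre-computed alignments: template_id -> (target, template).
alignments = {
    "tmplA": ("MKVLAAGIT-", "MKVLSAGLTQ"),
    "tmplB": ("MKVLAAGIT-", "MRTWE-GHSQ"),
}
template_id, identity = best_template(alignments)
print(template_id, f"{identity:.1f}%", choose_strategy(identity))
```

In practice the servers listed below rely on profile-based alignments and structural scoring rather than raw identity, so this only illustrates where the dividing line between the two template-based approaches lies.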
Frequently used tools
FALCON2
• Integrates template-based modeling (ProALIGN) and ab initio prediction (ProFOLD).
• FALCON2 simultaneously utilizes both approaches to enhance prediction accuracy. ProALIGN aligns the target protein with known
templates, while ProFOLD uses a neural network to estimate inter-residue distances.
• The server includes quality assessment tools to select the best candidate structures from predictions, demonstrating improved
accuracy through the integration of methods.
AlphaFold
• Deep learning-based approach.
• Developed by DeepMind, AlphaFold has achieved remarkable success in predicting protein structures by utilizing attention
mechanisms to model the relationships between amino acids.
• It has set new benchmarks in structure prediction, particularly in the CASP competitions, showcasing its ability to predict complex
structures with high accuracy.
I-TASSER
• Threading and fragment assembly.
• I-TASSER predicts protein structures by threading target sequences through known structures and assembling fragments based on
these templates.
• It is widely used for generating structural models when experimental data is lacking.
Frequently used tools
Phyre2
• Template-based modeling.
• Phyre2 predicts protein structures by aligning sequences with known structures and generating models based on
these alignments.
• It provides a user-friendly interface for researchers to input sequences and receive structural predictions.
MODELLER
• Homology modeling.
• MODELLER builds models based on homologous proteins with known structures, allowing users to create accurate
models for target proteins.
• Offers extensive options for model refinement and evaluation.
RaptorX
• Remote homology detection and threading.
• RaptorX combines template-based methods with ab initio approaches to predict protein structures effectively.
• It provides detailed structural predictions along with confidence scores.
