Professional Documents
Culture Documents
Protein Structure Classification/domain Prediction: SCOP and CATH (Bioinformatics) .
Protein Structure Classification/domain Prediction: SCOP and CATH (Bioinformatics) .
selfexplanatory.2022
Saba Parvin Haque
M.Sc. Life Sciences
(Specialization in Neurobiology)
from “Sophia College”
(Autonomous), Mumbai.
BIOINFORMATICS
Protein Structure
classification/domain
prediction:
SCOP and CATH
Structural
https://images.app.goo.gl/1CxkhmPcXQNrCqep7
Classification
Of
SCOP was created in 1994 in the
Centre for Protein Engineering
(CPE) and the Laboratory of
Molecular Biology (LMB).
Protein
SCop
• It is a Largely manual classification of proteins
Basis of classification
• Similarities of structures
The domains in a fold are grouped into superfamilies, which have at least a distant
common ancestor (structural homology).
The domains in a fold in a superfamily are grouped into families, which have a more
recent common ancestor (sequence homology).
Protein Domain
The domains in a families are grouped into protein domains, which are essentially the
same protein (Functionally identical).
(Lin et al., 2013)
(SCOP| Structural Classification of Proteins | What Is Scope Database? | Classifications in SCOP?, 2021)
All alpha proteins: Secondary structure is composed entirely of
α-helices, with the possible exception of a few isolated β-sheets on
the periphery. Such as bromodomain, the globin fold etc.
(SCOP| Structural Classification of Proteins | What Is Scope Database? | Classifications in SCOP?, 2021)
All beta proteins: Secondary structure is composed entirely of β-
sheet, with the possible exception of a few isolated α-helices on the
periphery. Such as SH3 domain, the beta-propeller domain, the
immunoglobulin fold.
(SCOP| Structural Classification of Proteins | What Is Scope Database? | Classifications in SCOP?, 2021)
Alpha or beta proteins (a/b): Secondary structure is composed
alternating α-helices and β-strands along the backbone. The β-strands
are therefore mostly parallel. Such as flavodoxin fold, the TIM barrel
and leucine-rich-repeat (LRR) proteins such as ribonuclease inhibitor.
(SCOP| Structural Classification of Proteins | What Is Scope Database? | Classifications in SCOP?, 2021)
Alpha or beta proteins (a+b): Secondary structure is composed
of α-helices and β-strands that occur separately along the backbone.
The β-strands are therefore mostly antiparallel. Such as ferredoxin
fold, Pancreatic ribonuclease A and SH2 domain.
(SCOP| Structural Classification of Proteins | What Is Scope Database? | Classifications in SCOP?, 2021)
What is the source of protein structures in SCOP and CATH?
The source of protein structures in SCOP & CATH is PDB (Protein Data Bank) or UniProt ID.
https://www.rcsb.org/
https://www.uniprot.org/
Step 1
SCOP: https://scop.mrc-lmb.cam.ac.uk/
Step 2
Results
Protein Structure
classification database
Class
Architecture
Topology
The CATH database was created in the mid-1990s
by Prof Christine Orengo and colleagues, and
Homology
continues to be developed by the Orengo group at
University College London.
The overall secondary structure content of the domain.
Information on how the secondary structure elements are connected and arranged is
the basis of Topology.
CATH: https://www.cathdb.info/
Step 2
Results
REFERENCES
• Aryal, S. (2021, February 4). Protein Databases- Types and Importance. Microbe Notes.
https://microbenotes.com/protein-databases-types-and-importance/
• Malik, A. J., Poole, A. M., & Allison, J. R. (2020). Structural Phylogenetics with Confidence. Molecular
Biology and Evolution, 37(9), 2711–2726. https://doi.org/10.1093/molbev/msaa100
• Lin, C., Zou, Y., Qin, J., Liu, X., Jiang, Y., Ke, C., & Zou, Q. (2013). Hierarchical Classification of Protein
Folds Using a Novel Ensemble Classifier. PLoS ONE, 8(2), e56499.
https://doi.org/10.1371/journal.pone.0056499
• SCOP| Structural Classification of Proteins | What is scope database? | classifications in SCOP? (2021,
September 3). YouTube. https://www.youtube.com/watch?v=_yo1mRf8opA
• Sillitoe, Ian & Dawson, Natalie & Thornton, Janet & Orengo, Christine. (2015). The History of the CATH
Structural Classification of Protein Domains. Biochimie. 80. 10.1016/j.biochi.2015.08.004.
• Bukhari, Syed Abbas & Anolles, Gustavo. (2013). Origin and Evolution of Protein Fold Designs Inferred
from Phylogenomic Analysis of CATH Domain Structures in Proteomes. PLoS Computational Biology.
10.1371/journal.pcbi.1003009.
• Topics in Bioinformatics. (n.d.). . http://www.cs.cmu.edu/%7Eblmt/Seminar/SeminarMaterials/CATH.html
Thank You