Title | Bioinformatics: symbol comparison tables and protein sequences | |
Authors | Vassiliki-Sofia Grech¹, Maria Grech², Antonios Togias³ and Ioanna Grech⁴
1. Bachelor of Medical Biochemistry, King’s College University, Student of Medicine, Medical University of Pecs, Hungary 2. Medical Resident in Pediatrics, Children’s Hospital Aglaia Kyriakou, Athens, Greece 3. Medical Resident in Pediatrics, General Hospital of Kalamata 4. Technological Education Institution of Athens, Egaleo, Athens, Greece |
|
Citation | Grech, V.S., Grech, M., Togias, A., Grech, I.: Bioinformatics: symbol comparison tables and protein sequences, Epitheorese Klin. Farmakol. Farmakokinet. 25(3): 115-123 (2011) | |
Publication Date | Accepted for publication: November 10, 2011 | |
Full Text Language | English | |
Order – Buy | Ηλεκτρονική Μορφή: pdf (10 €) – Digital Type: pdf (10 €)
pharmakonpress[at]pharmakonpress[.]gr |
|
Keywords | Protein sequences, symbol comparison tables, scoring matrices, alignments, 2D structure of proteins. | |
Other Terms | review article | |
Summary | Analysis of protein sequence alignment can enable the connection between different amino acids and providing important information of their evolutionary pattern. Scientists can answer questions related to human origin and genetic similarities among organisms, even create model organisms. Nevertheless, this is not a simple process and number of difficulties like sequence mutations and the huge number of alignments, are to overcome. In order to study these sequences, symbol comparison tables were created, based on statistical methods. They are used as scoring matrices for computing alignment scores, comparing any two amino acid sequences. One of the first attempts for matrixes formation is the Unitary Protein Matrix (UPM). Another simple symbol comparison table is the Genetic Code Matrix (GCM), and a third one is called Structure Genetic Matrix (SGM), published in 1972 by McLachlan. However, groupings used for the observations are artificial. For that reason, there is no assurance that a random common amino acid property is significant for functional and structural relationships between amino. The most extensively used symbol comparison tables are the Log Odd Matrix (LOM), the very similar Mutation Data Matrix (MDM) known also as Accepted Point Mutation (PAM) designed by Dayhoff in1978, Perwise Exchange Table 1991 (PET1991) created by Jones et al. at 1992 and Blocks of amino acid Substitution Matrix (BLOSUM) designed by S. & J. Henikoff in 1992. Moreover, all the pure systemized mathematical techniques do not apply directly to the natural word. Natural selection and relative mutability had to be taken into account, as well as relative mutability had to be normalized. In addition, global and local alignments are used. Local alignments, on the one hand, used mainly by BLOSUM, use part of a sequence in order to compare two amino acids. Consequently, they are more useful for dissimilar series believed to enclose regions of similarity or similar sequence motifs. Global alignments, on the other, use the whole sequence in order to align every residue in every sequence and are more useful when the sequences are closely related. PAM matrices are entrenched in global alignments of sequences being more than 85% identical. Last but not least, alignments may not be biologically significant by themselves, but through comparison tables are able to provide statistical information, reflecting structural equivalent positions in aligned families of protein. This can be the starting point for the de novo prediction of secondary protein structures. | |
References | 1. Gonnet G.H., Cohen, M.A., Benner, S.A.: Exhaustive matching of the entire protein sequence database. Science 256: 1443-1445 (1992)
2. Churbavon A.: PAM matrix for BLAST algorithm, 2002 3. http://en.wikipedia.org/wiki/Sequence_alignment 4. Paila U., Kondam R., Ranjan A.: Genome bias influences amino acid choices: analysis of amino acid substitution and recompilation of substitution matrices exclusive to an AT-biased genome. Nucleic Acids Res. 36: 6664-6675 (2008) 5. Dayhoff M.O., Schwartz R.M., Orcutt B.C.: A model of evolutionary change in proteins. In: Atlas of Protein Sequence and Structure (Dayhoff M.O, ed.). Vol. 5, Suppl. 3, pp. 345-352, Natl. Biomed. Res. Found., Washington, DC, 1978 6. Cates S.: Scoring Matrices. Connection modules:m11062, 2008 7. Jones D.T., Taylor W.R., Thornton J.M.: The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci. 8: 275-282 (1992) 8. Brick K., Pizzi E.: A novel series of compositionally biased substitution matrices for comparing Plasmodium proteins. BMC Bioinformatics 9: 236 (2008) 9. Henikoff S., Henikoff J.G.: Amino acid substitution matrices from protein blocks. Proc. Natl. Acad. Sci. USA 89: 10915-10919 (1992) 10. http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/Scoring2.html 11. Brudno M., Malde S., Poliakov A., Do C.B., Couronne O., Dubchak I., Batzoglou S.: Glocal alignment: finding rearrangements during alignment. Bioinformatics 19(Suppl 1): i54-62 (2003) 12. Eddy S.R.: Where did the BLOSUM62 alignment score matrix come from? Nat. Biotechnol 22: 1035-1036 (2004) 13. Rice D.W., Eisenberg D.: A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence. J. Mol. Biol. 267: 1026-1038 (1997) |
|
Relative Papers |
Online ISSN 1011-6575
Άρθρα Δημοσιευμένα σε αυτό το Περιοδικό Καταχωρούνται στα:
- Chemical Abstracts
- Elsevier’s Bibliographic Databases: Scopus, EMBASE, EMBiology, Elsevier BIOBASE SCImago Journal and Country Rank Factor
Articles published in this Journal are Indexed or Abstracted in: • Chemical Abstracts • Elsevier’s Bibliographic Databases: Scopus, EMBASE, EMBiology, Elsevier BIOBASE SCImago Journal and Country Rank Factor
Συντακτικη Επιτροπή-Editorial Board
ΕΤΗΣΙΑ ΣΥΝΔΡΟΜΗ 2011 – ANNUAL SUBSCRIPTION 2011 | |
Γλώσσα Πλήρους Κειμένου – Full Text Language | Αγγλικά – English |
Παραγγελία – Αγορά – Order – Buy | Ηλεκτρονική Μορφή: pdf (70 €) – Digital Type: pdf (70 €)
pharmakonpress[at]pharmakonpress[.]gr |
Έντυπη Μορφή (70 € + έξοδα αποστολής) – Printed Type (70 € + shipping)
pharmakonpress[at]pharmakonpress[.]gr |