Substitution matrix: Difference between revisions

Content deleted Content added
m Background: Gave detailed citation about the consequences of sequence substitution on proteins and why they are less likely to be selected in evolution
Line 17:
A'''Q'''EI'''N'''Y'''Q'''RD
 
over a longer period of evolutionary time. Each amino acid is more or less likely to mutate into various other amino acids. For instance, a [[hydrophilic]] residue such as [[arginine]] is more likely to be replaced by another hydrophilic residue such as [[glutamine]], than it is to be mutated into a [[hydrophobic]] residue such as [[leucine]]. (Here, a residue refers to an amino acid stripped of a hydrogen and/or a [[hydroxyl group]] and inserted in the [[polymer|polymeric chain]] of a protein.) This is primarily due to redundancy in the [[genetic code]], which translates similar codons into similar amino acids. Furthermore, mutating an amino acid to a residue with significantly different properties could affect the [[protein folding|folding]] and/or activity of the protein. ThereThis istype thereforeof usuallydisruptive strongsubstitution selectiveis pressure{{dubious|date=Octoberless 2014}}likely to removebe suchselected mutationsin quicklyevolution frombecause ait populationrenders nonfunctional proteins.<ref>{{Cite book|last=Xiong|first=Jin|url=http://ebooks.cambridge.org/ref/id/CBO9780511806087|title=Essential Bioinformatics|date=2006|publisher=Cambridge University Press|isbn=978-0-511-80608-7|___location=Cambridge|doi=10.1017/cbo9780511806087.004}}</ref>
 
If we have two amino acid sequences in front of us, we should be able to say something about how likely they are to be derived from a common ancestor, or [[Sequence homology|homologous]]. If we can line up the two sequences using a [[sequence alignment]] algorithm such that the mutations required to transform a hypothetical ancestor sequence into both of the current sequences would be evolutionarily plausible, then we'd like to assign a high score to the comparison of the sequences.