Major
Chemistry
Anticipated Graduation Year
2021
Access Type
Open Access
Abstract
Investigating the correlations within protein primary structures is of long-standing interest. Approaches across decades have relied primarily on alignment and database algorithms. We take a different path by applying the Burrows-Wheeler transform (BWT) to archetypal proteins, including insulin, myoglobin, and lysozyme. We construct information-conserving transforms of protein sequences expressed by organisms placed at different points of the evolutionary tree. The transforms illuminate molecular evolution properties that are otherwise obscure in the sequences.
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License.
A Study of the Internal Correlations of Protein Sequences Using Non-Alignment Methods: Application of the Burrows-Wheeler Transform
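The Burrows-Wheeler transform named in the abstract can be sketched in a few lines. What follows is a minimal illustration of the textbook rotation-sorting construction and its inverse, not the authors' implementation. The function names `bwt` and `inverse_bwt`, the `$` end-of-sequence sentinel, and the short peptide string are assumptions made for this example. The round trip at the end shows why the transform is called information-conserving: the original sequence is recovered exactly from the transformed one.

```python
def bwt(sequence: str, sentinel: str = "$") -> str:
    """Return the Burrows-Wheeler transform of `sequence`.

    A sentinel that sorts before every residue letter is appended so the
    transform can be inverted. '$' precedes 'A'-'Z' in ASCII order.
    """
    if sentinel in sequence:
        raise ValueError("sentinel must not occur in the sequence")
    text = sequence + sentinel
    # All cyclic rotations of the text, sorted lexicographically.
    rotations = sorted(text[i:] + text[:i] for i in range(len(text)))
    # The transform is the last column of the sorted rotation table.
    return "".join(rotation[-1] for rotation in rotations)


def inverse_bwt(transformed: str, sentinel: str = "$") -> str:
    """Recover the original sequence from its Burrows-Wheeler transform.

    Naive reconstruction: repeatedly prepend the transform as a new first
    column and re-sort, which rebuilds the sorted rotation table.
    """
    n = len(transformed)
    table = [""] * n
    for _ in range(n):
        table = sorted(transformed[i] + table[i] for i in range(n))
    # The row ending in the sentinel is the original text plus sentinel.
    original = next(row for row in table if row.endswith(sentinel))
    return original[:-1]


# Classic textbook check.
print(bwt("banana"))  # prints annb$aa

# Hypothetical peptide string in one-letter amino acid codes,
# used here only to exercise the round trip.
peptide = "GIVEQCCTSICSLYQLENYCN"
transformed = bwt(peptide)
print(transformed)
print(inverse_bwt(transformed) == peptide)  # prints True
```

Sorting all rotations costs roughly quadratic memory in the sequence length, which is acceptable for single proteins of a few hundred residues. Suffix-array constructions are the usual choice for longer inputs.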