Minh Duc Cao, Trevor I. Dix, and Lloyd Allison
in 'Software Tools and Algorithms for Biological Systems',
Advances in Experimental Medicine and Biology (AEMB),
A biological compression model, the expert model,
is presented that is superior to existing compression algorithms
in both compression performance and speed.
The model is able to compress whole eukaryotic genomes.
Most importantly, the model provides a framework for knowledge discovery
from biological data.
It can be used for
repeat element discovery,
sequence alignment and
We demonstrate that the model can handle statistically biased sequences and
distantly related sequences where conventional knowledge discovery tools