The capacity of some Pólya string models
Abstract
We study random string-duplication systems, called Pólya string models, motivated by certain random mutation processes in the genome of living organisms. Unlike previous works that study the combinatorial capacity of string-duplication systems, or peripheral properties such as symbol frequency, this work provides exact capacity or bounds on it, for several probabilistic models. In particular, we give the exact capacity of the random tandem-duplication system, and the end-duplication system, and bound the capacity of the complement tandem-duplication system. Interesting connections are drawn between the former and the beta distribution common to population genetics, as well as between the latter system and signatures of random permutations.
Additional Information
© 2016 IEEE. This work was supported in part by the NSF Expeditions in Computing Program (The Molecular Programming Project).Attached Files
Submitted - 1808.06062.pdf
Files
Name | Size | Download all |
---|---|---|
md5:5de2a418ebc5653f7c2fe2a260f0aaf6
|
319.9 kB | Preview Download |
Additional details
- Eprint ID
- 69897
- DOI
- 10.1109/ISIT.2016.7541303
- Resolver ID
- CaltechAUTHORS:20160824-102815029
- NSF
- Created
-
2016-08-24Created from EPrint's datestamp field
- Updated
-
2021-11-11Created from EPrint's last_modified field