Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published August 20, 2018 | Submitted
Report Open

The Capacity of Some Pólya String Models

Abstract

We study random string-duplication systems, which we call Pólya string models. These are motivated by DNA storage in living organisms, and certain random mutation processes that affect their genome. Unlike previous works that study the combinatorial capacity of string-duplication systems, or various string statistics, this work provides exact capacity or bounds on it, for several probabilistic models. In particular, we study the capacity of noisy string-duplication systems, including the tandem-duplication, end-duplication, and interspersed-duplication systems. Interesting connections are drawn between some systems and the signature of random permutations, as well as to the beta distribution common in population genetics.

Additional Information

The material in this paper was presented in part at the 2016 IEEE International Symposium on Information Theory.

Attached Files

Submitted - etr142.pdf

Files

etr142.pdf
Files (357.2 kB)
Name Size Download all
md5:434c0e69e7438d944b0fd26b37901320
357.2 kB Preview Download

Additional details

Created:
August 19, 2023
Modified:
January 14, 2024