Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published August 12, 2004 | public
Journal Article

Multiple-sequence functional annotation and the generalized hidden Markov phylogeny

Abstract

Motivation: Phylogenetic shadowing is a comparative genomics principle that allows for the discovery of conserved regions in sequences from multiple closely related organisms. We develop a formal probabilistic framework for combining phylogenetic shadowing with feature-based functional annotation methods. The resulting model, a generalized hidden Markov phylogeny (GHMP), applies to a variety of situations where functional regions are to be inferred from evolutionary constraints. Results: We show how GHMPs can be used to predict complete shared gene structures in multiple primate sequences. We also describe shadower, our implementation of such a prediction system. We find that shadower outperforms previously reported ab initio gene finders, including comparative human–mouse approaches, on a small sample of diverse exonic regions. Finally, we report on an empirical analysis of shadower's performance which reveals that as few as five well-chosen species may suffice to attain maximal sensitivity and specificity in exon demarcation. Availability: A Web server is available at http://bonaire.lbl.gov/shadower

Additional Information

© 2004 Oxford University Press. Received on September 20, 2003; revised on January 4, 2004; accepted on January 20, 2004. Advance Access publication February 26, 2004. We thank Dario Boffelli and Eddy Rubin for the sequence data that we have analyzed as well as many helpful discussions about the concepts of phylogenetic shadowing. We are also grateful to the anonymous referees for comments that led to several improvements. L.P. was supported in part by a grant from the NIH (R01-HG02362-02). M.J. was supported by a grant from the NSF (IIS-9988642).

Additional details

Created:
August 19, 2023
Modified:
October 24, 2023