Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published April 15, 2003 | Published
Journal Article Open

Majority of divergence between closely related DNA samples is due to indels

Abstract

It was recently shown that indels are responsible for more than twice as many unmatched nucleotides as are base substitutions between samples of chimpanzee and human DNA. A larger sample has now been examined and the result is similar. The number of indels is approximate to1/12th of the number of base substitutions and the average length of the indels is 36 nt, including indels up to 10 kb. The ratio (R-u) of unpaired nucleotides attributable to indels to those attributable to substitutions is 3.0 for this 2 million-nt chimp DNA sample compared with human. There is similar evidence of a large value of R-u for sea urchins from the polymorphism of a sample of Strongylocentrotus purpuratus DNA (Ru = 3-4). Other work indicates that similarly, per nucleotide affected, large differences are seen for indels in the DNA polymorphism of the plant Arabidopsis thaliana (R-u = 51). For the insect Drosophila melanogaster a high value of R-u (4.5) has been determined. For the nematode Caenorhabditis elegans the polymorphism data are incomplete but high values of R-u are likely. Comparison of two strains of Escherichia coli O157:H7 shows a preponderance of indels. Because these six examples are from very distant systematic groups the implication is that in general, for alignments of closely related DNA, indels are responsible for many more unmatched nucleotides than are base substitutions. Human genetic evidence suggests that indels are a major source of gene defects, indicating that indels are a significant source of evolutionary change.

Additional Information

Copyright © 2003 by the National Academy of Sciences. Contributed by Roy J. Britten, February 15, 2003. Published online before print April 2, 2003, 10.1073/pnas.0330964100 We thank Dmitri Petrov for help with the Drosophila data, Hugh M. Robertson for help with the C. elegans measurements, Tanya Berardini for making available Arabidopsis data, and Tetsuya Hayashi for help with the E. coli comparisons. R.A.C. was supported by Grant IBN-9982875 from the National Science Foundation Developmental Mechanisms Program.

Attached Files

Published - BRIpnas03.pdf

Files

BRIpnas03.pdf
Files (205.3 kB)
Name Size Download all
md5:33d4f67d21831af4e6104f4658fcb6e2
205.3 kB Preview Download

Additional details

Created:
August 22, 2023
Modified:
October 13, 2023