Published December 5, 2002 | Supplemental Material
Journal Article Open

Initial sequencing and comparative analysis of the mouse genome

Abstract

The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.

Additional Information

© 2002 Macmillan Publishers Limited. Received 18 September 2002; Accepted 31 October 2002.
We thank J. Takahashi and M. Johnston for comments on the manuscript; the Mouse Liaison Group for strategic advice; L. Gaffney, D. Leja and K.-S. Toh for graphical help; B. Graham and G. Roberts for administrative work on sequencing of individual mouse BACs; and P. Kassos and M. McMurtry for secretarial assistance. We thank D. Hill and L. Corbani of the Mouse Genome Informatics Group for their contributions to the GO analysis for mouse and human, and the members of the Bork group at EMBL for discussions.
Funding was provided by the National Institutes of Health (National Human Genome Research Institute, National Cancer Institute, National Institute of Dental and Craniofacial Research, National Institute of Diabetes and Digestive and Kidney Diseases, National Institute of General Medical Sciences, National Eye Institute, National Institute of Environmental Health Sciences, National Institute of Aging, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institute on Deafness and Other Communication Disorders, National Institute of Mental Health, National Institute on Drug Abuse, National Center for Research Resources, the National Heart Lung and Blood Institute and The Fogarty International Center); the Wellcome Trust; the Howard Hughes Medical Institute; the United States Department of Energy; the National Science Foundation; the Medical Research Council; NSERC; BMBF (German Ministry for Research and Education); the European Molecular Biology Laboratory; Plan Nacional de I + D and Instituto Carlos III; Swiss National Science Foundation, NCCR Frontiers in Genetics, the Swiss Cancer League and the 'Childcare' and 'J. Lejeune' Foundations; and the Ministry of Education, Culture, Sports, Science and Technology of Japan.
The initial threefold sequence coverage was partly supported by the Mouse Sequencing Consortium (GlaxoSmithKline, Merck and Affymetrix) through the Foundation for the National Institutes of Health. We acknowledge A. Holden for coordinating the Mouse Sequencing Consortium. We thank the Sanger Institute systems group for maintenance and provision of the computer resource. The MGSC also used Hewlett-Packard Company's BioCluster, a configuration of 27 HP AlphaServer ES40 systems with 100 CPUs and 1 terabyte of storage. The BioCluster is housed in Hewlett-Packard's IQ Solutions Center, and was accessed remotely. The computing resource greatly accelerated the analysis.
Authors' contributions: The following authors contributed to project leadership: R. H. Waterston, K. Lindblad-Toh, E. Birney, J. Rogers, M. R. Brent, F. S. Collins, R. Guigó, R. C. Hardison, D. Haussler, D. B. Jaffe, W. J. Kent, W. Miller, C. P. Ponting, A. Smit, M. C. Zody and E. S. Lander.
Availability of sequence and assembly data: Unprocessed sequence reads are available from the NCBI trace archive (ftp://ftp.ncbi.nih.gov/pub/TraceDB/mus_musculus/). Raw assembly data (before removal of contaminants, anchoring to chromosomes, and addition of finished sequence) are available from the Whitehead Institute for Biomedical Research (WIBR) (ftp://wolfram.wi.mit.edu/pub/mouse_contigs/Mar10_02/). The released assembly MGSCv3 is available from Ensembl (http://www.ensembl.org/Mus_musculus/), NCBI (ftp://ftp.ncbi.nih.gov/genomes/M_musculus/MGSCv3_Release1/), UCSC (http://genome.ucsc.edu/downloads.html) and WIBR (ftp://wolfram.wi.mit.edu/pub/mouse_contigs/MGSC_V3/). (See Supplementary Information for detailed Methods.)
The author declares no competing financial interests.

Attached Files

Supplemental Material - nature01262-s1.doc

Supplemental Material - nature01262-s10.doc

Supplemental Material - nature01262-s11.jpg

Supplemental Material - nature01262-s12.jpg

Supplemental Material - nature01262-s13.jpg

Supplemental Material - nature01262-s14.jpg

Supplemental Material - nature01262-s15.jpg

Supplemental Material - nature01262-s16.jpg

Supplemental Material - nature01262-s2.doc

Supplemental Material - nature01262-s3.doc

Supplemental Material - nature01262-s4.doc

Supplemental Material - nature01262-s5.doc

Supplemental Material - nature01262-s6.doc

Supplemental Material - nature01262-s7.doc

Supplemental Material - nature01262-s8.doc

Supplemental Material - nature01262-s9.doc

Files

Files (1.0 MB)
Checksum Size
md5:afb69d161e6f0fe556e0ddab2089a002 19.5 kB
md5:4afb897b930a2369c8aed1678b8ef5ee 43.9 kB
md5:741a8fc26a6b2fbb1c5ffd3c7ada17e4 19.5 kB
md5:d8927491a1854f98c34abc7d9235afa4 107.5 kB
md5:7904908fad351c192431adda778b950d 80.3 kB
md5:c7446becc576a546e1166fa13822f0c0 126.9 kB
md5:7d5acde9d882764e17c43acc19804aa1 25.6 kB
md5:23d03e1bacecdb4653b84709a4c30ecc 38.9 kB
md5:3e7fc52c00d3a84721c80ca64348849f 122.2 kB
md5:7fb614c642dddbff337494aa42bda6c2 35.3 kB
md5:f213c9b9147f49349ed39fba884d0c8f 24.1 kB
md5:f07d963f40f27aa971c43d139eda7442 28.2 kB
md5:98b548e0bf8e30f3797ff92661e7fdc5 61.9 kB
md5:d281b0e280946f05916ae2efcd9598a7 24.6 kB
md5:8c0b9d7063216b37eaf83050c833d729 20.0 kB
md5:f09b742b4b8826c8f1cb16d305fbb03f 239.9 kB

Additional details

Created: August 19, 2023
Modified: October 24, 2023