Sequencing and chromosome-scale assembly of the giant Pleurodeles waltl genome
Abstract
The Iberian ribbed newt (Pleurodeles waltl) constitutes a central model for probing the basis of regeneration. Here, we present the sequencing and chromosome-scale assembly of the 20.3Gb P.waltl genome, which exhibits the highest level of contiguity and completeness among giant genomes. We uncover that DNA transposable elements are the major contributors to its expansion, with hAT transposons comprising a large portion of repeats. Several hATs are actively transcribed and differentially expressed during adult P. waltl limb regeneration, along with domesticated hAT transposons of the ZBED transcription factor family. Despite its size, syntenic relationships are conserved across the genome. As an example we show the high degree of conservation of the regeneration-associated Tig1 locus with several neighbouring genes. Together, the P. waltl genome provides a fundamental resource for the study of regenerative, developmental and evolutionary principles.
Additional Information
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license. Storage and handling of sequencing data was enabled by resources provided by the Swedish National Infrastructure for Computing (SNIC) at UPPMAX -partially funded by the Swedish Research Council through grant agreement no. 2018-05973-, the DRESDEN Concept Genome Center -part of the technology platform of the CMCB at the TU Dresden, supported by DFG (INST 269/768-1)-, the MPICBG computing cloud, and the Center for Information Services and High-Performance Computing (ZIH) at Technische Universität Dresden. Work performed at NGI/Uppsala Genome Center has been funded by RFI/VR and Science for Life Laboratory, Sweden. We thank Miho Kiyooka and Wei Chen for blastema Iso-seq library preparation and Sequel sequencing in the National Institute for Genetics, Japan. TB supported by DFG (INST 269/768-1). AE is supported by grant PID2020-115672RJ-I00 type JIN from Ministerio de Ciencia y Innovación (Spain). K.T.S is supported by JSPS KAKENHI, Grant-in Aid for Scientific Research(C), 18K06257 and 16H06279 (PAGS). NDL receives funding from the Knut and Alice Wallenberg Foundation and the Swedish Research Council (Registration # 2020-01486). AS is supported by ERC (951477), Swedish Research Council (2018-02443), KAW (2018.0040), Cancerfonden (20 0417). MHY is supported by Deutsche Forschungsgemeinschaft grants (DFG 22137416, 450807335 & 497658823) and TUD-CRTD core and seed funds. Author contributions. TB performed genome assembly and chromosome scaffolding. AE and TB performed genome annotation, with input and scripts from EO. SI performed computational genomic analysis, with input from MHY, TB, AE, AP and NDL. ES optimized and generated tissue samples for genomic and Iso-seq sequencing. AJA analysed 2n DNA content prior to tissue collection. ES, AJA and NDL coordinated sample extraction. MS, KS, TH, AT generated limb blastema Iso-seq. CO analysed data. TB performed macrosynteny comparisons. NDL developed microsynteny pipeline. MHY and AS provided scientific coordination. MHY, AS and NDL supervised the project. AS and MHY provided funding (AS: genome and Iso-seq sequencing except for limb blastema, staff; MHY: computational capacity, staff). AS and MHY edited the manuscript. MHY wrote the manuscript with contributions from all authors. Data Availability. Genome and annotation files are available through the Max Planck Digital Library at the following location: https://doi.org/10.17617/3.90C1ND and NCBI under the BioProject: PRJNA847026. PacBio HiFi, Hi-C and Iso-seq data will also be available under the same BioProject. The authors have declared no competing interest.Attached Files
Submitted - 2022.10.19.512763v1.full.pdf
Supplemental Material - media-1.pdf
Supplemental Material - media-10.pdf
Supplemental Material - media-11.pdf
Supplemental Material - media-12.pdf
Supplemental Material - media-13.pdf
Supplemental Material - media-14.pdf
Supplemental Material - media-15.txt
Supplemental Material - media-16.txt
Supplemental Material - media-17.xlsx
Supplemental Material - media-18.xlsx
Supplemental Material - media-19.pdf
Supplemental Material - media-2.pdf
Supplemental Material - media-20.xlsx
Supplemental Material - media-3.pdf
Supplemental Material - media-4.pdf
Supplemental Material - media-5.pdf
Supplemental Material - media-6.pdf
Supplemental Material - media-7.pdf
Supplemental Material - media-8.pdf
Supplemental Material - media-9.pdf
Files
Name | Size | Download all |
---|---|---|
md5:5d4e39e2680fe13e453b40d6c88f4052
|
90.8 kB | Preview Download |
md5:8b602401c4c420753b7c9ede4b02f8f8
|
280.4 kB | Preview Download |
md5:52c767543fa2053cd3868c4acb6a9a84
|
11.6 kB | Download |
md5:27dff336762d4afa8fa245ecc09f25b3
|
10.7 kB | Download |
md5:fe887b2d1527c6804181f35fd8cfe13a
|
779.8 kB | Preview Download |
md5:829c611349b7d19163fa162070b1c800
|
819.2 kB | Preview Download |
md5:b2cb5d1c46b82bba2ce968995e305756
|
100.9 kB | Preview Download |
md5:fce78039057b9c4b3c3de3821adb7082
|
616.2 kB | Preview Download |
md5:fb844b1c13b6f64567e672ccf97473db
|
79.5 kB | Preview Download |
md5:0c66a0a68112804ab834357f9ae172de
|
95.3 kB | Preview Download |
md5:7708c36a525f4f026404dea52b36a263
|
27.2 kB | Preview Download |
md5:016722f30093e6bbc2a363d46535615e
|
173.3 kB | Preview Download |
md5:9e0aa8e3a97f2f6993fb46189da0c3a9
|
100.5 kB | Preview Download |
md5:6d9db318a8c4727b8f59b356607592c7
|
3.5 MB | Preview Download |
md5:bcd917728e7601556be56a0df5443328
|
84.8 kB | Preview Download |
md5:e55b0a1d5b7afd3abe3709cfb1967eb4
|
245.1 kB | Preview Download |
md5:902ead72020f92e2e2467ef16b283922
|
22.7 kB | Preview Download |
md5:c03e45378100504a58922a0a43c137c9
|
10.6 kB | Download |
md5:58293d6687db7167f6b4b4c98d3e8d8f
|
244.7 kB | Preview Download |
md5:9290e24cb430d2a07ef790dec87cb14f
|
889.6 kB | Preview Download |
md5:4a5df86d2f4f1cbf6ec2a31294e303ea
|
54.0 kB | Preview Download |
Additional details
- Eprint ID
- 120309
- Resolver ID
- CaltechAUTHORS:20230322-101534000.13
- 2018-05973
- Swedish Research Council
- INST 269/768-1
- Deutsche Forschungsgemeinschaft (DFG)
- Science for Life Laboratory (Sweden)
- PID2020-115672RJ-I00
- Ministerio de Ciencia y Innovación (MCINN)
- 18K06257
- Japan Society for the Promotion of Science (JSPS)
- 16H06279
- Japan Society for the Promotion of Science (JSPS)
- 2018.0040
- Knut and Alice Wallenberg Foundation
- 2020-01486
- Swedish Research Council
- 951477
- European Research Council (ERC)
- 2018-02443
- Swedish Research Council
- 20 0417
- Swedish Cancer Society
- 22137416
- Deutsche Forschungsgemeinschaft (DFG)
- 450807335
- Deutsche Forschungsgemeinschaft (DFG)
- 497658823
- Deutsche Forschungsgemeinschaft (DFG)
- Technische Universität Dresden
- Created
-
2023-03-22Created from EPrint's datestamp field
- Updated
-
2023-03-22Created from EPrint's last_modified field