Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published February 2022 | Published + Supplemental Material
Journal Article Open

Unique mobile elements and scalable gene flow at the prokaryote–eukaryote boundary revealed by circularized Asgard archaea genomes

Abstract

Eukaryotic genomes are known to have garnered innovations from both archaeal and bacterial domains but the sequence of events that led to the complex gene repertoire of eukaryotes is largely unresolved. Here, through the enrichment of hydrothermal vent microorganisms, we recovered two circularized genomes of Heimdallarchaeum species that belong to an Asgard archaea clade phylogenetically closest to eukaryotes. These genomes reveal diverse mobile elements, including an integrative viral genome that bidirectionally replicates in a circular form and aloposons, transposons that encode the 5,000 amino acid-sized proteins Otus and Ephialtes. Heimdallaechaeal mobile elements have garnered various genes from bacteria and bacteriophages, likely playing a role in shuffling functions across domains. The number of archaea- and bacteria-related genes follow strikingly different scaling laws in Asgard archaea, exhibiting a genome size-dependent ratio and a functional division resembling the bacteria- and archaea-derived gene repertoire across eukaryotes. Bacterial gene import has thus likely been a continuous process unaltered by eukaryogenesis and scaled up through genome expansion. Our data further highlight the importance of viewing eukaryogenesis in a pan-Asgard context, which led to the proposal of a conceptual framework, that is, the Heimdall nucleation–decentralized innovation–hierarchical import model that accounts for the emergence of eukaryotic complexity.

Additional Information

© The Author(s) 2022. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. Received 12 May 2021; Accepted 29 November 2021; Published 13 January 2022. We thank W. Fischer for critical comments on the manuscript, L. Kelly for advice on viral sequence analysis, A. Roger for discussions on phylogenetic methods and K. Makarova and E. Koonin for discussions on CRISPR–Cas systems. We thank the pilots, crew and participants on the cruises to the southern Pescadero Basin, FK181031 on R/V Falkor operated by the Schmidt Ocean Institute and NA091 on E/V Nautilus operated by the Ocean Exploration Trust, with NA091 supported by the Dalio Foundation and Woods Hole Oceanographic Institute. This research used samples provided by the Ocean Exploration Trust's Nautilus Exploration Program, cruise NA091. We thank chief scientists S. Wankel and A. Michel for the opportunity to sail on NA091, Co-Chief Scientists D. Caress and R. Zierenberg on FK181031, and S. Wankel, A. Foulk and L. Marsh, R. Zierenberg and D. Cardace for assistance with shipboard processing of rock samples and J. Magyar and S. Goffredi for shipboard processing of sediment samples. Illumina library construction and Nanopore sequencing were performed at the Millard and Muriel Jacobs Genetics and Genomics Laboratory at Caltech. F.W. was supported by the Netherlands Organisation for Scientific Research Rubicon Award no. 019.162LW.037 and a Human Frontiers Science Program Long-term Fellowship no. LT000468/2017. D.R.S. was supported by the Netherlands Organisation for Scientific Research Rubicon Award no. 019.153LW.039 and the Caltech GPS Division Texaco Postdoctoral Fellowship. J.P.A. is funded by the National Science Foundation (NSF) no. OCE-1431598. V.J.O. is a Canadian Institute for Advanced Science fellow in the Earth 4D program. This research was supported by a Caltech Center for Evolutionary Science Pilot Grant (F.W. and V.J.O.), the NOMIS Foundation (V.J.O.), the Simons Foundation Principles of Microbial Ecosystems project (V.J.O.) and the NSF Center for Dark Energy Biosphere Investigations (no. OCE-0939564, V.J.O. and J.P.A.). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. Data availability: The assembled genomes and raw metagenomic sequencing reads can be found on the National Center for Biotechnology Information database under BioProject no. PRJNA721962. Source data are provided with this paper. Code availability: The custom script for recoding of amino acid sequences to four-state SR4 can be found at https://github.com/dspeth/bioinfo_scripts/tree/master/phylogeny. Other custom scripts can be found at https://github.com/wufabai/genomics. Author Contributions: F.W., D.R.S., A.C. and V.J.O. conceived the project. D.R.S. and V.J.O. collected the hydrothermal vent samples. F.W., D.R.S., A.C. and S.A.C. carried out the microbial incubations, periodic sampling, DNA extraction, sulphide analyses and amplicon sequencing. I.A.A. and A.N. prepared the Illumina sequencing libraries. I.A.A. performed the Oxford Nanopore sequencing. F.W., I.A.A. and A.P. assembled the Asgard archaea genomes. D.R.S. performed the phylogenomic analyses, protein clustering and overall bioinformatics platform support. R.A.B. performed the ANI/AAI analyses and taxonomic evaluation. F.W., D.R.S. and A.P. annotated the genomes. F.W. performed the PacBio HiFi 16S sequencing, protein phylogenetic analyses, marker HMM construction, comparative genomics, CRISPR/mobilome discovery, statistical analyses and wrote the paper. V.J.O. revised the paper. D.R.S., A.P., A.C., R.A.B. and J.P.A. provided critical comments on the paper. All authors read and approved the manuscript. V.J.O. and J.P.A. supervised the work. The authors declare no competing interests. Peer review information: Nature Microbiology thanks Brett Baker and the other, anonymous, reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Attached Files

Published - s41564-021-01039-y.pdf

Supplemental Material - 41564_2021_1039_Fig10_ESM.webp

Supplemental Material - 41564_2021_1039_Fig11_ESM.webp

Supplemental Material - 41564_2021_1039_Fig12_ESM.webp

Supplemental Material - 41564_2021_1039_Fig13_ESM.webp

Supplemental Material - 41564_2021_1039_Fig14_ESM.webp

Supplemental Material - 41564_2021_1039_Fig15_ESM.webp

Supplemental Material - 41564_2021_1039_Fig7_ESM.webp

Supplemental Material - 41564_2021_1039_Fig8_ESM.webp

Supplemental Material - 41564_2021_1039_Fig9_ESM.webp

Supplemental Material - 41564_2021_1039_MOESM10_ESM.xlsx

Supplemental Material - 41564_2021_1039_MOESM11_ESM.xlsx

Supplemental Material - 41564_2021_1039_MOESM12_ESM.txt

Supplemental Material - 41564_2021_1039_MOESM13_ESM.txt

Supplemental Material - 41564_2021_1039_MOESM14_ESM.xlsx

Supplemental Material - 41564_2021_1039_MOESM15_ESM.xlsx

Supplemental Material - 41564_2021_1039_MOESM16_ESM.txt

Supplemental Material - 41564_2021_1039_MOESM17_ESM.xlsx

Supplemental Material - 41564_2021_1039_MOESM1_ESM.pdf

Supplemental Material - 41564_2021_1039_MOESM2_ESM.pdf

Supplemental Material - 41564_2021_1039_MOESM3_ESM.pdf

Supplemental Material - 41564_2021_1039_MOESM4_ESM.zip

Supplemental Material - 41564_2021_1039_MOESM5_ESM.zip

Supplemental Material - 41564_2021_1039_MOESM6_ESM.zip

Supplemental Material - 41564_2021_1039_MOESM7_ESM.zip

Supplemental Material - 41564_2021_1039_MOESM8_ESM.zip

Supplemental Material - 41564_2021_1039_MOESM9_ESM.xlsx

Files

41564_2021_1039_MOESM2_ESM.pdf
Files (16.8 MB)
Name Size Download all
md5:3f0522b1b842d1916e88e002005fe063
76.5 kB Download
md5:a5d694ad6207817b74789dbc3d6c895c
90.7 kB Download
md5:2ac4fc78bb6fdaf99267863ddc7880ce
191.5 kB Download
md5:cfd722127b553fef07fa6ac213e5e790
10.5 kB Download
md5:f8f77fc77306cae50c11ea08ac857ffa
74.3 kB Preview Download
md5:3e06304097ac05251e4a93c372c3f202
306.8 kB Download
md5:a2ed500ca1789a3ee8084ba0af15c3d7
19.9 kB Preview Download
md5:122e71708c8b75888d9380a74b0c2e81
30.5 kB Preview Download
md5:ebb1cdc250b04b8c9cb83ae88a57cd70
2.9 kB Preview Download
md5:006c6952984b6f9d1311ce9387897f11
11.4 kB Download
md5:b6b734858a0b2a532e95e0ef34f6a2ff
298.3 kB Preview Download
md5:d004861b875862bc13057b9cd2972e29
444.3 kB Preview Download
md5:10a396af14971a397603e8e24c1f682a
10.5 MB Preview Download
md5:0752cbc62c284ff0f565e4e7be5bacbe
9.0 kB Download
md5:f7e8f615745c725e3da42cb87779dce5
18.6 kB Download
md5:3c7ba0f780b0b559ea494b4823daf7fb
4.2 kB Preview Download
md5:7fabf3ad240fc8c22a152340c1180b87
316.7 kB Download
md5:4b99fa3d614fc9e372ffe3e1144813ec
286.6 kB Download
md5:ba6a41c671017e9f6bb7f87eaaadc767
178.5 kB Download
md5:22d3599944ce0d37bd08192b0420bda1
130.9 kB Download
md5:249734e3db2bb36f4488051d2eb913e7
2.3 MB Preview Download
md5:223ca176489b589b204a273a635d2d87
10.7 kB Download
md5:e1ca56dcad46d6a55d1dc0f0f2f88b01
16.3 kB Preview Download
md5:0ee2edd67179a84fb4987f2697aef8ca
120.4 kB Preview Download
md5:0cfce2b248e5f673f898adc2c88b80db
11.2 kB Download
md5:b2cce9ef444ed49fbf813c9c2e7363bf
1.1 MB Preview Download
md5:d5b042c652495a6e22bae0c08bd2097a
238.7 kB Download

Additional details

Created:
August 22, 2023
Modified:
December 22, 2023