Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published January 7, 2022 | Supplemental Material + Published
Journal Article Open

Echinobase: leveraging an extant model organism database to build a knowledgebase supporting research on the genomics and biology of echinoderms

Abstract

Echinobase (www.echinobase.org) is a third generation web resource supporting genomic research on echinoderms. The new version was built by cloning the mature Xenopus model organism knowledgebase, Xenbase, refactoring data ingestion pipelines and modifying the user interface to adapt to multispecies echinoderm content. This approach leveraged over 15 years of previous database and web application development to generate a new fully featured informatics resource in a single year. In addition to the software stack, Echinobase uses the private cloud and physical hosts that support Xenbase. Echinobase currently supports six echinoderm species, focused on those used for genomics, developmental biology and gene regulatory network analyses. Over 38 000 gene pages, 18 000 publications, new improved genome assemblies, JBrowse genome browser and BLAST + services are available and supported by the development of a new echinoderm anatomical ontology, uniformly applied formal gene nomenclature, and consistent orthology predictions. A novel feature of Echinobase is integrating support for multiple, disparate species. New genomes from the diverse echinoderm phylum will be added and supported as data becomes available. The common code development design of the integrated knowledgebases ensures parallel improvements as each resource evolves. This approach is widely applicable for developing new model organism informatics resources.

Additional Information

© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. Received: 13 August 2021. Revision received: 05 October 2021. Accepted: 13 October 2021. Published: 17 November 2021. Features used in Echinobase are derived from the work of many active and past Xenbase, SpBase and Echinobase developers and contributors. These individuals are listed in the 'About' page on each resource's landing page. Training and support from the Xenbase curatorial team at Cincinnati Children's Hospital was essential in building this resource, as was advice from the Echinobase Scientific Advisory Board. The contents of Echinobase are solely the responsibility of the authors and do not necessarily represent the official views of the National Institutes of Health. unding for Echinobase is provided by the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) [P41HD095831]; this work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation [ACI-1548562]; specifically, it used the Bridges system, which is supported by NSF [ACI-1445606], at the Pittsburgh Supercomputing Center (PSC) via allocation request MCB200030; J.C.C. is funded by the Centre National de la Recherche Scientifique (CNRS) and benefited from the European project 'Coordinated Research Infrastructures Building Enduring Life Science Services' (CORBEL) [H2020-INFRADEV-4-2014-2015 RIA]. Funding for open access charge: NICHD [P41HD095831]. DATA AVAILABILITY: Echinobase; http://www.echinobase.org Xenbase; http://www.xenbase.org GMOD, The Generic Model Organism Database project; http://gmod.org Legacy Echinobase, a VM running the previous generation Echinobase; http://legacy.echinobase.org FastOrtho, orthology prediction tool; https://github.com/olsonanl/FastOrtho Echinobase gene nomenclature guidelines; https://www.echinobase.org/gene/static/geneNomenclature.jsp The ECAO; https://www.echinobase.org/anatomy/ecao.do?method=display The ECAO at the OBO Foundry; http://obofoundry.org/ontology/ecao.html Conflict of interest statement. None declared.

Attached Files

Published - gkab1005.pdf

Supplemental Material - gkab1005_supplemental_file.pdf

Files

gkab1005.pdf
Files (3.8 MB)
Name Size Download all
md5:6f7816297c042a10d13dcef79e23793d
3.6 MB Preview Download
md5:7d9329902abad196ae8a9afa962f3b55
205.0 kB Preview Download

Additional details

Created:
August 22, 2023
Modified:
October 23, 2023