Published June 30, 2023 | Submitted + Supplemental Material
Report Open

Studying stochastic systems biology of the cell with single-cell genomics data

Abstract

Recent experimental developments in genome-wide RNA quantification hold considerable promise for systems biology. However, rigorously probing the biology of living cells requires a unified mathematical framework that accounts for single-molecule biological stochasticity in the context of technical variation associated with genomics assays. We review models for a variety of RNA transcription processes, as well as the encapsulation and library construction steps of microfluidics-based single-cell RNA sequencing, and present a framework to integrate these phenomena by the manipulation of generating functions. Finally, we use simulated scenarios and biological data to illustrate the implications and applications of the approach.

Additional Information

This work is licensed under a Creative Commons Attribution 4.0 International License, which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.

G.G. and L.P. were partially funded by NIH 5UM1HG012077-02 and NIH U19MH114830. J.V. was partially funded by NIH 1U19NS118246-01.

The RNA, DNA, and cDNA illustrations were derived from the DNA Twemoji by Twitter, Inc., used under the CC-BY 4.0 license.

The authors thank Dr. A. Sina Booeshaghi, Maria Carilli, Tara Chari, Taleen Dilanyan, Dr. Kristján Eldjárn Hjörleifsson, Meichen Fang, Catherine Felce, and Delaney Sullivan for fruitful discussions of co-regulation, contamination, transient behaviors, catalysis, fragmentation, genomic alignment, and a variety of other phenomena and processes. Part of this work was performed during G.G.'s Data Sciences Co-op with Celsius Therapeutics, Inc.

DATA AVAILABILITY. Notebooks that reproduce all of the results in the figures are hosted at https://github.com/pachterlab/GVP_2023. The raw data used to generate Figure 2b–c, as well as related supplementary figures, are hosted as the Zenodo package 7694182. The data and Monod fits reported in Figure 5d–e, originating from Gorin et al. (ref. 21), are hosted as the Zenodo package 7388133, and were originally generated using the notebooks and scripts at https://github.com/pachterlab/GP_2021_3/.

The authors have declared no competing interest.

Attached Files

Submitted - nihpp-2023.05.17.541250v2.pdf

Supplemental Material - media-1.xlsx

Supplemental Material - media-2.pdf

Files

Files (16.3 MB)
md5:d629c9f6ad012783b5cff0706faa12d7, 14.6 kB
md5:a826c695de58bfa95dc86494dd466fba, 11.7 MB
md5:52a3c4003b2e5fa92cd0af0279b73c19, 4.5 MB

Additional details

Created: August 20, 2023
Modified: December 22, 2023