Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published July 8, 2022 | Supplemental Material + Submitted
Report Open

Spectral neural approximations for models of transcriptional dynamics

Abstract

Transcriptional systems involving discrete, stochastic events are naturally modeled using Chemical Master Equations (CMEs). These can be solved for microstate probabilities over time and state space for a better understanding of biological rates and system dynamics. However, closed form solutions to CMEs are available in only the simplest cases. Probing systems of higher complexity is challenging due to the computational cost of finding solutions and often compromises accuracy by treating infinite systems as finite. We use statistical understanding of system behavior and the generalizability of neural networks to approximate steady-state joint distribution solutions for a two-species model of the life cycle of RNA. We define a set of kernel functions using moments of the system and learn optimal weights for kernel functions with a neural network trained to minimize statistical distance between approximated and numerically calculated distributions. We show that this method of kernel weight regression (KWR) approximation is as accurate as lower-order generating-function solutions to the system, but faster; KWR approximation reduces the time for likelihood evaluation by several orders of magnitude. KWR also generalizes to produce probability predictions for system rates outside of training sets, thereby enabling efficient transcriptional parameter exploration and system analysis.

Additional Information

The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license. We thank Dr. Yisong Yue, Yongin Choi, Dr. John J. Vastola, and Dr. Zachary Fox for valuable discussions. G.G. and L.P. were partially funded by NIH U19MH114830. Figures 1, 2, 3, and S3 use color palettes from MetBrewer, developed by Blake R. Mills and available at https://github.com/BlakeRMills/MetBrewer. Data and code availability. All training and validation datasets, trained models, scripts to generate manuscript figures, and a small Google Colaboratory demonstration are available at https://github.com/pachterlab/GCCP_2022. The authors have declared no competing interest.

Attached Files

Submitted - 2022.06.16.496448v1.full.pdf

Supplemental Material - media-1.pdf

Files

2022.06.16.496448v1.full.pdf
Files (5.2 MB)
Name Size Download all
md5:6fcd73c236a34728aebe71f17036345c
4.0 MB Preview Download
md5:ca67e5d5810db4565aa21ec890353a96
1.2 MB Preview Download

Additional details

Created:
August 20, 2023
Modified:
December 13, 2023