Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published August 15, 2017 | Accepted Version + Submitted + Published
Journal Article Open

Model-based spike sorting with a mixture of drifting t-distributions

Abstract

Background:Chronic extracellular recordings are a powerful tool for systems neuroscience, but spike sorting remains a challenge. A common approach is to fit a generative model, such as a mixture of Gaussians, to the observed spike data. Even if non-parametric methods are used for spike sorting, such generative models provide a quantitative measure of unit isolation quality, which is crucial for subsequent interpretation of the sorted spike trains. New method: We present a spike sorting strategy that models the data as a mixture of drifting t-distributions. This model captures two important features of chronic extracellular recordings—cluster drift over time and heavy tails in the distribution of spikes—and offers improved robustness to outliers. Results: We evaluate this model on several thousand hours of chronic tetrode recordings and show that it fits the empirical data substantially better than a mixture of Gaussians. We also provide a software implementation that can re-fit long datasets in a few seconds, enabling interactive clustering of chronic recordings. Comparison with existing methods: We identify three common failure modes of spike sorting methods that assume stationarity and evaluate their impact given the empirically-observed cluster drift in chronic recordings. Using hybrid ground truth datasets, we also demonstrate that our model-based estimate of misclassification error is more accurate than previous unit isolation metrics. Conclusions: The mixture of drifting t-distributions model enables efficient spike sorting of long datasets and provides an accurate measure of unit isolation quality over a wide range of conditions.

Additional Information

© 2017 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Received 27 January 2017, Revised 16 June 2017, Accepted 20 June 2017, Available online 23 June 2017. We thank all the lab members who contributed user-hours towards data collection and spike sorting (C. Wierzynski, A. Hoenselaar, M. Papadopoulou, B. Sauerbrei). Special thanks to Alex Ecker for development of the moksm MATLAB package, and Andreas Hoenselaar for development of the clustering GUI. This work was supported by the Mathers Foundation, the Moore Foundation, NSF grants 1546280, 1146871, NIH grants 1DP1OD008255/5DP1MH099907, 1R01MH113016, and iARPA contract D16PC00003.

Attached Files

Published - 1-s2.0-S016502701730225X-main.pdf

Accepted Version - nihms889924.pdf

Submitted - 109850.full.pdf

Files

1-s2.0-S016502701730225X-main.pdf
Files (11.6 MB)
Name Size Download all
md5:aa2e8a02fea35c13abf12824bc709e1c
5.4 MB Preview Download
md5:1f8125423a214ab2f1ba6a261c20cce5
2.4 MB Preview Download
md5:e845da9b6bcbcefb5260feba042c6867
3.8 MB Preview Download

Additional details

Created:
August 21, 2023
Modified:
October 23, 2023