Machine Learning Techniques for Stellar Light Curve Classification
- Creators
- Hinners, Trisha A.
- Tat, Kevin
- Thorp, Rachel
Abstract
We apply machine learning techniques in an attempt to predict and classify stellar properties from noisy and sparse time-series data. We preprocessed over 94 GB of Kepler light curves from the Mikulski Archive for Space Telescopes (MAST) to classify according to 10 distinct physical properties using both representation learning and feature engineering approaches. Studies using machine learning in the field have been primarily done on simulated data, making our study one of the first to use real light-curve data for machine learning approaches. We tuned our data using previous work with simulated data as a template and achieved mixed results between the two approaches. Representation learning using a long short-term memory recurrent neural network produced no successful predictions, but our work with feature engineering was successful for both classification and regression. In particular, we were able to achieve values for stellar density, stellar radius, and effective temperature with low error (~2%–4%) and good accuracy (~75%) for classifying the number of transits for a given star. The results show promise for improvement for both approaches upon using larger data sets with a larger minority class. This work has the potential to provide a foundation for future tools and techniques to aid in the analysis of astrophysical data.
Additional Information
© 2018. The American Astronomical Society. Received 2017 October 16; revised 2018 April 25; accepted 2018 April 26; published 2018 June 8. We would like to thank the Kepler team for making all of the data publicly available on MAST. We would also like to thank Sara Seager (MIT) and David Hogg (NYU) for discussions, encouragement, and advice. This work was supported by Northrop Grumman Corporation.Attached Files
Published - Hinners_2018_AJ_156_7.pdf
Accepted Version - 1710.06804
Files
Name | Size | Download all |
---|---|---|
md5:66b756cc6e62876498364d8311f30e14
|
1.9 MB | Preview Download |
md5:6009ff19790325ea0e78f6860cd21c1f
|
1.5 MB | Download |
Additional details
- Eprint ID
- 87396
- Resolver ID
- CaltechAUTHORS:20180627-111110805
- Northrop Grumman Corporation
- Created
-
2018-06-27Created from EPrint's datestamp field
- Updated
-
2021-11-15Created from EPrint's last_modified field