How to Find Variable Active Galactic Nuclei with Machine Learning
Abstract
Machine-learning (ML) algorithms will play a crucial role in studying the large data sets delivered by new facilities over the next decade and beyond. Here, we investigate the capabilities and limits of such methods in finding galaxies with brightness-variable active galactic nuclei (AGNs). Specifically, we focus on an unsupervised method based on self-organizing maps (SOM) that we apply to a set of nonparametric variability estimators. This technique allows us to maintain domain knowledge and systematics control while using all the advantages of ML. Using simulated light curves that match the noise properties of observations, we verify the potential of this algorithm in identifying variable light curves. We then apply our method to a sample of ~8300 WISE color-selected AGN candidates in Stripe 82, in which we have identified variable light curves by visual inspection. We find that with ML we can identify these variable classified AGN with a purity of 86% and a completeness of 66%, a performance that is comparable to that of more commonly used supervised deep-learning neural networks. The advantage of the SOM framework is that it enables not only a robust identification of variable light curves in a given data set, but it is also a tool to investigate correlations between physical parameters in multidimensional space—such as the link between AGN variability and the properties of their host galaxies. Finally, we note that our method can be applied to any time-sampled light curve (e.g., supernovae, exoplanets, pulsars, and other transient events).
Additional Information
© 2019 The American Astronomical Society. Received 2019 June 24; revised 2019 July 23; accepted 2019 July 24; published 2019 August 6. We thank the anonymous referee for the useful comments that improved this paper. We also thank J. Tan for proofreading the manuscript. This work was supported by Joint Survey Processing (JSP) at IPAC/Caltech, which is aimed at combined analysis of Euclid, LSST, and WFIRST. This work was supported by Joint Survey Processing (JSP) at IPAC/Caltech which was funded by NASA grant 80NM0018F0803. This publication makes use of data products from the Wide-field Infrared Survey Explorer, which is a joint project of the University of California, Los Angeles, and the Jet Propulsion Laboratory/California Institute of Technology, funded by the National Aeronautics and Space Administration.Attached Files
Published - Faisst_2019_ApJL_881_L9.pdf
Accepted Version - 1908.07542.pdf
Files
Name | Size | Download all |
---|---|---|
md5:974ced31429b992864d3d70895e4d807
|
1.3 MB | Preview Download |
md5:68591d9a3efef40a0af97fc6c6cd5415
|
1.1 MB | Preview Download |
Additional details
- Eprint ID
- 97673
- Resolver ID
- CaltechAUTHORS:20190806-125453442
- Infrared Processing and Analysis Center (IPAC)
- NASA
- 80NM0018F0803
- NASA/JPL/Caltech
- Created
-
2019-08-07Created from EPrint's datestamp field
- Updated
-
2021-11-16Created from EPrint's last_modified field
- Caltech groups
- Infrared Processing and Analysis Center (IPAC)