Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published April 2022 | Supplemental Material + Published
Journal Article Open

A machine learning toolkit for CRISM image analysis

Abstract

Hyperspectral images collected by remote sensing have played a significant role in the discovery of aqueous alteration minerals, which in turn have important implications for our understanding of the changing habitability on Mars. Traditional spectral analyzes based on summary parameters have been helpful in converting hyperspectral cubes into readily visualizable three channel maps highlighting high-level mineral composition of the Martian terrain. These maps have been used as a starting point in the search for specific mineral phases in images. Although the amount of labor needed to verify the presence of a mineral phase in an image is quite limited for phases that emerge with high abundance, manual processing becomes laborious when the task involves determining the spatial extent of detected phases or identifying small outcrops of secondary phases that appear in only a few pixels within an image. Thanks to extensive use of remote sensing data and rover expeditions, significant domain knowledge has accumulated over the years about mineral composition of several regions of interest on Mars, which allow us to collect reliable labeled data required to train machine learning algorithms. In this study we demonstrate the utility of machine learning in two essential tasks for hyperspectral data analysis: nonlinear noise removal and mineral classification. We develop a simple yet effective hierarchical Bayesian model for estimating distributions of spectral patterns and extensively validate this model for mineral classification on several test images. Our results demonstrate that machine learning can be highly effective in exposing tiny outcrops of specific phases in orbital data that are not uncovered by traditional spectral analysis. We package implemented scripts, documentation illustrating use cases, and pixel-scale training data collected from dozens of well-characterized images into a new toolkit. We hope that this new toolkit will provide advanced and effective processing tools and improve community's ability to map compositional units in remote sensing data quickly, accurately, and at scale.

Additional Information

© 2021 The Author(s). Published by Elsevier Under a Creative Commons license - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0). Received 29 June 2021, Revised 18 October 2021, Accepted 6 December 2021, Available online 1 January 2022. Thanks to the CRISM science and operations teams for their work to collect and process these datasets. Thanks to Abigail A. Fraeman for her feedback on a subset of detections. B.L.E and M.D. were sponsored by NASA, USA under Grant Number 80NSSC19K1594. M.D. was sponsored by the National Science Foundation (NSF), USA under Grant Number IIS-1252648 (CAREER). Data Availability. Training datasets and training segmentation maps can be downloaded from https://cs.iupui.edu/~mdundar/CRISM.htm. The toolkit with a user friendly Python notebook, source codes, and documentation is hosted at https://github.com/Banus/crism_ml.

Attached Files

Published - 1-s2.0-S0019103521004905-main.pdf

Supplemental Material - 1-s2.0-S0019103521004905-mmc1.pdf

Files

1-s2.0-S0019103521004905-mmc1.pdf
Files (9.8 MB)
Name Size Download all
md5:c871f0285f6e9b50887b860f6ec4f9b3
555.9 kB Preview Download
md5:3a59bed7777c33ff52f2d4b6242237f5
9.2 MB Preview Download

Additional details

Created:
August 22, 2023
Modified:
October 23, 2023