Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published March 14, 2020 | Supplemental Material + Submitted + Published
Journal Article Open

Benchmarking the Acceleration of Materials Discovery by Sequential Learning

Abstract

Sequential learning (SL) strategies, i.e. iteratively updating a machine learning model to guide experiments, have been proposed to significantly accelerate materials discovery and research. Applications on computational datasets and a handful of optimization experiments have demonstrated the promise of SL, motivating a quantitative evaluation of its ability to accelerate materials discovery, specifically in the case of physical experiments. The benchmarking effort in the present work quantifies the performance of SL algorithms with respect to a breadth of research goals: discovery of any "good" material, discovery of all "good" materials, and discovery of a model that accurately predicts the performance of new materials. To benchmark the effectiveness of different machine learning models against these goals, we use datasets in which the performance of all materials in the search space is known from high-throughput synthesis and electrochemistry experiments. Each dataset contains all pseudo-quaternary metal oxide combinations from a set of six elements (chemical space), the performance metric chosen is the electrocatalytic activity (overpotential) for the oxygen evolution reaction (OER). A diverse set of SL schemes is tested on four chemical spaces, each containing 2121 catalysts. The presented work suggests that research can be accelerated by up to a factor of 20 compared to random acquisition in specific scenarios. The results also show that certain choices of SL models are ill-suited for a given research goal resulting in substantial deceleration compared to random acquisition methods. The results provide quantitative guidance on how to tune an SL strategy for a given research goal and demonstrate the need for a new generation of materials-aware SL algorithms to further accelerate materials discovery.

Additional Information

© 2020 The Royal Society of Chemistry. This Open Access Article is licensed under a Creative Commons Attribution 3.0 Unported Licence. Received 27th November 2019, Accepted 27th January 2020, First published on 29th January 2020. This work was funded by Toyota Research Institute through the Accelerated Materials Design and Discovery program (machine learning and simulation of sequential learning); by the Joint Center for Artificial Photosynthesis, a US Department of Energy (DOE) Energy Innovation Hub, supported through the Office of Science of the DOE under Award Number DE-SC0004993 (data acquisition); and by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, under Award DE-SC0020383 (data curation and establishment of baselines). Data and code availability: All catalyst data is visualized in Fig. 2 or the ESI† and is available for interactive visualization and download at http://data.matr.io/ACE-I. Source code for benchmarking sequential learning runs against random sample selection and demonstrating the sequential learning is available at http://https://github.com/SantoshSuram-TRI/ACE-I. The compilation of data is available in that repository and also at http://https://data.caltech.edu/records/1345 (DOI: 10.22002/D1.1345). Conflicts of interest: B. R., H. S., S. S. and J. G. filed a provisional patent application on active learning enabled experimental catalyst materials discovery: US app. no. 62/837,379. The remaining authors declare no competing interests.

Attached Files

Published - c9sc05999g.pdf

Submitted - benchmark_seq_learning_chemrxiv_submit1.pdf

Supplemental Material - c9sc05999g1_si.pdf

Files

c9sc05999g.pdf
Files (10.8 MB)
Name Size Download all
md5:e4566195fc40e1e4ed2f207a16568308
1.3 MB Preview Download
md5:97c462e59a590e09a6f4658b85de1c67
7.6 MB Preview Download
md5:d096155d4d97cbb11553d211abcbbece
1.9 MB Preview Download

Additional details

Created:
August 22, 2023
Modified:
October 23, 2023