Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published June 2021 | Submitted + Accepted Version + Published
Journal Article Open

Materials representation and transfer learning for multi-property prediction

Abstract

The adoption of machine learning in materials science has rapidly transformed materials property prediction. Hurdles limiting full capitalization of recent advancements in machine learning include the limited development of methods to learn the underlying interactions of multiple elements as well as the relationships among multiple properties to facilitate property prediction in new composition spaces. To address these issues, we introduce the Hierarchical Correlation Learning for Multi-property Prediction (H-CLMP) framework that seamlessly integrates: (i) prediction using only a material's composition, (ii) learning and exploitation of correlations among target properties in multi-target regression, and (iii) leveraging training data from tangential domains via generative transfer learning. The model is demonstrated for prediction of spectral optical absorption of complex metal oxides spanning 69 three-cation metal oxide composition spaces. H-CLMP accurately predicts non-linear composition-property relationships in composition spaces for which no training data are available, which broadens the purview of machine learning to the discovery of materials with exceptional properties. This achievement results from the principled integration of latent embedding learning, property correlation learning, generative transfer learning, and attention models. The best performance is obtained using H-CLMP with transfer learning [H-CLMP(T)] wherein a generative adversarial network is trained on computational density of states data and deployed in the target domain to augment prediction of optical absorption from composition. H-CLMP(T) aggregates multiple knowledge sources with a framework that is well suited for multi-target regression across the physical sciences.

Additional Information

© 2021 Author(s). All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). Submitted: 9 February 2021. Accepted: 28 May 2021. Published Online: 23 June 2021. This paper is part of the special collection on Autonomous (AI-driven) Materials ScienceThis paper is part of the special collection on Autonomous (AI-driven) Materials Science. This work was funded by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, under Award DE-SC0020383 (data curation, design of multi-property prediction setting and transfer setting, model evaluation) and by the Toyota Research Institute through the Accelerated Materials Design and Discovery program (development of machine learning models). The authors thank Santosh K. Suram for assistance with curation of the dataset. DATA AVAILABILITY. The data that support the findings of this study are available at https://data.caltech.edu/records/1878. The source code and additional data for H-CLMP are available at https://www.cs.cornell.edu/gomes/udiscoverit/?tag=materials. The source code for H-CLMP and the cWGAN for transfer learning are also available at https://github.com/gomes-lab/H-CLMP.

Attached Files

Published - 5.0047066.pdf

Accepted Version - 2106.02225.pdf

Submitted - materials-representation-and-transfer-learning-for-multi-property-prediction.pdf

Files

2106.02225.pdf
Files (11.0 MB)
Name Size Download all
md5:105f8db3be8dea24da78f30d4e0f091b
4.3 MB Preview Download
md5:6e7f7d88c0afd655d7ea82c23ea52986
4.4 MB Preview Download
md5:4771f2d789c0f127e9791c22829b19bd
2.4 MB Preview Download

Additional details

Created:
August 20, 2023
Modified:
October 20, 2023