TY - JOUR
T1 - Predicting Enthalpy of Formation of Energetic Compounds by Machine Learning
T2 - Comparison of Featurization Methods and Algorithms
AU - Tian, Xiaolan
AU - Qi, Xiujuan
AU - Wang, Yi
AU - Wu, Junnan
AU - Song, Siwei
AU - Zhang, Qinghua
N1 - Publisher Copyright:
© 2022 Wiley-VCH GmbH.
PY - 2023/3
Y1 - 2023/3
N2 - Machine learning (ML) is an emerging approach for predicting molecular properties. The prediction of the properties of energetic molecules by ML is still in its infancy. In order to improve the accuracy of ML-based predictions, it is important to pay attention to aspects such as data preparation, model selection, and hyperparameter tuning. In this work, we focused on the influence of different featurization methods and algorithms on predicting the enthalpy of formation (EOF) of energetic compounds. We manually extracted a dataset consisting of 649 EOF values of energetic materials from the literature and compared different combinations of featurization methods and algorithms. The experimental results confirmed that ML can effectively map the relationship between molecular structure and EOF. Custom descriptor sets were found to perform best in featurization with a mean absolute error of 90.10 kJ mol−1, after training by kernel ridge regression algorithm.
AB - Machine learning (ML) is an emerging approach for predicting molecular properties. The prediction of the properties of energetic molecules by ML is still in its infancy. In order to improve the accuracy of ML-based predictions, it is important to pay attention to aspects such as data preparation, model selection, and hyperparameter tuning. In this work, we focused on the influence of different featurization methods and algorithms on predicting the enthalpy of formation (EOF) of energetic compounds. We manually extracted a dataset consisting of 649 EOF values of energetic materials from the literature and compared different combinations of featurization methods and algorithms. The experimental results confirmed that ML can effectively map the relationship between molecular structure and EOF. Custom descriptor sets were found to perform best in featurization with a mean absolute error of 90.10 kJ mol−1, after training by kernel ridge regression algorithm.
KW - Energetic compounds
KW - Enthalpy of formation
KW - Featurization
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85144346479&partnerID=8YFLogxK
U2 - 10.1002/prep.202200236
DO - 10.1002/prep.202200236
M3 - 文章
AN - SCOPUS:85144346479
SN - 0721-3115
VL - 48
JO - Propellants, Explosives, Pyrotechnics
JF - Propellants, Explosives, Pyrotechnics
IS - 4
M1 - e202200236
ER -