Machine Learning Enabled Prediction of Solvent-Based Reaction Energetics

Mao, Yiren

doi:doi:10.7939/r3-7re0-3j25

This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

View

Download

Communities and Collections

Graduate and Postdoctoral Studies (GPS), Faculty of / Theses and Dissertations

Usage

180 views
273 downloads

Machine Learning Enabled Prediction of Solvent-Based Reaction Energetics

Author / Creator

Mao, Yiren
Despite recent advancements in molecular dynamics (MD) methods, the computational costs of \emph{ab initio} molecular dynamics simulations for explicit solvation systems are still too significant. If accuracy is to be left uncompromised, new methods must be employed to reduce computational expenses. This work focuses on the development of machine learning (ML) models as proxy models for Car-Parrinello molecular dynamics (CPMD) metadynamics simulations in condensed-phase biomass reactions.

Explicit solvation CPMD metadynamics simulation data of HMF undergoing protonation in a solution of dimethyl sulphoxide (DMSO) and water is used to train various model architectures to make time-series predictions of their probability distribution functions (PDFs). For each model architecture, three models were trained to fully predict the system, one for each of the following species: reactants, water and DMSO. Each model was tested assuming an initial simulation had been performed and the proxy models were used to complete the simulation.

The long short-term memory (LSTM) autoencoder and 3D convolutional neural networks (CNN)-LSTM autoencoder architectures failed to accurately capture PDF magnitudes and locations. A binary relevance 3D CNN-LSTM autoencoder, employing different loss functions, showed marginal improvement but struggled to predict probability locations over a large horizon. Models trained on principal component analysis (PCA)-transformed and dynamic PCA (DPCA)-transformed data showed promise in training but failed in testing. Models trained on PDFs without "dead voxels" (zero probability voxels independent of time) and atomic Cartesian coordinates perform well during training but encounter challenges in testing due to teacher forcing. Teacher forcing is a training method that can potentially make the trained model over-reliant on ground truth, which is unavailable if the model is to be used as a proxy. Despite attempts to mitigate teacher forcing effects through scheduled sampling, no model architecture achieves reliable long-term predictions without ground truth data. However, the model trained on Cartesian coordinates demonstrated proficiency in making short-term predictions regarding the atomic configuration of the system.
Subjects / Keywords
Graduation date

Spring 2024
Type of Item

Thesis
Degree

Master of Science
DOI

https://doi.org/10.7939/r3-7re0-3j25
License

This thesis is made available by the University of Alberta Libraries with permission of the copyright owner solely for non-commercial purposes. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law.

Language

English
Institution

University of Alberta
Degree level

Master's
Department
- Department of Chemical and Materials Engineering
Specialization
- Process Control
Supervisor / co-supervisor and their department(s)
- Prasad, Vinay (Chemical and Materials Engineering)
- Mushrif, Samir H. (Chemical and Materials Engineering)