Abstract
We have developed an actor-critic-type policy-based reinforcement learning (RL) method to find low-energy nanoparticle structures and compared its effectiveness to classical basin-hopping. We took a molecule building approach where nanoalloy particles can be regarded as metallic molecules, albeit with much higher flexibility in structure. We explore the strengths of our approach by tasking an agent with the construction of stable mono- and bimetallic clusters. Following physics, an appropriate reward function and an equivariant molecular graph representation framework is used to learn the policy. The agent succeeds in finding well-known stable configuration for small clusters in both single and multicluster experiments. However, for certain use cases the agent lacks generalization to avoid overfitting. We relate this to the pitfalls of actor-critic methods for molecular design and discuss what learning properties an agent will require to achieve universality.
| Original language | English |
|---|---|
| Journal | Journal of Chemical Information and Modeling |
| Volume | 63 |
| Issue number | 12 |
| Pages (from-to) | 3731-3741 |
| Number of pages | 11 |
| ISSN | 1549-9596 |
| DOIs | |
| Publication status | Published - 2023 |
Bibliographical note
A.B. acknowledges financial support from VILLUM FONDEN by a research grant (00023105) for the DeepDFT project and European Union’s Horizon 2020 Research and Innovation Program under Grant Agreement No. 957189 (BIG-MAP).Fingerprint
Dive into the research topics of 'Equivariant Graph-Representation-Based Actor-Critic Reinforcement Learning for Nanoparticle Design'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver