Skip to main navigation Skip to search Skip to main content

Equivariant Graph-Representation-Based Actor-Critic Reinforcement Learning for Nanoparticle Design

Research output: Contribution to journalJournal articleResearchpeer-review

Abstract

We have developed an actor-critic-type policy-based reinforcement learning (RL) method to find low-energy nanoparticle structures and compared its effectiveness to classical basin-hopping. We took a molecule building approach where nanoalloy particles can be regarded as metallic molecules, albeit with much higher flexibility in structure. We explore the strengths of our approach by tasking an agent with the construction of stable mono- and bimetallic clusters. Following physics, an appropriate reward function and an equivariant molecular graph representation framework is used to learn the policy. The agent succeeds in finding well-known stable configuration for small clusters in both single and multicluster experiments. However, for certain use cases the agent lacks generalization to avoid overfitting. We relate this to the pitfalls of actor-critic methods for molecular design and discuss what learning properties an agent will require to achieve universality.
Original languageEnglish
JournalJournal of Chemical Information and Modeling
Volume63
Issue number12
Pages (from-to)3731-3741
Number of pages11
ISSN1549-9596
DOIs
Publication statusPublished - 2023

Bibliographical note

A.B. acknowledges financial support from VILLUM FONDEN by a research grant (00023105) for the DeepDFT project and European Union’s Horizon 2020 Research and Innovation Program under Grant Agreement No. 957189 (BIG-MAP).

Fingerprint

Dive into the research topics of 'Equivariant Graph-Representation-Based Actor-Critic Reinforcement Learning for Nanoparticle Design'. Together they form a unique fingerprint.

Cite this