Sound propagation in realistic interactive 3D scenes with parameterized sources using deep neural operators

Nikolas Borrel-Jensen, Somdatta Goswami, Allan Peter Engsig-Karup, George Em Karniadakis, Cheol-Ho Jeong

Research output: Contribution to journalJournal articleResearchpeer-review

70 Downloads (Pure)

Abstract

We address the challenge of acoustic simulations in three-dimensional (3D) virtual rooms with parametric source positions, which have applications in virtual/augmented reality, game audio, and spatial computing. The wave equation can fully describe wave phenomena such as diffraction and interference. However, conventional numerical discretization methods are computationally expensive when simulating hundreds of source and receiver positions, making simulations with parametric source positions impractical. To overcome this limitation, we propose using deep operator networks to approximate linear wave-equation operators. This enables the rapid prediction of sound propagation in realistic 3D acoustic scenes with parametric source positions, achieving millisecond-scale computations. By learning a compact surrogate model, we avoid the offline calculation and storage of impulse responses for all relevant source/listener pairs. Our experiments, including various complex scene geometries, show good agreement with reference solutions, with root mean squared errors ranging from 0.02 to 0.10 Pa. Notably, our method signifies a paradigm shift as-to our knowledge-no prior machine learning approach has achieved precise predictions of complete wave fields within realistic domains.
Original languageEnglish
Article numbere2312159120
JournalProceedings of the National Academy of Sciences of the United States of America
Volume121
Issue number2
Number of pages9
ISSN0027-8424
DOIs
Publication statusPublished - 2024

Keywords

  • DeepONet
  • Domain decomposition
  • Operator learning
  • Transfer learning
  • Virtual acoustics

Fingerprint

Dive into the research topics of 'Sound propagation in realistic interactive 3D scenes with parameterized sources using deep neural operators'. Together they form a unique fingerprint.

Cite this