학술논문

Sound propagation in realistic interactive 3D scenes with parameterized sources using deep neural operators.
Document Type
Academic Journal
Author
Borrel-Jensen N; Department of Electrical and Photonics Engineering, Acoustic Technology, Technical University of Denmark, Kongens Lyngby 2800, Denmark.; Goswami S; Division of Applied Mathematics, Brown University, Providence, RI 02906.; Engsig-Karup AP; Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kongens Lyngby 2800, Denmark.; Karniadakis GE; Division of Applied Mathematics, Brown University, Providence, RI 02906.; School of Engineering, Brown University, Providence, RI 02906.; Jeong CH; Department of Electrical and Photonics Engineering, Acoustic Technology, Technical University of Denmark, Kongens Lyngby 2800, Denmark.
Source
Publisher: National Academy of Sciences Country of Publication: United States NLM ID: 7505876 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1091-6490 (Electronic) Linking ISSN: 00278424 NLM ISO Abbreviation: Proc Natl Acad Sci U S A Subsets: PubMed not MEDLINE; MEDLINE
Subject
Language
English
Abstract
We address the challenge of acoustic simulations in three-dimensional (3D) virtual rooms with parametric source positions, which have applications in virtual/augmented reality, game audio, and spatial computing. The wave equation can fully describe wave phenomena such as diffraction and interference. However, conventional numerical discretization methods are computationally expensive when simulating hundreds of source and receiver positions, making simulations with parametric source positions impractical. To overcome this limitation, we propose using deep operator networks to approximate linear wave-equation operators. This enables the rapid prediction of sound propagation in realistic 3D acoustic scenes with parametric source positions, achieving millisecond-scale computations. By learning a compact surrogate model, we avoid the offline calculation and storage of impulse responses for all relevant source/listener pairs. Our experiments, including various complex scene geometries, show good agreement with reference solutions, with root mean squared errors ranging from 0.02 to 0.10 Pa. Notably, our method signifies a paradigm shift as-to our knowledge-no prior machine learning approach has achieved precise predictions of complete wave fields within realistic domains.
Competing Interests: Competing interests statement:G.E.K. provides technical advice on direction in machine learning to Anailytica, a private startup company developing AI software products for engineering. He has a very small equity for his work. The rest of the authors declare no competing interest.