Parametric directional coding for precomputed sound propagation

Authors:
Nikunj Raghuvanshi, Microsoft Research
John Snyder, Microsoft Research

ACM Transactions on Graphics, Volume 37, Issue 4, Article No. 108, pp. 1–14. https://doi.org/10.1145/3197517.3201339

Published: 30 July 2018

Abstract

Convincing audio for games and virtual reality requires modeling directional propagation effects. The initial sound's arrival direction is particularly salient and derives from multiply-diffracted paths in complex scenes. When source and listener straddle occluders, the initial sound and multiply-scattered reverberation stream through gaps and portals, helping the listener navigate. Geometry near the source and/or listener reveals its presence through anisotropic reflections. We propose the first precomputed wave technique to capture such directional effects in general scenes comprising millions of polygons. These effects are formally represented with the 9D directional response function of 3D source and listener location, time, and direction at the listener, making memory use the major concern. We propose a novel parametric encoder that compresses this function within a budget of ~100MB for large scenes, while capturing many salient acoustic effects indoors and outdoors. The encoder is complemented with a lightweight signal processing algorithm whose filtering cost is largely insensitive to the number of sound sources, resulting in an immediately practical system.
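
The dimension count stated in the abstract can be checked with a short tally. This is only a sketch: the symbols below ($D$, $\mathbf{x}_s$, $\mathbf{x}_l$, $t$, $\mathbf{s}$) are illustrative notation assumed here, not notation taken from the abstract.

```latex
% Assumed notation (hypothetical, for illustration only):
%   x_s : source location      x_l : listener location
%   t   : time                 s   : unit direction of arrival at the listener
\[
D(\mathbf{x}_s, \mathbf{x}_l, t, \mathbf{s}), \qquad
\mathbf{x}_s, \mathbf{x}_l \in \mathbb{R}^3,\quad
t \in \mathbb{R},\quad
\mathbf{s} \in S^2
\]
% Degrees of freedom contributed by each argument:
\[
\underbrace{3}_{\text{source}}
+ \underbrace{3}_{\text{listener}}
+ \underbrace{1}_{\text{time}}
+ \underbrace{2}_{\text{direction}}
= 9
\]
```

A direction is a point on the unit sphere and so contributes two degrees of freedom rather than three. That is why the directional response has two more dimensions than a response that depends only on source location, listener location, and time.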

Supplemental Material

108-334.mp4 (mp4, 361.4 MB)

a108-raghuvanshi.mp4 (mp4, 259.4 MB)

Index Terms

1. Applied computing
  1. Arts and humanities
    1. Sound and music computing
2. Computing methodologies
  1. Computer graphics
    1. Graphics systems and interfaces
      1. Virtual reality

Published in

ACM Transactions on Graphics Volume 37, Issue 4
August 2018
1670 pages
ISSN: 0730-0301
EISSN: 1557-7368
DOI: 10.1145/3197517

Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
HRTF
directional impulse response
flux density
plane wave decomposition
sound propagation
spatial audio
vector intensity
virtual acoustics
wave equation