
SPREAD: sound propagation and perception for autonomous agents in dynamic environments

Authors:
Pengfei Huang (University of Pennsylvania), Mubbasir Kapadia (University of Pennsylvania), Norman I. Badler (University of Pennsylvania)

SCA '13: Proceedings of the 12th ACM SIGGRAPH/Eurographics Symposium on Computer Animation, July 2013, Pages 135–144. https://doi.org/10.1145/2485895.2485911

Published: 19 July 2013

ABSTRACT

The perception of sensory information and its impact on behavior is a fundamental component of being human. While visual perception is considered for navigation, collision, and behavior selection, the acoustic domain is relatively unexplored. Recent work in acoustics focuses on synthesizing sound in 3D environments; however, the perception of acoustic signals by a virtual agent is a useful and realistic adjunct to any behavior selection mechanism. In this paper, we present SPREAD, a novel agent-based sound perception model using a discretized sound packet representation with acoustic features including amplitude, frequency range, and duration. SPREAD simulates how sound packets are propagated, attenuated, and degraded as they traverse the virtual environment. Agents perceive and classify the sounds based on the locally-received packet set using a hierarchical clustering scheme, and have individualized hearing and understanding of their surroundings. Using this model, we demonstrate several simulations that greatly enrich controls and outcomes.
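
As a rough illustration of the packet idea described in the abstract, the sketch below models a sound packet carrying the three named features (amplitude, frequency range, duration) and attenuates it as it crosses grid cells. Everything beyond those three features is an assumption made for illustration: the names `SoundPacket`, `attenuate`, and `is_audible`, the geometric per-cell loss, and the per-agent hearing threshold are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class SoundPacket:
    """Hypothetical packet holding the acoustic features named in the abstract."""
    amplitude: float      # linear amplitude, arbitrary units
    freq_low_hz: float    # lower bound of the frequency range
    freq_high_hz: float   # upper bound of the frequency range
    duration_s: float     # duration of the sound in seconds


def attenuate(packet: SoundPacket, cells: int, loss_per_cell: float = 0.1) -> SoundPacket:
    """Return a copy of the packet after it has crossed `cells` grid cells.

    Assumption: amplitude shrinks by a fixed fraction per cell. The paper's
    actual attenuation and degradation model may differ.
    """
    if cells < 0:
        raise ValueError("cells must be non-negative")
    if not 0.0 <= loss_per_cell <= 1.0:
        raise ValueError("loss_per_cell must be in [0, 1]")
    factor = (1.0 - loss_per_cell) ** cells
    return replace(packet, amplitude=packet.amplitude * factor)


def is_audible(packet: SoundPacket, hearing_threshold: float) -> bool:
    """Assumed per-agent test: a packet is heard if its amplitude meets the threshold."""
    return packet.amplitude >= hearing_threshold


if __name__ == "__main__":
    shout = SoundPacket(amplitude=1.0, freq_low_hz=200.0, freq_high_hz=2000.0, duration_s=0.5)
    received = attenuate(shout, cells=2, loss_per_cell=0.5)
    print(received.amplitude, is_audible(received, hearing_threshold=0.3))
```

Giving each agent its own `hearing_threshold` is one simple way to express the individualized hearing the abstract mentions. The hierarchical clustering used for classification is not sketched here.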

Supplemental Material

p135-huang.zip (51.6 MB)

Index Terms

1. Computing methodologies
  1. Computer graphics
    1. Animation


Published in
SCA '13: Proceedings of the 12th ACM SIGGRAPH/Eurographics Symposium on Computer Animation
July 2013
225 pages
ISBN: 9781450321327
DOI: 10.1145/2485895
Conference Chairs: Jinxiang Chai (Texas A&M University), Yizhou Yu (University of Hong Kong, China)
Program Chairs: Theodore Kim (University of California, Santa Barbara), Robert Sumner (Disney Research Zurich, Switzerland)
Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
autonomous agent
sound perception
sound propagation
sound representation
Qualifiers
- research-article
Acceptance Rates
SCA '13 Paper Acceptance Rate: 20 of 57 submissions, 35%
Overall Acceptance Rate: 183 of 487 submissions, 38%