ABSTRACT
Timeliner is a browser for long audio recordings and features that it derives from such recordings. Features can be either signal-based, like spectrograms, or model-based, like categorical classifiers. Unlike conventional audio editors, Timeliner pans and zooms smoothly across many orders of magnitude, from days-long overviews to millisecond-scale details, with zero latency, zero flicker, and low CPU load. Also, to suggest which details are worth zooming in to examine, Timeliner's agglomerative hierarchical caches propagate feature-specific details up to wider zoom levels. Because these details are not averaged away, "big data" can be browsed rapidly and effectively. Several studies demonstrate this.
- B. Arons. SpeechSkimmer: a system for interactively skimming recorded speech. ACM Transactions on Computer-Human Interaction, 4(1):3--38, 1997. Google ScholarDigital Library
- P. Bénard, A. Bousseau, and J. Thollot. Dynamic solid textures for real-time coherent stylization. In Symposium on Interactive 3D Graphics and Games (I3D), pages 121--127. ACM, 2009. Google ScholarDigital Library
- J. Blow. Mipmapping, part 1. Game Developer Magazine, 8(12):13--17, Dec. 2001.Google Scholar
- J. Blow. Mipmapping, part 2. Game Developer Magazine, 9(1):16--19, Jan. 2002.Google Scholar
- D. Cohen, C. Goudeseune, and M. Hasegawa-Johnson. Efficient simultaneous multi-scale computation of FFTs. Technical Report FODAVA-09-01, NSF/DHS FODAVA-Lead: Foundations of Data and Visual Analytics, 2009.Google Scholar
- D. Ellis. The SPRACH project. www.icsi.berkeley.edu/~riptstyle~dpwe/projects/sp%rach, 1999.Google Scholar
- D. Ellis, C. Oei, C. Wooters, and P. Faerber. Quicknet. www.icsi.berkeley.edu/Speech/qn.html, 2012.Google Scholar
- C. Goudeseune. Timeliner. http://mickey.ifp.illinois.edu/speechWiki/index.php/ Software, 2012.Google Scholar
- C. Han, E. Risser, R. Ramamoorthi, and E. Grinspun. Multiscale texture synthesis. ACM Trans. Graphics, 27(3), Aug. 2008. Google ScholarDigital Library
- M. Hasegawa-Johnson, C. Goudeseune, J. Cole, H. Kaczmarski, H. Kim, S. King, T. Mahrt, J.-T. Huang, X. Zhuang, K.-H. Lin, H. V. Sharma, Z. Li, and T. S. Huang. Multimodal speech and audio user interfaces for K-12 outreach. In Proc. Asia-Pacific Signal and Information Processing Assn., 2011.Google Scholar
- P. S. Heckbert. Fundamentals of texture mapping and image warping. Master's thesis, University of California, Berkeley, June 1989.Google ScholarDigital Library
- C.-F. Hollemeersch, B. Pieters, P. Lambert, and R. Van de Walle. A new approach to combine texture compression and filtering. The Visual Computer, 28(4):371--385, 2012. Google ScholarDigital Library
- M. Huijbregts. Segmentation, Diarization and Speech Transcription: Surprise Data Unraveled. PhD thesis, University of Twente, 2008.Google Scholar
- K.-H. Lin, X. Zhuang, C. Goudeseune, S. King, M. Hasegawa-Johnson, and T. S. Huang. Improving faster-than-real-time human acoustic event detection by saliency-maximized audio visualization. In Proc. ICASSP, pages 1--4, 2012.Google ScholarCross Ref
- S. Meignier and T. Merlin. LIUM SpkDiarization: an open source toolkit for diarization. In Carnegie-Mellon University Sphinx Workshop for Users and Developers (Carnegie Mellon University-SPUD), Mar. 2010.Google Scholar
- P. Mermelstein. Distance measures for speech recognition: Psychological and instrumental. In C. H. Chen, editor, Pattern Recognition and Artificial Intelligence, pages 374--388. Academic Press, 1976.Google Scholar
- Microsoft Corp. Silverlight. www.silverlight.net, 2012.Google Scholar
- B. Reitinger, M. Hoefler, A. Lengauer, R. Tomasi, M. Lamperter, and M. Gruber. Dragonfly: interactive visualization of huge aerial image datasets. In Proc. 21st ISPRS Congress, volume 37, pages 491--494, 2008.Google Scholar
- R. N. Shepard. Circularity in judgements of relative pitch. J. Acoust. Soc. Am., 36(12):2346--2353, 1964.Google ScholarCross Ref
- L. Williams. Pyramidal parametrics. SIGGRAPH Computer Graphics, 17(3):1--11, July 1983. Google ScholarDigital Library
- R. Williams, L. Yan, X. Zhou, L. Lu, A. Centeno, L. Kuan, M. Hawrylycz, and G. Rosen. Global exploratory analysis of massive neuroimaging collections using Microsoft Live Labs Pivot and Silverlight. In Neuroinformatics: INCF Japan Node Session Abstracts, 2010.Google Scholar
- C. Xu and S. A. Boppart. Comparative performance analysis of time-frequency distributions for spectroscopic optical coherence tomography. In Biomedical Topical Meeting, page FH9. Optical Society of America, 2004.Google Scholar
- S. Young, G. Evermann, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland. The HTK Book. Cambridge University Engineering Dept., Cambridge, UK, 2002.Google Scholar
- X. Zhuang, X. Zhou, M. A. Hasegawa-Johnson, and T. S. Huang. Real-world acoustic event detection. Pattern Recognition Letters, 31(2):1543--1551, Sept. 2010. Google ScholarDigital Library
- X. Zhuang, X. Zhou, T. S. Huang, and M. Hasegawa-Johnson. Feature analysis and selection for acoustic event detection. In Proc. ICASSP, pages 17--20, 2008.Google Scholar
Index Terms
- Effective browsing of long audio recordings
Recommendations
Interfaces for interactive audio-visual media browsing
MM '06: Proceedings of the 14th ACM international conference on MultimediaIn this demo, we present new interfaces for interactive navigation in continuous, time-based multimedia files, such as video recordings. In contrast to common techniques for multimedia skimming, our approaches enable users to interactively navigate ...
Effective Big Data Visualization
IDEAS '17: Proceedings of the 21st International Database Engineering & Applications SymposiumIn the last several years, big data analytics has found an increasing role in our everyday lives. Data visualization has long been accepted as an integral part of data analytics. However, data visualization systems are not equipped to handle the ...
Optimizing star-coordinate visualization models for effective interactive cluster exploration on big data
Interactive visual cluster analysis is the most intuitive way for finding clustering patterns, validating algorithmic clustering results, understanding data clusters with domain knowledge, and refining cluster definitions. The most challenging step is ...
Comments