ABSTRACT
Communicating technical material orally is often hindered by the relentless linearity of audio; information flows actively past a passive listener. This is in stark contrast to communication through the printed medium, where we can actively peruse the visual display to access relevant information.
ASTER is an interactive computing system for audio-formatting electronic documents (presently, documents written in (LA)TeX) to produce audio documents. ASTER can speak both literary texts and highly technical documents that contain complex mathematics. In fact, the effective speaking and interactive browsing of mathematics is a key goal of ASTER. To this end, a listener can browse both complete documents and complex mathematical expressions. ASTER thus enables active listening.
This paper describes the browsing component of ASTER. The design and implementation of ASTER are beyond the scope of this paper. Here, we focus on the browser and refer to other parts of the system only in passing, for the sake of completeness.