Abstract
Although talking is an integral part of collaboration, there has been little computer support for acquiring and accessing the contents of conversations. Our approach has focused on ubiquitous audio, or the unobtrusive capture of speech interactions in everyday work environments. Speech recognition technology cannot yet transcribe fluent conversational speech, so the words themselves are not available for organizing the captured interactions. Instead, the structure of an interaction is derived from acoustical information inherent in the stored speech and augmented by user interaction during or after capture. This article describes applications for capturing and structuring audio from office discussions and telephone calls, and mechanisms for later retrieval of these stored interactions. An important aspect of retrieval is choosing an appropriate visual representation, and this article describes the evolution of a family of representations across a range of applications. Finally, this work is placed within the broader context of desktop audio, mobile audio applications, and social implications.