ACM Home Page
Please provide us with feedback. Feedback
A speech interface for building musical score collections
Full text PdfPdf (89 KB)
Source International Conference on Digital Libraries archive
Proceedings of the fifth ACM conference on Digital libraries table of contents
San Antonio, Texas, United States
Pages: 165 - 173  
Year of Publication: 2000
ISBN:1-58113-231-X
Authors
Lloyd A. Smith  NM Highlands University, Las Vegas, NM
Eline F. Chiu  Deloitte Consulting, Ltd., Wellington, New Zealand
Brian L. Scott  Universal Interface, Inc., Denton, TX
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
SIGIR: ACM Special Interest Group on Information Retrieval
SIGLINK: Hypertext, Hypermedia, and Web
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 31,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/336597.336657
What is a DOI?

ABSTRACT

Building machine readable collections of musical scores is a tedious and time consuming task. The most common interface for performing music data entry is a mouse and toolbar system; using the mouse, the user selects a rhythm (note shape) from a toolbar, then drags the note to the correct position on the staff. We compare the usability of a hybrid speech and mouse-driven interface to a traditional mouse-driven one. The speech-enhanced interface allows users to enter note rhythms by voice, while still using the mouse to indicate pitches. While task completion time is nearly the same, users (N=13) significantly preferred the speech-augmented interface. A second study using the first two authors of this paper (N=2) indicates that experienced users can enter music 11% faster with the speech interface. Many users expressed a desire to enter pitches, as well as rhythms, by speech. A third study, however, shows that the recognizer is unable to reliably distinguish among A, B, C, D, E, F and G (N=10).


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
Belkin, A. Macintosh notation software: Present and future. Computer Music Journal 18 (1994), pp. 53-69.
 
4
B lostein, D. and Haken, L. Template matching for rhythmic analysis of music keyboard input, in Proc. 10# International Conference on Pattern Recognition (Atlantic City, N.J., 1990).
 
5
Carter, N. Music Score Recognition: Problems and Prospects. Computing in Musicology 9 (1993), pp. 152-158.
 
6
Choral Public Domain Library {Online}. http:// www.cpdl.org, 1998.
 
7
Huron, D. Design principles in computer based music representation, in A. Marsden and A. Pople, Eds. Computer Representations and Models in Music. Academic Press, London, 1992.
 
8
IBM, Inc. ViaVoice Developer's Comer {Online}. http ://www. software.ibm.com/is/voicetype/dev_home. html, 1998.
9
 
10
Wildcat Canyon Software: Autoscore {Online}. http://www.wildcat.com, 1999.


Collaborative Colleagues:
Lloyd A. Smith: colleagues
Eline F. Chiu: colleagues
Brian L. Scott: colleagues

Peer to Peer - Readers of this Article have also read: