| A speech interface for building musical score collections |
| Full text |
Pdf
(89 KB)
|
| Source
|
International Conference on Digital Libraries
archive
Proceedings of the fifth ACM conference on Digital libraries
table of contents
San Antonio, Texas, United States
Pages: 165 - 173
Year of Publication: 2000
ISBN:1-58113-231-X
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 31, Citation Count: 1
|
|
|
ABSTRACT
Building machine readable collections of musical scores is a tedious and time consuming task. The most common interface for performing music data entry is a mouse and toolbar system; using the mouse, the user selects a rhythm (note shape) from a toolbar, then drags the note to the correct position on the staff. We compare the usability of a hybrid speech and mouse-driven interface to a traditional mouse-driven one. The speech-enhanced interface allows users to enter note rhythms by voice, while still using the mouse to indicate pitches. While task completion time is nearly the same, users (N=13) significantly preferred the speech-augmented interface. A second study using the first two authors of this paper (N=2) indicates that experienced users can enter music 11% faster with the speech interface. Many users expressed a desire to enter pitches, as well as rhythms, by speech. A third study, however, shows that the recognizer is unable to reliably distinguish among A, B, C, D, E, F and G (N=10).
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
David Bainbridge , Craig G. Nevill-Manning , Ian H. Witten , Lloyd A. Smith , Rodger J. McNab, Towards a digital library of popular music, Proceedings of the fourth ACM conference on Digital libraries, p.161-169, August 11-14, 1999, Berkeley, California, United States
[doi> 10.1145/313238.313295]
|
| |
3
|
Belkin, A. Macintosh notation software: Present and future. Computer Music Journal 18 (1994), pp. 53-69.
|
| |
4
|
B lostein, D. and Haken, L. Template matching for rhythmic analysis of music keyboard input, in Proc. 10# International Conference on Pattern Recognition (Atlantic City, N.J., 1990).
|
| |
5
|
Carter, N. Music Score Recognition: Problems and Prospects. Computing in Musicology 9 (1993), pp. 152-158.
|
| |
6
|
Choral Public Domain Library {Online}. http:// www.cpdl.org, 1998.
|
| |
7
|
Huron, D. Design principles in computer based music representation, in A. Marsden and A. Pople, Eds. Computer Representations and Models in Music. Academic Press, London, 1992.
|
| |
8
|
IBM, Inc. ViaVoice Developer's Comer {Online}. http ://www. software.ibm.com/is/voicetype/dev_home. html, 1998.
|
 |
9
|
Rodger J. McNab , Lloyd A. Smith , Ian H. Witten , Clare L. Henderson , Sally Jo Cunningham, Towards the digital music library: tune retrieval from acoustic input, Proceedings of the first ACM international conference on Digital libraries, p.11-18, March 20-23, 1996, Bethesda, Maryland, United States
[doi> 10.1145/226931.226934]
|
| |
10
|
Wildcat Canyon Software: Autoscore {Online}. http://www.wildcat.com, 1999.
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.3
INFORMATION STORAGE AND RETRIEVAL
Additional Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
Interaction styles (e.g., commands, menus, forms, direct manipulation);
Natural language
I.
Computing Methodologies
I.2
ARTIFICIAL INTELLIGENCE
I.2.1
Applications and Expert Systems
Subjects:
Natural language interfaces
General Terms:
Design,
Documentation,
Experimentation,
Human Factors,
Management,
Measurement,
Performance,
Theory
Keywords:
music digital libraries,
music notation editors,
speech recognition,
user interfaces
Peer to Peer - Readers of this Article have also read:
-
M4: a metamodel for data preprocessing
Proceedings of the 4th ACM international workshop on Data warehousing and OLAP
Anca Vaduva
, Jörg-Uwe Kietz
, Regina Zücker
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
|