| Highlight scene extraction in real time from baseball live video |
| Full text |
Pdf
(301 KB)
|
| Source
|
International Multimedia Conference
archive
Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval
table of contents
Berkeley, California
POSTER SESSION: Posters
table of contents
Pages: 209 - 214
Year of Publication: 2003
ISBN:1-58113-778-8
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 3, Downloads (12 Months): 54, Citation Count: 4
|
|
|
ABSTRACT
This paper proposes a method to automatically extract highlight scenes from sports (baseball) live video in real time and to allow users to retrieve them. For this purpose, sophisticated speech recognition is employed to convert the speech signal into the text and to extract a group of keywords in real time. Image processing detects, also in real time, the pitcher scenes and ending at the successive pitcher scene. Highlight scenes are extracted as the pitching sections with the keywords such as home run, two-base hit and three-base hit extracted from speech signals.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
C. L. Leggetter and P. C. Woodland. Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models. Computer Speech and Language, 9:171--185, 1995.
|
| |
2
|
A. Ito, M. Kohda, and M. Ostendorf. A new metric for stochastic language model evaluation. In Proceedings of Eurospeech99, pages 1591--1594. ISCA, 1999.
|
| |
3
|
J. L. Gauvain and C. Lee. Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains. IEEE Trans. on Speech and Audio Processing, 2(2):291--298, 1994.
|
| |
4
|
J. Ogata and Y. Ariki. An efficient lexical tree search for large vocabulary continuous speech recognition. In Proceedings of International Conference on Spoken Language Processing, pages 967--970, 2000.
|
| |
5
|
T. Kawashima, K. Tateyama, T. Iijima, and Y. Aoki. Indexing of baseball telecast for content - based video retrieval. In Proceedings of International Conference on Image Processing, pages CD-ROM. IEEE, October 1998.
|
| |
6
|
K. Maekawa, H. Koiso, S. Furui, and H. Isahara. Spontaneous speech corpus of japanese. In Proceedings of LREC2000, pages 947--952, 2000.
|
| |
7
|
M. Kumano and Y. Ariki. Automatic useful shot extraction for a video editing support system. In Proceedings of MVA, pages 310--313, 2002.
|
| |
8
|
N. Babaguchi. Towards abstracting sports video by highlights. In Proceedings of International Conference on Multimedia and Expo, pages 1519--1522. IEEE, 2000.
|
| |
9
|
P. Chang, M. Han, and Y. Gong. Extract highlights from baseball game video with hidden markov models. In Proceedings of International Conference on Image Processing, pages 609--612. IEEE, 2002.
|
| |
10
|
S. Ortmanns, H. Ney, and X. Aubert. A word graph algorithm for large vocabulary continuous speech recognition. volume 11, pages 43--72, 1997.
|
| |
11
|
|
 |
12
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE conference on Design automation
Gwo-Dong Chen
, Daniel D. Gajski
|