ACM Home Page
Please provide us with feedback. Feedback
Highlight scene extraction in real time from baseball live video
Full text PdfPdf (301 KB)
Source International Multimedia Conference archive
Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval table of contents
Berkeley, California
POSTER SESSION: Posters table of contents
Pages: 209 - 214  
Year of Publication: 2003
ISBN:1-58113-778-8
Authors
Yasuo Ariki  Ryukoku University, Seta, Otsu, Japan
Masahito Kumano  Ryukoku University, Seta, Otsu, Japan
Kiyoshi Tsukada  17-1 Chayamachi, Kita-ku, Osaka, Japan
Sponsor
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 54,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/973264.973297
What is a DOI?

ABSTRACT

This paper proposes a method to automatically extract highlight scenes from sports (baseball) live video in real time and to allow users to retrieve them. For this purpose, sophisticated speech recognition is employed to convert the speech signal into the text and to extract a group of keywords in real time. Image processing detects, also in real time, the pitcher scenes and ending at the successive pitcher scene. Highlight scenes are extracted as the pitching sections with the keywords such as home run, two-base hit and three-base hit extracted from speech signals.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
C. L. Leggetter and P. C. Woodland. Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models. Computer Speech and Language, 9:171--185, 1995.
 
2
A. Ito, M. Kohda, and M. Ostendorf. A new metric for stochastic language model evaluation. In Proceedings of Eurospeech99, pages 1591--1594. ISCA, 1999.
 
3
J. L. Gauvain and C. Lee. Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains. IEEE Trans. on Speech and Audio Processing, 2(2):291--298, 1994.
 
4
J. Ogata and Y. Ariki. An efficient lexical tree search for large vocabulary continuous speech recognition. In Proceedings of International Conference on Spoken Language Processing, pages 967--970, 2000.
 
5
T. Kawashima, K. Tateyama, T. Iijima, and Y. Aoki. Indexing of baseball telecast for content - based video retrieval. In Proceedings of International Conference on Image Processing, pages CD-ROM. IEEE, October 1998.
 
6
K. Maekawa, H. Koiso, S. Furui, and H. Isahara. Spontaneous speech corpus of japanese. In Proceedings of LREC2000, pages 947--952, 2000.
 
7
M. Kumano and Y. Ariki. Automatic useful shot extraction for a video editing support system. In Proceedings of MVA, pages 310--313, 2002.
 
8
N. Babaguchi. Towards abstracting sports video by highlights. In Proceedings of International Conference on Multimedia and Expo, pages 1519--1522. IEEE, 2000.
 
9
P. Chang, M. Han, and Y. Gong. Extract highlights from baseball game video with hidden markov models. In Proceedings of International Conference on Image Processing, pages 609--612. IEEE, 2002.
 
10
S. Ortmanns, H. Ney, and X. Aubert. A word graph algorithm for large vocabulary continuous speech recognition. volume 11, pages 43--72, 1997.
 
11
12


Collaborative Colleagues:
Yasuo Ariki: colleagues
Masahito Kumano: colleagues
Kiyoshi Tsukada: colleagues

Peer to Peer - Readers of this Article have also read: