ACM Home Page
Please provide us with feedback. Feedback
Player action recognition in broadcast tennis video with applications to semantic analysis of sports game
Full text PdfPdf (2.39 MB)
Source International Multimedia Conference archive
Proceedings of the 14th annual ACM international conference on Multimedia table of contents
Santa Barbara, CA, USA
SESSION: Content session 3: semantic concepts table of contents
Pages: 431 - 440  
Year of Publication: 2006
ISBN:1-59593-447-2
Authors
Guangyu Zhu  Harbin Institute of Technology. Harbin, P.R. China
Changsheng Xu  Institute for Infocomm Research, Singapore
Qingming Huang  Graduate School of Chinese Academy of Sciences, Beijing, P.R. China
Wen Gao  Harbin Institute of Technology. Harbin, P.R. China
Liyuan Xing  Graduate School of Chinese Academy of Sciences, Beijing, P.R. China
Sponsors
ACM: Association for Computing Machinery
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 19,   Downloads (12 Months): 193,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1180639.1180728
What is a DOI?

ABSTRACT

Recognition of player actions in broadcast sports video is a challenging task due to low resolution of the players in video frames. In this paper, we present a novel method to recognize the basic player actions in broadcast tennis video. Different from the existing appearance-based approaches, our method is based on motion analysis and considers the relationship between the movements of different body parts and the regions in the image plane. A novel motion descriptor is proposed and supervised learning is employed to train the action classifier. We also propose a novel framework by combining the player action recognition with other multimodal features for semantic and tactic analysis of the broadcast tennis video. Incorporating action recognition into the framework not only improves the semantic indexing and retrieval performance of the video content, but also conducts highlights ranking and tactics analysis in tennis matches, which is the first solution to our knowledge for tennis game. The experimental results demonstrate that our player action recognition method outperforms existing appearance-based approaches and the multimodal framework is effective for broadcast tennis video analysis.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
A. Ekin, A.M. Tekalp, R. Mehrotra. Automatic soccer video analysis and summarization. IEEE Transaction on Image Processing, vol. 12, no. 7, pp. 796--807, 2003.
 
3
 
4
 
5
N. Babaguchi, Y. Kawai, T. Ogura, T. Kitahashi. Personalized abstraction of broadcasted American football video by highlight selection. IEEE Transaction on Multimedia, vol. 6, no. 4, pp. 575--586, 2004.
 
6
K. Wan, J. Wang, C. Xu, Q. Tian. Automatic sports highlights extraction with content augmentation. Pacific-Rim Conference on Multimedia, vol. 3332, pp. 19--26, 2004.
 
7
G. S. Pingali, Y. Jean, A. Opalach, I. Carlbom. LucentVision: converting real world events into multimedia experiences. IEEE International Conference on Multimedia and Expo, vol. 3, pp. 1433--1436, 2000.
 
8
 
9
 
10
 
11
 
12
 
13
 
14
 
15
H. Miyamori. Improving accuracy in behavior identification for content-based retrieval by using audio and video information. IEEE International Conference on Pattern Recognition, vol. 2, pp. 826--830, 2002.
 
16
 
17
 
18
X. Yu, C.H. Sim, J.R. Wang, L.F. Cheong. A trajectory-based ball detection and tracking algorithm in broadcast tennis video. IEEE International Conference on Image Processing, vol. 2, pp. 1049--1052, 2004.
 
19
 
20
M. Xu, L.Y. Duan, C.S. Xu, Q. Tian. A fusion scheme of visual and auditory modalities for event detection in sports video. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3, pp. 189--192, 2003.
 
21
M. Petkovic, V. Mihajlovic, W. Jonker. Techniques for automatic video content derivation. IEEE International Conference on Image Processing, vol. 2, pp. 611--614, 2003.
 
22
P. Wang, R. Cai, S.Q. Yang. A tennis video indexing approach through pattern discovery in interactive process. Pacific-Rim Conference on Multimedia, vol. 3331, pp. 49--56, 2004.
 
23
L. Xing, H. Yu, Q. Huang, Q. Ye, A. Divakaran. Subjective evaluation criterion for selecting affective features and modeling highlights. SPIE Conference on Multimedia Content Analysis and Management, vol. 6073, 2006.
 
24
G. Zhu, D. Liang, Y. Liu, Q. Huang, W. Gao. Improving particle filter with support vector regression for efficient visual tracking. IEEE International Conference on Image Processing, vol. 2, pp. 422--425, 2005.
 
25
B.K.P. Horn, B.G. Schunck. Determining optical flow. Artificial Intelligence, vol. 17, pp. 185--203, 1981.
26
 
27
Q. Ye, W. Gao, W. Zeng. Color image segmentation using density-based clustering. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3, pp. 345--348, 2003.
 
28
 
29
 
30
 
31
H. Liu, D. Zhou. Content-based news video story segmentation and video retrieval. SPIE Conference on Image and Graphics, vol. 4875, pp. 1038--1044, 2002.


Collaborative Colleagues:
Guangyu Zhu: colleagues
Changsheng Xu: colleagues
Qingming Huang: colleagues
Wen Gao: colleagues
Liyuan Xing: colleagues