ABSTRACT
Recognizing player actions in broadcast sports video is challenging because the players appear at low resolution in the video frames. In this paper, we present a novel method for recognizing basic player actions in broadcast tennis video. Unlike existing appearance-based approaches, our method is based on motion analysis and considers the relationship between the movements of different body parts and the regions in the image plane. We propose a novel motion descriptor and use supervised learning to train the action classifier. We also propose a novel framework that combines player action recognition with other multimodal features for semantic and tactical analysis of broadcast tennis video. Incorporating action recognition into the framework not only improves semantic indexing and retrieval of the video content, but also enables highlight ranking and tactics analysis in tennis matches, which is, to our knowledge, the first such solution for tennis. The experimental results demonstrate that our player action recognition method outperforms existing appearance-based approaches and that the multimodal framework is effective for broadcast tennis video analysis.
CITED BY
Guangyu Zhu, Qingming Huang, Changsheng Xu, Yong Rui, Shuqiang Jiang, Wen Gao, Hongxun Yao, Trajectory based event tactics analysis in broadcast sports video, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany