| Semi-automatic video annotation based on active learning with multiple complementary predictors |
| Full text |
Pdf
(311 KB)
|
| Source
|
International Multimedia Conference
archive
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
table of contents
Hilton, Singapore
POSTER SESSION: Poster session 1: video annotation, indexing and retrieval
table of contents
Pages: 97 - 104
Year of Publication: 2005
ISBN:1-59593-244-5
|
|
Authors
|
|
Yan Song
|
University of Sci&Tech of China, Hefei, China
|
|
Xian-Sheng Hua
|
Microsft Research Asia, Beijing, China
|
|
Li-Rong Dai
|
University of Sci&Tech of China, Hefei, China
|
|
Meng Wang
|
University of Sci&Tech of China, Hefei, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 12, Downloads (12 Months): 64, Citation Count: 7
|
|
|
ABSTRACT
In this paper, we will propose a novel semi-automatic annotation scheme for video semantic classification. It is well known that the large gap between high-level semantics and low-level features is difficult to be bridged by full-automatic content analysis mechanisms. To narrow down this gap, relevance feedback has been introduced in a number of literatures, especially in those works addressing the problem of image retrieval. And at the same time, active learning is also suggested to accelerate the converging speed of the learning process by labeling the most informative samples. Generally an active learning scheme includes a sample selection engine and a learning engine. In this paper, we will discuss the limitations of existing active learning algorithms and propose a novel active learning scheme based on multiple complementary predictors and incremental model adaptation, which improves the efficiencies of both of the primary components of active learning. Firstly, an efficient sample selection scheme is proposed, in which multiple predictors are applied to find most informative samples. Then an incremental model adaptation technique, maximum likelihood linear regression (MLLR), is used to update the classifiers which tackle the issue of unbalance between the original training set and the newly labeled data. It is proved that the samples selected by the proposed scheme are more representative than general active learning scheme, as well as the incremental model adaptation scheme is effective especially when the newly added data size is small.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
Chapelle. O, W.J., Scholkopf.B, Cluster kernels for semi-supervised learning. Advances in Neural Information Processing Systems, 2002.
|
| |
4
|
|
 |
5
|
|
| |
6
|
Fang Qian, M.L., Wei-Ying Ma, Alternating Feature Spaces in Relevance Feedback. 1998.
|
| |
7
|
Leggetter,C.J. and Woodland,P, Maximum likelihood linear regression for speaker adaptation of continuous density Hidden Markov Model. Computer Speech and Language.
|
 |
8
|
Jun Wu , Xian-Sheng Hua , Hong-Jiang Zhang , Bo Zhang, An online-optimized incremental learning framework for video semantic classification, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027601]
|
 |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
Yan Song, Xian-Sheng Hua, Li-Rong Dai, Ren-Hua Wang, Semi-Automatic Video Semantic Annotation Based on Active Learning. VCIP, 2005.
|
| |
13
|
Guidelines for the TRECVID 2003 Evaluation http://www-nlpir.nist.gov/projects/tv2003/tv2003.htm
|
| |
14
|
|
| |
15
|
|
| |
16
|
M.J.F.Gales, P.C.Woodland, "Mean and Variance Adaptation within the MLLR Framework", Computer Speech and Language Volume 10. 1996.
|
CITED BY 7
|
|
|
|
Jinhui Tang , Yan Song , Xian-Sheng Hua , Tao Mei , Xiuqing Wu, To construct optimal training set for video annotation, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
|
|
Xun Yuan , Xian-Sheng Hua , Meng Wang , Xiu-Qing Wu, Manifold-ranking based video concept detection on large database and feature pool, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
|
|
Yan Song , Xian-Sheng Hua , Guo-Jun Qi , Li-Rong Dai , Meng Wang , Hong-Jiang Zhang, Efficient semantic annotation method for indexing large personal video database, Proceedings of the 8th ACM international workshop on Multimedia information retrieval, October 26-27, 2006, Santa Barbara, California, USA
|
|
Meng Wang , Xian-Sheng Hua , Xun Yuan , Yan Song , Li-Rong Dai, Optimizing multi-graph learning: towards a unified video annotation scheme, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
Jinhui Tang , Xian-Sheng Hua , Guo-Jun Qi , Meng Wang , Tao Mei , Xiuqing Wu, Structure-sensitive manifold ranking for video concept detection, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
Meng Wang , Yan Song , Xun Yuan , Hong-Jiang Zhang , Xian-Sheng Hua , Shipeng Li, Automatic video annotation by semi-supervised learning with kernel density estimation, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
|
|