ACM Home Page
Please provide us with feedback. Feedback
A new general framework for shot boundary detection and key-frame extraction
Full text PdfPdf (294 KB)
Source International Multimedia Conference archive
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval table of contents
Hilton, Singapore
POSTER SESSION: Poster session 1: video annotation, indexing and retrieval table of contents
Pages: 121 - 126  
Year of Publication: 2005
ISBN:1-59593-244-5
Authors
Huamin Feng  Beijing Electronic Science and Technology Institution, Beijing, China
Wei Fang  Beijing University of Posts and Telecommunications, Beijing, China
Sen Liu  YanShan University, QinHuangdao, China
Yong Fang  Beijing Electronic Science and Technology Institution, Beijing, China
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 84,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1101826.1101847
What is a DOI?

ABSTRACT

Video shot boundary detection is an important step in many video applications. Since the rapid development of video editing technology, especially, the extensive use of sub-window in news video, the original method of video segmentation cannot efficiently detect the video shot boundary caused by special video technique. In this paper, previous temporal multi-resolution analysis (TMRA) work was extended by first using SVM (Supported Vector Machines) classify the video frames within a sliding window into normal frames, gradual transition frames and CUT frames, then clustering the classified frames into different shot categories. The experimental result on ground truth, which has about 21 hours (10,250 shots) news video clip, shows that the new framework has relatively good accuracy for the detection of shot boundaries. It basically resolves the difficulties of shot boundaries detection caused by sub-window technique in video. The framework also greatly improves accuracy of gradual transitions of shot.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. Adcock, A. Girgensohn, M. Cooper. FXPAL Experiments for TRECVID 2004{A}, Proceedings of TRECVID 2004, 2005.
 
2
T.M. Bae, S.H. Jin, Y. M. Ro. Video Segmentation Using Hidden Markov Model with Multimodal Features, CIVR2004, LNCS 3115, Dublin, 2004, 401--409.
 
3
Chih-Chung Chang and Chih-Jen Lin. LIBSVM: a Library for Support Vector Machines 2004 Available at http://www.csie.ntu.edu.tw/~cjlin
 
4
Chang, C.-C. and C.-J. Lin. LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm. 2001.
 
5
A. Cohen, R. Ryan. Wavelet and Multiscale Signal Processing, Chapman and Hall Publishers, 1995.
 
6
Naiyang Deng, Yingjie Tian. A New Method in Data Mining-Support Vector Machine. Beijing: Science Publisher 2004.
 
7
HuaMin Feng, A Chandrashekhara, T.S. Chua. An Automatic Temporal Multi-resolution Analysis Framework for Shot Boundary Detection. MMM 2003, the 9th International Conference on Multi-Media Modeling, 2003, 224.
 
8
YiHong Gong. An Accuracy and Robust Method for Detecting Video Shot boundaries, ICMCS, 1999, Vol. 1 850--854.
9
 
10
F. Idris, S. Panchanathan. Review of Image and Video Indexing Techniques. Journal of Visual Communication and Image Representation, 1997 June, 8(2), 146--166.
 
11
Lin, H.-T. and C.-J. Lin. A study on sigmoid kernels for SVM and the training of non-PSD kernels by SMO-type methods. Technical report, Department of Computer Science and Information Engineering, National Taiwan University. 2003.
 
12
S. Knerr, L. Personnaz, and G. Dreyfus. Single-layer learning revisited: a stepwise procedure for building and training a neural network. In J. Fogelman, editor, Neurocomputing: Algorithms.
 
13
Architectures and Applications. Springer-Verlag, 1990.
 
14
 
15
Y. Lin, M. S. Kankanhalli, T.S. Chua. Temporal Multi-resolution Analysis for video Segmentation, Proc. SPIE Conf. Storage and Retrieval for Media Database VIII, SPIE 2000 Vol. 3970, 494--505.
 
16
 
17
Hong Heather Yu, Wayne Wolf. Multi-resolution video segmentation using wavelet transformation, Storage and retrieval for Image and Video Database (SPIE), San Jose, CA, USA, 1998, 176--187.
 
18
 
19


Collaborative Colleagues:
Huamin Feng: colleagues
Wei Fang: colleagues
Sen Liu: colleagues
Yong Fang: colleagues