skip to main content
10.1145/1180639.1180786acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

The influence of cross-validation on video classification performance

Published: 23 October 2006 Publication History

Abstract

Digital video is sequential in nature. When video data is used in a semantic concept classification task, the episodes are usually summarized with shots. The shots are annotated as containing, or not containing, a certain concept resulting in a labeled dataset. These labeled shots can subsequently be used by supervised learning methods (classifiers) where they are trained to predict the absence or presence of the concept in unseen shots and episodes. The performance of such automatic classification systems is usually estimated with cross-validation. By taking random samples from the dataset for training and testing as such, part of the shots from an episode are in the training set and another part from the same episode is in the test set. Accordingly, data dependence between training and test set is introduced, resulting in too optimistic performance estimates. In this paper, we experimentally show this bias, and propose how this bias can be prevented using episode-constrained crossvalidation. Moreover, we show that a 17% higher classifier performance can be achieved by using episode constrained cross-validation for classifier parameter tuning.

References

[1]
R. Cowie, E. Douglas-Cowie, N. Tsapatsoulis, G. Votsis, S. Kollias, W. Fellenz, and J. G. Taylor. Emotion recognition in human-computer interaction. Signal Processing Magazine, 18(1):32--80, 2001.
[2]
L.-Y. Duan, M. Xu, X.-D. Yu, and Q. Tian. A unified framework for semantic shot classification in sports video. Transactions on Multimedia, 7(6):1066--1083, 2005.
[3]
R. O. Duda, P. E. Hart, and D. G. Stork. Pattern Classification. Wiley-Interscience Publication, 2000.
[4]
A. K. Jain, R. P. W. Duin, and J. Mao. Statistical pattern recognition: A review. TPAMI, 22(1):4--37, 2000.
[5]
L. Lu, H. Jiang, and H. Zhang. A robust audio classification and segmentation method. In ACM Multimedia, pages 203--211, New York, NY, USA, 2001. ACM Press.
[6]
I. McCowan, D. Gatica-Perez, S. Bengio, G. Lathoud, and M. Barnard. Automatic analysis of multimodal group actions in meetings. TPAMI, 27(3):305--317, 2005.
[7]
M. R. Naphade and T. S. Huang. A probabilistic framework for semantic video indexing, filtering, and retrieval. Transactions on Multimedia, 3(1):141--151, 2001.
[8]
NIST. TRECVID Video Retrieval Evaluation, 2001--2005. http://www-nlpir.nist.gov/projects/trecvid/.
[9]
Y. Qi, A. Hauptmann, and T. Liu. Supervised classification for video shot segmentation. In ICME, July 2003.
[10]
C. G. M. Snoek, M. Worring, J.-M. Geusebroek, D. C. Koelma, F. J. Seinstra, and A. W. M. Smeulders. The semantic pathfinder: Using an authoring metaphor for generic multimedia indexing. TPAMI, 2006, in press.
[11]
C. G. M. Snoek, M. Worring, J. C. van Gemert, J.-M. Geusebroek, and A. W. M. Smeulders. The challenge problem for automated detection of 101 semantic concepts in multimedia. In ACM Multimedia, 2006.
[12]
J. C. van Gemert, J. M. Geusebroek, C. J. Veenman, C. G. M. Snoek, and A. W. M. Smeulders. Robust scene categorization by learning image statistics in context. In CVPR-SLAM, 2006.
[13]
V. Vapnik. The Nature of Statistical Learning Theory. Springer Verlag, New York, 1995.

Cited By

View all
  • (2024)Makine Öğrenimi Teknikleriyle Uyku Bozukluklarına Yönelik Derinlemesine Analiz ve SınıflandırmaInternational Journal of Pure and Applied Sciences10.29132/ijpas.147518310:1(242-260)Online publication date: 30-Jun-2024
  • (2022)Towards a Data Engineering Process in Data-Driven Systems Engineering2022 IEEE International Symposium on Systems Engineering (ISSE)10.1109/ISSE54508.2022.10005441(1-8)Online publication date: 24-Oct-2022
  • (2009)Concept-Based Video RetrievalFoundations and Trends in Information Retrieval10.1561/15000000142:4(215-322)Online publication date: 1-Apr-2009
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '06: Proceedings of the 14th ACM international conference on Multimedia
October 2006
1072 pages
ISBN:1595934472
DOI:10.1145/1180639
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 October 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cross-validation
  2. multimedia performance evaluation
  3. parameter tuning
  4. semantic concept detection

Qualifiers

  • Article

Conference

MM06
MM06: The 14th ACM International Conference on Multimedia 2006
October 23 - 27, 2006
CA, Santa Barbara, USA

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Makine Öğrenimi Teknikleriyle Uyku Bozukluklarına Yönelik Derinlemesine Analiz ve SınıflandırmaInternational Journal of Pure and Applied Sciences10.29132/ijpas.147518310:1(242-260)Online publication date: 30-Jun-2024
  • (2022)Towards a Data Engineering Process in Data-Driven Systems Engineering2022 IEEE International Symposium on Systems Engineering (ISSE)10.1109/ISSE54508.2022.10005441(1-8)Online publication date: 24-Oct-2022
  • (2009)Concept-Based Video RetrievalFoundations and Trends in Information Retrieval10.1561/15000000142:4(215-322)Online publication date: 1-Apr-2009
  • (2009)Episode-constrained cross-validation in video concept retrievalIEEE Transactions on Multimedia10.1109/TMM.2009.201761911:4(780-786)Online publication date: 1-Jun-2009
  • (2008)Analyzing video concept detectors visually2008 IEEE International Conference on Multimedia and Expo10.1109/ICME.2008.4607759(1603-1604)Online publication date: Jun-2008
  • (2007)Exploiting redundancy in cross-channel video retrievalProceedings of the international workshop on Workshop on multimedia information retrieval10.1145/1290082.1290109(177-186)Online publication date: 24-Sep-2007
  • (2007)Semantic Video SearchProceedings of the 14th International Conference of Image Analysis and Processing - Workshops10.1109/ICIAPW.2007.39(51-58)Online publication date: 10-Sep-2007

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media