skip to main content
10.1145/1459359.1459468acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Naming faces in broadcast news video by image google

Published: 26 October 2008 Publication History

Abstract

Naming faces is important for news videos browsing and indexing. Although some research efforts have been contributed to it, they only use the concurrent information between the face and name or employ some clues as features and use simple heuristic method or machine learning approach to finish the task. They use little extra knowledge about the names and faces. Different from previous work, in this paper we present a novel approach to name the faces by exploring extra knowledge obtained from image google. The behind assumption is that the faces of those important persons will turn out many times in the web images and could be retrieved from image google easily. Firstly, faces are detected in the video frames; and the name entities of candidate persons are extracted from the textual information by automatic speech recognition and close caption detection. Then, these candidate person names are used as queries to find the name related person images through image google. After that, the retrieved result is analyzed and some typical faces are selected through feature density estimation. Finally, the detected faces in the news video are matched with the faces selected from the result returned by image google to label each face. Experimental results on MSNBC news and CNN news demonstrate that the proposed approach is effective.

References

[1]
S. Satoh, T. Kanade. NAME-IT: Association of Faces and Names in Video. In Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, pp. 368--373, 1997.
[2]
T. L. Berg, A. C. Berg, J. Edwards, M. Maire, R. White, Y. W. Teh, E. Miller, D. A. Foryth. Names and Faces in the News. In Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, pp. 848--854, 2004.
[3]
D. Ozkan and P. Duygulu. A Graph Based Approach for Naming Faces in News Photos. In Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1477--1482, 2006.
[4]
J. Yang and A. G. Hauptmann. Naming Every Individual in News Video Monologues. In Proc. of ACM Int'l Conf. on Multimedia, pp. 580--587, 2004
[5]
J. Yang, R. Yang, and A. G. Hauptmann. Multiple Instance Learning for Labeling Faces in Broadcasting News Video. In Proc. of ACM Int'l Conf. on Multimedia, pp. 31--40, 2005.
[6]
L. Chaisorn, T.-S Chua, C.-K Koh, Y.-L Zhao, H. Xu, H. Feng and Q. Tian. A two-level Multi-modal Approach for Story Segmentation of Large News Video Corpus. TRECVID workshop, 2003.
[7]
J. Chen, X. Chen, W. Gao. Expand Training Set for Face Detection by GA Re-sampling. In Proc. IEEE int'l conf. on automatic face and gesture recognition, 2004.
[8]
Alias-i. Lingpipe named entity tagger. In http://www.aliasi.com/lingpipe/.
[9]
Q. Ye and Q. Huang. A New Text Detection Algorithm in Image/Video Frames. Lecture note in computer science, Pacific-Rim conference on Multimedia, pp. 858--865, 2004.
[10]
http://www.hw99.com
[11]
K. Fukunaga and L. Hostetler. The Estimation of the Gradient of a Density Function, with Applications in Pattern Recognition. IEEE Transactions on Information Theory, vol.21, no.1, pp.32--40, 1975.
[12]
Y. Su, S. Shan, X. Chen and W. Gao. Hierarchical Ensemble of Global and Local Classifiers for Face Recognition. In Proc. of IEEE Int'l Conf. on Computer Vision, pp. 1--8, 2007.

Cited By

View all
  • (2019)Name-face association with web facial image supervisionMultimedia Systems10.1007/s00530-017-0544-y25:1(1-20)Online publication date: 1-Feb-2019
  • (2015)Deep Multimodal Speaker NamingProceedings of the 23rd ACM international conference on Multimedia10.1145/2733373.2806293(1107-1110)Online publication date: 13-Oct-2015
  • (2015)People News Search via Name-Face Association AnalysisProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749301(467-470)Online publication date: 22-Jun-2015
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '08: Proceedings of the 16th ACM international conference on Multimedia
October 2008
1206 pages
ISBN:9781605583037
DOI:10.1145/1459359
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 October 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. naming faces
  2. news video analysis
  3. news video browsing and indexing

Qualifiers

  • Short-paper

Conference

MM08
Sponsor:
MM08: ACM Multimedia Conference 2008
October 26 - 31, 2008
British Columbia, Vancouver, Canada

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 16 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2019)Name-face association with web facial image supervisionMultimedia Systems10.1007/s00530-017-0544-y25:1(1-20)Online publication date: 1-Feb-2019
  • (2015)Deep Multimodal Speaker NamingProceedings of the 23rd ACM international conference on Multimedia10.1145/2733373.2806293(1107-1110)Online publication date: 13-Oct-2015
  • (2015)People News Search via Name-Face Association AnalysisProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749301(467-470)Online publication date: 22-Jun-2015
  • (2014)A conditional random field approach for face identification in broadcast news using overlaid text2014 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP.2014.7025063(318-322)Online publication date: Oct-2014
  • (2014)Comparison of two methods for unsupervised person identification in TV shows2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI)10.1109/CBMI.2014.6849828(1-6)Online publication date: Jun-2014
  • (2013)Automatic name-face alignment to enable cross-media news retrievalProceedings of the Twenty-Third international joint conference on Artificial Intelligence10.5555/2540128.2540527(2768-2774)Online publication date: 3-Aug-2013
  • (2013)Unsupervised face identification in TV content using audio-visual sources2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI)10.1109/CBMI.2013.6576591(243-249)Online publication date: Jun-2013
  • (2012)Lightweight automatic face annotation in media pagesProceedings of the 21st international conference on World Wide Web10.1145/2187836.2187962(939-948)Online publication date: 16-Apr-2012
  • (2012)Finding Celebrities in Billions of Web ImagesIEEE Transactions on Multimedia10.1109/TMM.2012.218612114:4(995-1007)Online publication date: 1-Aug-2012
  • (2012)Detecting person presence in TV shows with linguistic and structural features2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP.2012.6289062(5077-5080)Online publication date: Mar-2012
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media