ACM Home Page
Please provide us with feedback. Feedback
Multi-graph enabled active learning for multimodal web image retrieval
Full text PdfPdf (371 KB)
Source International Multimedia Conference archive
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval table of contents
Hilton, Singapore
SESSION: Oral session 2: web searching and applications table of contents
Pages: 65 - 72  
Year of Publication: 2005
ISBN:1-59593-244-5
Authors
Xin-Jing Wang  Tsinghua University, China
Wei-Ying Ma  Microsoft Research, Asia
Lei Zhang  Microsoft Research, Asia
Xing Li  Tsinghua University, China
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 63,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1101826.1101839
What is a DOI?

ABSTRACT

In this paper, we propose a multimodal Web image retrieval technique based on multi-graph enabled active learning. The main goal is to leverage the heterogeneous data on the Web to improve retrieval precision. Three graphs are constructed on images' content features, textual annotations and hyperlinks respectively, namely Content-Graph, Text-Graph and Link-Graph, which provide complimentary information on the images. By analyzing the three graphs, a training dataset is automatically created and transductive learning is enabled. The transductive learner is a multi-graph based classifier, which simultaneously solves the learning problem and the problem of combining heterogeneous data. This proposed approach, overall, tackles the problem of unsupervised active learning on Web graph. Although the proposed approach is discussed in the context of WWW image retrieval, it can be applied to other domains. The experimental results show the effectiveness of our approach.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Barnard, K., Forsyth, D. Learning the Semantic of Words and Pictures. ICCV (2001).
 
2
Barnard K., Duygulu P., and Forsyth D. Clustering Art. Computer Vision and Pattern Recognition, pp. II:434--439, 2001.
 
3
Belkin, M., Niyogi, P., Sindhwani, V. On Manifold Regularization. AISTATS, (2005).
4
 
5
Cai, D., Yu, S.P., Wen, J.-R., Ma, W.-Y. VIPS: a Vision-based Page Segmentation Algorithm. Microsoft Technical Report (MSR-TR-2003-79), (2003).
 
6
7
 
8
Google. http://image.google.com (2005).
9
 
10
 
11
 
12
13
 
14
Platt J.C., Smola A., Bartlett P., Scholkopf B., and Schuurmans D., Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods, MIT Press, 1999, 61--74.
 
15
Rui, Y., Huang, T.S., Mehrotra, S. Content-Based Image Retrieval with Relevance Feedback in MARS. ICIP, (1997).
 
16
 
17
18
 
19
20
21
 
22
Yahoo!, http://images.search.yahoo.com (2005).
 
23
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., and Scholkopf, B., Learning with Local and Global Consistency. NIPS 16, (2004).
 
24
Zhou, D., Schölkopf, B., and Hofmann, T. Semi-supervised Learning on Directed Graphs. NIPS 17 (2005).
 
25
Zhou, D., Weston, J., Gretton, A., Bousquet O., and Scholkopf, B. Ranking on Data Manifolds. NIPS 16, (2004).

Collaborative Colleagues:
Xin-Jing Wang: colleagues
Wei-Ying Ma: colleagues
Lei Zhang: colleagues
Xing Li: colleagues