|
ABSTRACT
In this paper, we propose a multimodal Web image retrieval technique based on multi-graph enabled active learning. The main goal is to leverage the heterogeneous data on the Web to improve retrieval precision. Three graphs are constructed on images' content features, textual annotations and hyperlinks respectively, namely Content-Graph, Text-Graph and Link-Graph, which provide complimentary information on the images. By analyzing the three graphs, a training dataset is automatically created and transductive learning is enabled. The transductive learner is a multi-graph based classifier, which simultaneously solves the learning problem and the problem of combining heterogeneous data. This proposed approach, overall, tackles the problem of unsupervised active learning on Web graph. Although the proposed approach is discussed in the context of WWW image retrieval, it can be applied to other domains. The experimental results show the effectiveness of our approach.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Barnard, K., Forsyth, D. Learning the Semantic of Words and Pictures. ICCV (2001).
|
| |
2
|
Barnard K., Duygulu P., and Forsyth D. Clustering Art. Computer Vision and Pattern Recognition, pp. II:434--439, 2001.
|
| |
3
|
Belkin, M., Niyogi, P., Sindhwani, V. On Manifold Regularization. AISTATS, (2005).
|
 |
4
|
Deng Cai , Xiaofei He , Zhiwei Li , Wei-Ying Ma , Ji-Rong Wen, Hierarchical clustering of WWW image search results using visual, textual and link information, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027747]
|
| |
5
|
Cai, D., Yu, S.P., Wen, J.-R., Ma, W.-Y. VIPS: a Vision-based Page Segmentation Algorithm. Microsoft Technical Report (MSR-TR-2003-79), (2003).
|
| |
6
|
|
 |
7
|
|
| |
8
|
Google. http://image.google.com (2005).
|
 |
9
|
Jingrui He , Mingjing Li , Hong-Jiang Zhang , Hanghang Tong , Changshui Zhang, Manifold-ranking based image retrieval, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027531]
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
 |
13
|
Ravi Kumar , Prabhakar Raghavan , Sridhar Rajagopalan , D. Sivakumar , Andrew Tompkins , Eli Upfal, The Web as a graph, Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.1-10, May 15-18, 2000, Dallas, Texas, United States
[doi> 10.1145/335168.335170]
|
| |
14
|
Platt J.C., Smola A., Bartlett P., Scholkopf B., and Schuurmans D., Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods, MIT Press, 1999, 61--74.
|
| |
15
|
Rui, Y., Huang, T.S., Mehrotra, S. Content-Based Image Retrieval with Relevance Feedback in MARS. ICIP, (1997).
|
| |
16
|
|
| |
17
|
|
 |
18
|
|
| |
19
|
|
 |
20
|
Xin-Jing Wang , Wei-Ying Ma , Gui-Rong Xue , Xing Li, Multi-model similarity propagation and its application for web image retrieval, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027746]
|
 |
21
|
|
| |
22
|
Yahoo!, http://images.search.yahoo.com (2005).
|
| |
23
|
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., and Scholkopf, B., Learning with Local and Global Consistency. NIPS 16, (2004).
|
| |
24
|
Zhou, D., Schölkopf, B., and Hofmann, T. Semi-supervised Learning on Directed Graphs. NIPS 17 (2005).
|
| |
25
|
Zhou, D., Weston, J., Gretton, A., Bousquet O., and Scholkopf, B. Ranking on Data Manifolds. NIPS 16, (2004).
|
|