|
ABSTRACT
We consider the problem of learning a mapping function from low-level feature space to high-level semantic space. Under the assumption that the data lie on a submanifold embedded in a high dimensional Euclidean space, we propose a relevance feedback scheme which is naturally conducted only on the image manifold in question rather than the total ambient space. While images are typically represented by feature vectors in Rn, the natural distance is often different from the distance induced by the ambient space Rn. The geodesic distances on manifold are used to measure the similarities between images. However, when the number of data points is small, it is hard to discover the intrinsic manifold structure. Based on user interactions in a relevance feedback driven query-by-example system, the intrinsic similarities between images can be accurately estimated. We then develop an algorithmic framework to approximate the optimal mapping function by a Radial Basis Function (RBF) neural network. The semantics of a new image can be inferred by the RBF neural network. Experimental results show that our approach is effective in improving the performance of content-based image retrieval systems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
M. Belkin and P. Niyogi, "Laplacian eigenmaps for dimensionality reduction and data representation", Advances in Neural Information Processing Systesms, 2001.
|
| |
2
|
B. Bernstein, V. de Silva, J. C. Langford, and J. B. Tenenbaum, "Graph approximations to geodesics on embedded manifolds", Technical report, Stanford University, December 2000
|
| |
3
|
E. Chang, K. Goh, G. Sychay, and G. Wu, "CBSA: Content-Based Soft Annotation for Multimodal Image Retrieval Using Bayes Point Machine". IEEE Trans. on Circuits and Systems for Video Technology, vol. 13, No. 1, Jan. 2003.
|
| |
4
|
X. He and P. Niyogi, "Locality Preserving Projections", in Advances in Neural Information Processing Systems 16, Vancouver, Canada, 2003.
|
 |
5
|
|
| |
6
|
X. He, O. King, W.-Y. Ma, M.-J. Li, and H.-J. Zhang, "Learning a semantic space from user's relevance feedback for image retrieval", IEEE Trans. on Circuit and System for Video Technology, Jan, 2003.
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
W. Matusik, H. Pfister. M. Brand, and L. McMillan, "A data-driven reflectance model", in Proc. of SIGGRAPH, 2003.
|
| |
11
|
S.T. Roweis, and L.K. Saul, "Nonlinear dimensionality reduction by locally linear embedding", Science, vol 290, 22 December 2000.
|
| |
12
|
Y. Rui and T. S. Huang, "Optimizing learning in image retrieval", in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, Hilton Head, SC, June 2000.
|
| |
13
|
H.S. Seung and D. Lee, "The manifold ways of perception", Science, vol 290, 22 December 2000.
|
 |
14
|
|
| |
15
|
J.B. Tenenbaum, V.D. Silva, and J.C. Langford, "A global geometric framework for nonlinear dimensionality reduction", Science, Vol 290, 22 December 2000.
|
| |
16
|
K. Tieu and P. Viola, "Boosting image retrieval", in Proc. IEEE Conf. on Computer Vision and Pattern Recognitino, Hilton head, SC, June 2000.
|
 |
17
|
|
| |
18
|
N. Vasconcelos and A. Lippman, "Learning from user feedback in image retrieval systems", Advances in Neural Information Processing Systems, Denver, Colorado, 1999.
|
 |
19
|
|
 |
20
|
|
 |
21
|
|
CITED BY 16
|
|
|
|
|
|
|
|
|
|
Jing Liu , Mingjing Li , Wei-Ying Ma , Qingshan Liu , Hanqing Lu, An adaptive graph model for automatic image annotation, Proceedings of the 8th ACM international workshop on Multimedia information retrieval, October 26-27, 2006, Santa Barbara, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ritendra Datta , Dhiraj Joshi , Jia Li , James Z. Wang, Image retrieval: Ideas, influences, and trends of the new age, ACM Computing Surveys (CSUR), v.40 n.2, p.1-60, April 2008
|
|