ACM Home Page
Please provide us with feedback. Feedback
Visual & textual fusion for region retrieval: from both fuzzy matching and bayesian reasoning aspects
Full text PdfPdf (1.02 MB)
Source
International Multimedia Conference archive
Proceedings of the international workshop on Workshop on multimedia information retrieval table of contents
Augsburg, Bavaria, Germany
POSTER SESSION: Multimedia retrieval and modeling table of contents
Pages: 159 - 168  
Year of Publication: 2007
ISBN:978-1-59593-778-0
Authors
Rongrong Ji  Harbin Institute of Technology, Harbin, China
Hongxun Yao  Harbin Institute of Technology, Harbin, China
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 143,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1290082.1290106
What is a DOI?

ABSTRACT

This paper presents a novel visual & textual information fusion framework for region-based image retrieval. We explore the issue of linguistic-integrated region retrieval from both Bayesian Reasoning and Fuzzy Region Matching aspects. Firstly, to associate textual information with image regions, we present a region-based soft annotation strategy. Our method automatically labels each image region with multiple keywords, each of which is assigned a confidence factor to indicate its annotation accuracy. In annotation classifier training, we adopt a pairwise coupling (PWC) SVM bagging network to address the problems of sample insufficiency and sample asymmetry. Consequently, in image retrieval, we fuse regions. visual & textual information to rank image similarities at perceptual level. Two fusion schemes are explored in proposed framework: 1. Semantic-Supervised Integrated Region Matching (SSIRM); 2. Keyword-Integrated Bayesian Reasoning (KIBR). SSIRM is a keyword-integrated fuzzy region matching strategy, which is adopted in the case that the query image is pre-annotated; KIBR is adopted in the case that the query image is non-annotated or poorly-annotated, which supports both query-by-example and query-by-keyword based on statistical text-image translation model. Finally, in relevance feedback (RF) learning, we exploit a unified visual & textual learning algorithm to precisely capture users' retrieval intention. Superior annotation, retrieval (over IRM) and RF performances (Both over IRM + SVM at region-level and SVM & ALSVM & ABSVM at global-level) are presented in our experiments, which demonstrate the efficiency of proposed fusion framework to bridge the semantic gap.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
R. C. Veltkamp, M. Tanase, "Content-Based Image Retrieval Systems: A Survey", Technical report UU-CS-2000-34, Dep. of Computing Science, Utrecht University, Oct. 2000. 34.
 
3
Y. Gong, H. J. Zhang and T. C. Chua, "An image database system with content capturing and fast image indexing abilities," Proc. International Conference on Multimedia Computing and Systems, Boston, 14-19 May 1994, pp.121--130.
 
4
Z. Su, H.-J. Zhang, and S. Li, "Relevance Feedback in Content-Based Image Retrieval: Bayesian Framework, Feature Subspaces and Progressive Learning," IEEE Trans. on Image Processing, vol.12, no.3, 2003.8.
 
5
 
6
 
7
 
8
Ye Lu, Hongjiang Zhang, Liu Wenyin et. al., "Joint Semantic and Feature Based Image Retrieval Using Relevance Feedback," IEEE Trans. on Multimedia, vol.5, no.3, pp.339--347, Sep. 2003.
9
10
 
11
R.M. Haralick, K. Shanmugam, and I. Dinstein, "Texture features for image classification," IEEE Trans. on System, Man, Cybernetics, Vol. 3, pp. 610--621, November 1973.
 
12
 
13
Md. Mahmudur Rahman, P. Bhattacharya, B. C. Desai, "A Framework for Medical Image Retrieval using Machine Learning & Statistical Similarity Matching Techniques with Relevance Feedback," IEEE Trans. on Information Technology in Biomedicine, Vol.11, Issue 1, pp.58--69, Jan. 2007.
 
14
 
15
E. Chang, K. Goh et. al., "CBSA: Content-Based Soft Annotation for Multimodal Image Retrieval Using Bayes Point Machines," IEEE Trans. on Circuits and Systems for Video Technology, Vol.13, Issue 1, pp.26--38, Jan.2003.
 
16
F. Jing, M. Li et. al., "A Unified Framework for Image Retrieval Using Keyword and Visual Features," IEEE Trans. on Image Processing, Vol.14, No.7, pp.979--989, July 2000.
 
17
 
18
R. Y, T. Huang, Mehrotra S et. al., "Relevance Feedback: A Power Tool for Interactive Content-based Image Retrieval," IEEE Trans. Circuits and Systems for Video Technology, 1998. 8(5): 644--655
 
19
S. Liapis, G. Tziritas, "Color and Textual Image Retrieval Using Chromaticity Histograms and Wavelet Frames," IEEE Transactions on Multimedia, Vol.6, No.5, October, 2004, pp.676--686.
 
20
R. Brunelli and O. Mich, "Image Retrieval by Example," IEEE Trans. on Multimedia, Vol.2, No.3, September, 2000, pp.164--171.
 
21
G. Aggarwal, A. T. V., and S. Ghosal, "An Image Retrieval System With Automatic Query Modification," IEEE Trans. on Multimedia, Vol.4, No.2, June 2002, pp. 201--214.
 
22
K.-M. Lee, and W. N. Street, "Cluster-Driven Refinement for Content-Based Digital Image Retrieval," IEEE Trans. on Multimedia, Vol.6, No.6, December 2004, pp.817--827.
 
23
B.C. Ko, H. Byun, "FRIP: A Region-Based Image Retrieval Tool Using Automatic Image Segmentation and Stepwise Boolean AND Matching," IEEE Trans. on Multimedia, Vol.7, No.1, February 2005, pp.105--113.
 
24
D. Tao, X. Tang, X. Li, and Y. Rui, "Direct Kernel Biased Discriminant Analysis: A New Content-Based Image Retrieval Relevance Feedback Algorithm," IEEE Trans. on Multimedia, Vol.8, No. 4, August 2004, pp.716--727.
 
25
Y. Lu, H.-J. Zhang, L. Wenyin, and C. Hu, "Joint Semantic and Feature Based Image Retrieval Using Relevance Feedback," IEEE Trans. on Multimedia, Vol.5, No.3, September 2003, pp.339--347.
 
26
A. Del Bimbo, E. Vicario, "Weighted Walkthroughs between Extended Entities for Retrieval by Spatial Arrangement," IEEE Trans. on Multimedia, Vol.5, No.1, pp.52--70, 2003.
 
27
28
 
29
I. J. Cox, M. L. Miller, T. P. Minka, T. V. Papathomas, and P. N. Yianilos, "The Bayesian Image Retrieval System, PicHunter: Theory, Implementation, and Psychophysical Experiments," IEEE Trans. on Image Processing, Vol.9, No.1, pp.20--37, 2000.
 
30
R. Zhang and Z. Zhang, "Hidden Semantic Concept Discovery in Region Based Image Retrieval, Computer Vision and Pattern Recognition, Washington, DC, USA, Vol.2, pp.996--1001,June 2004.
 
31
X. S. Zhou, T. S. Huang, "Relevance Feedback in Image Retrieval: A Comprehensive Review," International Journal of Multimedia Systems, Vol.8, pp 536--544, 2003.
 
32
33
34

Collaborative Colleagues:
Rongrong Ji: colleagues
Hongxun Yao: colleagues