|
ABSTRACT
Nowadays, mobile phones with the digital camera are getting more and more popular. With necessary technologies, they are possible to become a powerful tool to search the Web on the go. Most Web search engines only support text queries. Therefore, users have to convert their information needs into words. However, it is sometimes difficult to describe the needs in text and the text input is inconvenient on small devices. To solve the problem, we propose a system named Photo-to-Search which allows users to input multimodal queries. Particularly, we study queries with captured images and optional text messages in this paper. For example, the user can simply take a photo of the flower and input a few terms like "flower". Textually relevant Web images are retrieved according to the query terms. Afterwards, the snapped picture is compared with these images by the CBIR (Content Based Image Retrieval) method. According to the context of the visually similar images, related key phrases are extracted. Finally, the search results are returned in multiple forms. Our system can also search for very similar images on the Web, such as movie posters or photos of film stars, to find related information. Experimental results on the large scale data showed our system achieved satisfactory efficiency and performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
E. Chang, C. Li, J. Z. Wang, et al., Searching near-replicas of images via clustering, Proc. of SPIE Multimedia Storage and Archiving System VI, vol.3846, pp.281--292, Boston, USA, Sep. 1999.
|
 |
4
|
Zheng Chen , Liu Wenyin , Chunhui Hu , Mingjing Li , Hong-Jiang Zhang, iFind: a web image search engine, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, p.450, September 2001, New Orleans, Louisiana, United States
[doi> 10.1145/383952.384091]
|
| |
5
|
|
| |
6
|
Myron Flickner , Harpreet Sawhney , Wayne Niblack , Jonathan Ashley , Qian Huang , Byron Dom , Monika Gorkani , Jim Hafner , Denis Lee , Dragutin Petkovic , David Steele , Peter Yanker, Query by Image and Video Content: The QBIC System, Computer, v.28 n.9, p.23-32, September 1995
[doi> 10.1109/2.410146
]
|
| |
7
|
Google Mobile Search, http://www.google.com/xhtml
|
| |
8
|
Google SMS, http://www.google.com/sms/
|
| |
9
|
J. S. Hare and P. H. Lewis, Content-based image retrieval using a mobile device as a novel interface, Proc. of SPIE Storage and Retrieval Methods and Applications for Multimedia 2005, vol.5682, pp.64--75, San Jose, USA, Jan. 2005.
|
| |
10
|
A. Jaimes, S.-F Chang, and A.C. Loui, Detection of non-identical duplicate consumer photographs, Proc. of the Fourth Pacific Rim Conference on Multimedia, vol.1, pp.16--20, Singapore, Dec. 2003.
|
 |
11
|
|
| |
12
|
C. Kim, Content-based image copy detection, Signal Processing: Image Communication, vol.18, no.3, pp.169--184, Mar. 2003.
|
 |
13
|
|
| |
14
|
M. Noda, H. Sonobe, S. Takagi, and F. Yoshimoto, Cosmos: convenient image retrieval system of flowers for mobile computing situations, Proc. of the IASTED Conference on Information Systems and Databases 2002, pp.25--30, Tokyo, Japan, Sep. 2002.
|
| |
15
|
M. F. Porter, An algorithm for suffix stripping, Program, vol.14, no.3, pp.130--137, 1980.
|
| |
16
|
N. Sebe, Q. Tian, E. Loupias, M. Lew, and T. Huang, Evaluation of salient point techniques, Image and Vision Computing, vol.21, pp.1087--1095, 2003.
|
 |
17
|
|
| |
18
|
H. Sonobe, S. Takagi, and F. Yoshimoto, Image retrieval system of fishes using a mobile device, Proc. of International Workshop on Advanced Image Technology 2004, pp.33--37, Singapore, Jan. 2004.
|
| |
19
|
|
| |
20
|
W3C Document Object Model, http://www.w3.org/DOM/
|
| |
21
|
Yahoo! Mobile, http://mobile.yahoo.com
|
| |
22
|
T. Yeh, K. Tollmar, and T. Darrell, Searching the Web with mobile images for location recognition, Proc. of IEEE Conference on Computer Vision and Pattern Recognition, vol.2, pp.76--81, Washington D.C., USA, Jun. 2004.
|
 |
23
|
Tom Yeh , Kristen Grauman , Konrad Tollmar , Trevor Darrell, A picture is worth a thousand keywords: image-based object search on a mobile platform, CHI '05 extended abstracts on Human factors in computing systems, April 02-07, 2005, Portland, OR, USA
[doi> 10.1145/1056808.1057083]
|
CITED BY 3
|
|
|
Jing Liu , Bin Wang , Mingjing Li , Zhiwei Li , Weiying Ma , Hanqing Lu , Songde Ma, Dual cross-media relevance model for image annotation, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
Changhu Wang , Feng Jing , Lei Zhang , Hong-Jiang Zhang, Scalable search-based image annotation of personal images, Proceedings of the 8th ACM international workshop on Multimedia information retrieval, October 26-27, 2006, Santa Barbara, California, USA
|
|