ACM Home Page
Please provide us with feedback. Feedback
Photo-to-search: using multimodal queries to search the web from mobile devices
Full text PdfPdf (564 KB)
Source International Multimedia Conference archive
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval table of contents
Hilton, Singapore
POSTER SESSION: Poster session 2: image/WWW-based system and applications table of contents
Pages: 143 - 150  
Year of Publication: 2005
ISBN:1-59593-244-5
Authors
Xin Fan  University of Science and Technology of China, Hefei, P.R. China
Xing Xie  Microsoft Research Asia, Beijing, P.R.China
Zhiwei Li  Microsoft Research Asia, Beijing, P.R.China
Mingjing Li  Microsoft Research Asia, Beijing, P.R.China
Wei-Ying Ma  Microsoft Research Asia, Beijing, P.R.China
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 118,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1101826.1101851
What is a DOI?

ABSTRACT

Nowadays, mobile phones with the digital camera are getting more and more popular. With necessary technologies, they are possible to become a powerful tool to search the Web on the go. Most Web search engines only support text queries. Therefore, users have to convert their information needs into words. However, it is sometimes difficult to describe the needs in text and the text input is inconvenient on small devices. To solve the problem, we propose a system named Photo-to-Search which allows users to input multimodal queries. Particularly, we study queries with captured images and optional text messages in this paper. For example, the user can simply take a photo of the flower and input a few terms like "flower". Textually relevant Web images are retrieved according to the query terms. Afterwards, the snapped picture is compared with these images by the CBIR (Content Based Image Retrieval) method. According to the context of the visually similar images, related key phrases are extracted. Finally, the search results are returned in multiple forms. Our system can also search for very similar images on the Web, such as movie posters or photos of film stars, to find related information. Experimental results on the large scale data showed our system achieved satisfactory efficiency and performance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
E. Chang, C. Li, J. Z. Wang, et al., Searching near-replicas of images via clustering, Proc. of SPIE Multimedia Storage and Archiving System VI, vol.3846, pp.281--292, Boston, USA, Sep. 1999.
4
 
5
 
6
 
7
Google Mobile Search, http://www.google.com/xhtml
 
8
Google SMS, http://www.google.com/sms/
 
9
J. S. Hare and P. H. Lewis, Content-based image retrieval using a mobile device as a novel interface, Proc. of SPIE Storage and Retrieval Methods and Applications for Multimedia 2005, vol.5682, pp.64--75, San Jose, USA, Jan. 2005.
 
10
A. Jaimes, S.-F Chang, and A.C. Loui, Detection of non-identical duplicate consumer photographs, Proc. of the Fourth Pacific Rim Conference on Multimedia, vol.1, pp.16--20, Singapore, Dec. 2003.
11
 
12
C. Kim, Content-based image copy detection, Signal Processing: Image Communication, vol.18, no.3, pp.169--184, Mar. 2003.
13
 
14
M. Noda, H. Sonobe, S. Takagi, and F. Yoshimoto, Cosmos: convenient image retrieval system of flowers for mobile computing situations, Proc. of the IASTED Conference on Information Systems and Databases 2002, pp.25--30, Tokyo, Japan, Sep. 2002.
 
15
M. F. Porter, An algorithm for suffix stripping, Program, vol.14, no.3, pp.130--137, 1980.
 
16
N. Sebe, Q. Tian, E. Loupias, M. Lew, and T. Huang, Evaluation of salient point techniques, Image and Vision Computing, vol.21, pp.1087--1095, 2003.
17
 
18
H. Sonobe, S. Takagi, and F. Yoshimoto, Image retrieval system of fishes using a mobile device, Proc. of International Workshop on Advanced Image Technology 2004, pp.33--37, Singapore, Jan. 2004.
 
19
 
20
W3C Document Object Model, http://www.w3.org/DOM/
 
21
Yahoo! Mobile, http://mobile.yahoo.com
 
22
T. Yeh, K. Tollmar, and T. Darrell, Searching the Web with mobile images for location recognition, Proc. of IEEE Conference on Computer Vision and Pattern Recognition, vol.2, pp.76--81, Washington D.C., USA, Jun. 2004.
23


Collaborative Colleagues:
Xin Fan: colleagues
Xing Xie: colleagues
Zhiwei Li: colleagues
Mingjing Li: colleagues
Wei-Ying Ma: colleagues