skip to main content
10.1145/2808492.2808543acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicimcsConference Proceedingsconference-collections
research-article

Improving bag-of-words representation with efficient twin feature integration

Published:19 August 2015Publication History

ABSTRACT

In recent years, the Bag-of-Words (BoW) model has been widely used in most state-of-the-art large-scale image retrieval systems. However, the standard BoW based systems suffer from low discriminative power of local features as well as quantization errors that significantly affect the retrieval performance. In this paper, twin feature is employed and well combined with two advanced techniques including Hamming Embedding (HE) and Multiple Assignment (MA) to construct a discriminative image retrieval system on BoW representation in an efficient way. Experimental results on two benchmark datasets Oxford5k and Paris6k demonstrate that the proposed technique can greatly refine the visual matching process and enhance the final performance for image retrieval.

References

  1. Y. Cao, C. Wang, Z. Li, L. Zhang, and L. Zhang. Spatial-bag-of-features. In Proc. CVPR'10, pages 3352--3359, 2010.Google ScholarGoogle Scholar
  2. M. Fischler and R. Bolles. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6):381--395, 1981. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. H. Jégou, M. Douze, and C. Schmid. Hamming embedding and weak geometric consistency for large scale image search. In Proc. ECCV'08, pages 304--317, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. H. Jégou, M. Douze, and C. Schmid. Improving bag-of-features for large scale image search. IJCV, 87(3):316--336, May 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. H. Jégou, H. Harzallah, and C. Schmid. A contextual dissimilarity measure for accurate and efficient image search. In Proc. CVPR'07, pages 1--8, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  6. D. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60(2):91--110, Nov. 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. K. Mikolajczyk and C. Schmid. Scale and affine invariant interest point detectors. IJCV, 60(1):63--86, Oct. 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. Nister and H. Stewenius. Scalable recognition with a vocabulary tree. In Proc. CVPR'06, pages 2161--2168, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Philbin, R. Arandjelovic, and A. Zisserman. Flickr100k image dataset, http://www.robots.ox.ac.uk/~vgg/data/oxbuildings/flickr100k.html.Google ScholarGoogle Scholar
  10. J. Philbin, R. Arandjelovic, and A. Zisserman. Oxford5k image dataset. http://www.robots.ox.ac.uk/vgg/data/oxbuildings/.Google ScholarGoogle Scholar
  11. J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In Proc. CVPR'07, pages 1--8, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  12. J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Lost in quantization: Improving particular object retrieval in large scale image databases. In Proc. CVPR'08, pages 1--8, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  13. J. Philbin and A. Zisserman. Paris6k image dataset. http://www.robots.ox.ac.uk/vgg/data/parisbuildings/.Google ScholarGoogle Scholar
  14. J. Sivic and A. Zisserman. Video google: A text retrieval approach to object matching in videos. In Proc. ICCV'03, pages 1470--1477, Oct. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. L. Wang, H. Wang, and F. Zhu. Twin feature and similarity maximal matching for image retrieval. In Proc. ICMR'15, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. P. Xu, L. Zhang, K. Yang, and H. Yao. Nested-SIFT for efficient image matching and retrieval. MultiMedia, 430(3):34--46, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Improving bag-of-words representation with efficient twin feature integration

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICIMCS '15: Proceedings of the 7th International Conference on Internet Multimedia Computing and Service
        August 2015
        397 pages
        ISBN:9781450335287
        DOI:10.1145/2808492
        • General Chairs:
        • Ramesh Jain,
        • Shuqiang Jiang,
        • Program Chairs:
        • John Smith,
        • Jitao Sang,
        • Guohui Li

        Copyright © 2015 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 19 August 2015

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        ICIMCS '15 Paper Acceptance Rate20of128submissions,16%Overall Acceptance Rate163of456submissions,36%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader