skip to main content
10.1145/3240508.3240516acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Deep Triplet Quantization

Authors Info & Claims
Published:15 October 2018Publication History

ABSTRACT

Deep hashing establishes efficient and effective image retrieval by end-to-end learning of deep representations and hash codes from similarity data. We present a compact coding solution, focusing on deep learning to quantization approach that has shown superior performance over hashing solutions for similarity retrieval. We propose Deep Triplet Quantization (DTQ), a novel approach to learning deep quantization models from the similarity triplets. To enable more effective triplet training, we design a new triplet selection approach, Group Hard, that randomly selects hard triplets in each image group. To generate compact binary codes, we further apply a triplet quantization with weak orthogonality during triplet training. The quantization loss reduces the codebook redundancy and enhances the quantizability of deep representations through back-propagation. Extensive experiments demonstrate that DTQ can generate high-quality and compact binary codes, which yields state-of-the-art image retrieval performance on three benchmark datasets, NUS-WIDE, CIFAR-10, and MS-COCO.

References

  1. Yue Cao, Mingsheng Long, Jianmin Wang, and Shichen Liu. 2017. Deep visual-semantic quantization for efficient image retrieval. In CVPR.Google ScholarGoogle Scholar
  2. Yue Cao, Mingsheng Long, Jianmin Wang, Han Zhu, and Qingfu Wen. 2016. Deep Quantization Network for Efficient Image Retrieval. AAAI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Zhangjie Cao, Mingsheng Long, Jianmin Wang, and Philip S Yu. 2017. HashNet: Deep Learning to Hash by Continuation. ICCV (2017).Google ScholarGoogle Scholar
  4. Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yan-Tao Zheng. 2009. NUS-WIDE: A Real-World Web Image Database from National University of Singapore. In ICMR. ACM.Google ScholarGoogle Scholar
  5. Thanh-Toan Do, Anh-Dzung Doan, and Ngai-Man Cheung. 2016. Learning to hash with binary deep neural network. In ECCV. Springer.Google ScholarGoogle Scholar
  6. J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, and T. Darrell. 2014. DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. In ICML. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Chao Du and Jingdong Wang. 2014. Inner Product Similarity Search using Compositional Codes. CoRR Vol. abs/1406.4966 (2014).Google ScholarGoogle Scholar
  8. Venice Erin Liong, Jiwen Lu, Gang Wang, Pierre Moulin, and Jie Zhou. 2015. Deep Hashing for Compact Binary Codes Learning. In CVPR. IEEE.Google ScholarGoogle Scholar
  9. D. J. Fleet, A. Punjani, and M. Norouzi. 2012. Fast search in Hamming space with multi-index hashing, In CVPR. CVPR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Tiezheng Ge, Kaiming He, Qifa Ke, and Jian Sun. 2014. Optimized Product Quantization. TPAMI (2014). Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Aristides Gionis, Piotr Indyk, Rajeev Motwani, et al. 1999. Similarity search in high dimensions via hashing. In VLDB, Vol. Vol. 99. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Yunchao Gong, Sudhakar Kumar, Henry Rowley, Svetlana Lazebnik, et al. 2013. Learning binary codes for high-dimensional data using bilinear projections. In CVPR. IEEE, 484--491. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Yunchao Gong and Svetlana Lazebnik. 2011. Iterative quantization: A procrustean approach to learning binary codes. In CVPR. 817--824. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. CVPR (2016).Google ScholarGoogle Scholar
  15. Himalaya Jain, Joaquin Zepeda, Patrick Pérez, and Rémi Gribonval. 2017. SUBIC: A supervised, structured binary code for image search. In ICCV.Google ScholarGoogle Scholar
  16. H. Jegou, M. Douze, and C. Schmid. 2011. Product Quantization for Nearest Neighbor Search. TPAMI Vol. 33, 1 (Jan. 2011), 117--128. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. A. Krizhevsky, I. Sutskever, and G. E. Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In NIPS. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Brian Kulis and Trevor Darrell. 2009. Learning to hash with binary reconstructive embeddings. In NIPS. 1042--1050. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Hanjiang Lai, Yan Pan, Ye Liu, and Shuicheng Yan. 2015. Simultaneous Feature Learning and Hash Coding with Deep Neural Networks. In CVPR. IEEE.Google ScholarGoogle Scholar
  20. Wu-Jun Li, Sheng Wang, and Wang-Cheng Kang. 2016. Feature learning based deep supervised hashing with pairwise labels. In IJCAI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In ECCV. Springer, 740--755.Google ScholarGoogle Scholar
  22. Haomiao Liu, Ruiping Wang, Shiguang Shan, and Xilin Chen. 2016. Deep supervised hashing for fast image retrieval. In CVPR. 2064--2072.Google ScholarGoogle Scholar
  23. Wei Liu, Jun Wang, Rongrong Ji, Yu-Gang Jiang, and Shih-Fu Chang. 2012. Supervised hashing with kernels. In CVPR. IEEE.Google ScholarGoogle Scholar
  24. Wei Liu, Jun Wang, Sanjiv Kumar, and Shih-Fu Chang. 2011. Hashing with Graphs. In ICML. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Xianglong Liu, Junfeng He, Bo Lang, and Shih-Fu Chang. 2013. Hash bit selection: a unified solution for selection problems in hashing. In CVPR. IEEE. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Mingsheng Long, Yue Cao, Jianmin Wang, and Philip S. Yu. 2016. Composite Correlation Quantization for Efficient Multimodal Retrieval. In SIGIR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Mohammad Norouzi and David M. Blei. 2011. Minimal loss hashing for compact binary codes. In ICML. ACM, 353--360. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Mohammad Norouzi, David M. Blei, and Ruslan R. Salakhutdinov. 2012. Hamming distance metric learning. In NIPS. 1061--1069. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Ruslan Salakhutdinov and Geoffrey E. Hinton. 2007. Learning a nonlinear embedding by preserving class neighbourhood structure. In AISTATS. 412--419.Google ScholarGoogle Scholar
  30. Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015. Supervised Discrete Hashing. In CVPR. IEEE.Google ScholarGoogle Scholar
  31. L. J. P van der Maaten and G. E. Hinton. Nov 2008. Visualizing High-Dimensional Data Using t-SNE. JMLR Vol. 9: 2579--2605 (Nov 2008).Google ScholarGoogle Scholar
  32. Jun Wang, Sanjiv Kumar, and Shih-Fu Chang. 2012. Semi-supervised hashing for large-scale search. TPAMI Vol. 34, 12 (2012), 2393--2406. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Jingdong Wang, Ting Zhang, Jingkuan Song, Nicu Sebe, and Heng Tao Shen. 2018. A Survey on Learning to Hash. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 40, 4 (Feb. 2018), 769--790.Google ScholarGoogle ScholarCross RefCross Ref
  34. Xiaojuan Wang, Ting Zhang, Guo-Jun Qi, Jinhui Tang, and Jingdong Wang. 2016. Supervised quantization for similarity search. In CVPR.Google ScholarGoogle Scholar
  35. Yair Weiss, Antonio Torralba, and Rob Fergus. 2009. Spectral Hashing. In NIPS.Google ScholarGoogle Scholar
  36. Rongkai Xia, Yan Pan, Hanjiang Lai, Cong Liu, and Shuicheng Yan. 2014. Supervised hashing for image retrieval via image representation learning. In AAAI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Felix X. Yu, Sanjiv Kumar, Yunchao Gong, and Shih-Fu Chang. 2014. Circulant binary embedding. In ICML. ACM, 353--360. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Peichao Zhang, Wei Zhang, Wu-Jun Li, and Minyi Guo. 2014. Supervised hashing with latent factor models. In SIGIR. ACM, 173--182. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Ting Zhang, Chao Du, and Jingdong Wang. 2014. Composite Quantization for Approximate Nearest Neighbor Search. In ICML. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Liming Zhao, Xi Li, Jingdong Wang, and Yueting Zhuang. 2017. Deeply-learned part-aligned representations for person re-identification. In ICCV.Google ScholarGoogle Scholar
  41. Han Zhu, Mingsheng Long, Jianmin Wang, and Yue Cao. 2016. Deep Hashing Network for Efficient Similarity Retrieval. AAAI. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Deep Triplet Quantization

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        MM '18: Proceedings of the 26th ACM international conference on Multimedia
        October 2018
        2167 pages
        ISBN:9781450356657
        DOI:10.1145/3240508

        Copyright © 2018 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 15 October 2018

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        MM '18 Paper Acceptance Rate209of757submissions,28%Overall Acceptance Rate995of4,171submissions,24%

        Upcoming Conference

        MM '24
        MM '24: The 32nd ACM International Conference on Multimedia
        October 28 - November 1, 2024
        Melbourne , VIC , Australia

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader