research-article

Deep Supervised Quantization by Self-Organizing Map

Authors:
Min Wang

University of Science and Technology of China, Hefei, China

University of Science and Technology of China, Hefei, China
View Profile

,
Wengang Zhou

University of Science and Technology of China, Hefei, China

University of Science and Technology of China, Hefei, China
View Profile

,
Qi Tian

University of Texas at San Antonio, San Antonio, TX, USA

University of Texas at San Antonio, San Antonio, TX, USA
View Profile

,
Junfu Pu

University of Science and Technology of China, Hefei, China

University of Science and Technology of China, Hefei, China
View Profile

,
Houqiang Li

University of Science and Technology of China, Hefei, China

University of Science and Technology of China, Hefei, China
View Profile

MM '17: Proceedings of the 25th ACM international conference on MultimediaOctober 2017Pages 1707–1715https://doi.org/10.1145/3123266.3123415

Published:23 October 2017Publication History

MM '17: Proceedings of the 25th ACM international conference on Multimedia

Pages 1707–1715

ABSTRACT

Approximate Nearest Neighbour (ANN) search is an important research topic in multimedia and computer vision fields. In this paper, we propose a new deep supervised quantization method by Self-Organizing Map (SOM) to address this problem. Our method integrates the Convolutional Neural Networks (CNN) and Self-Organizing Map into a unified deep architecture. The overall training objective includes supervised quantization loss and classification loss. With the supervised quantization loss, we minimize the differences on the maps between similar image pairs, and maximize the differences on the maps between dissimilar image pairs. By optimization, the deep architecture can simultaneously extract deep features and quantize the features into the suitable nodes in the Self-Organizing Map. The experiments on several public standard datasets prove the superiority of our approach over the existing ANN search methods. Besides, as a byproduct, our deep architecture can be directly applied to classification task and visualization with little modification, and promising performances are demonstrated on these tasks in the experiments.

References

Ken Chatfield, Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. 2014. Return of the devil in the details: Delving deep into convolutional nets. arXiv preprint arXiv:1405.3531 (2014).Google Scholar
Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: a real-world web image database from National University of Singapore ACM International Conference on Image and Video Retrieval. 48. Google ScholarDigital Library
Yunchao Gong, Svetlana Lazebnik, Albert Gordo, and Florent Perronnin. 2013. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, 12 (2013), 2916--2929. Google ScholarDigital Library
Yen-Chang Hsu and Zsolt Kira. 2015. Neural network-based clustering using pairwise constraints. arXiv preprint arXiv:1511.06321 (2015).Google Scholar
Yen-Chang Hsu, Zhaoyang Lv, and Zsolt Kira. 2016. Deep Image Category Discovery using a Transferred Similarity Function. arXiv preprint arXiv:1612.01253 (2016).Google Scholar
Herve Jegou, Matthijs Douze, and Cordelia Schmid. 2011. Product quantization for nearest neighbor search. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 33, 1 (2011), 117--128. Google ScholarDigital Library
Teuvo Kohonen. 1982. Self-organized formation of topologically correct feature maps. Biological Cybernetics Vol. 43, 1 (1982), 59--69.Google ScholarCross Ref
Teuvo Kohonen and Timo Honkela. 2007. Kohonen network. Scholarpedia, Vol. 2, 1 (2007), 1568.Google ScholarCross Ref
Alex Krizhevsky and Geoffrey Hinton. 2009. Learning multiple layers of features from tiny images. Technical report, University of Toronto (2009).Google Scholar
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks Advances in Neural Information Processing Systems. 1097--1105. Google ScholarDigital Library
Hanjiang Lai, Yan Pan, Ye Liu, and Shuicheng Yan. 2015. Simultaneous feature learning and hash coding with deep neural networks IEEE Conference on Computer Vision and Pattern Recognition. 3270--3278.Google Scholar
Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. Proc. IEEE Vol. 86, 11 (1998), 2278--2324.Google ScholarCross Ref
Wu-Jun Li, Sheng Wang, and Wang-Cheng Kang. 2015. Feature learning based deep supervised hashing with pairwise labels. arXiv preprint arXiv:1511.03855 (2015). Google ScholarDigital Library
Renjie Liao, Alex Schwing, Richard Zemel, and Raquel Urtasun. 2016. Learning deep parsimonious representations. In Advances in Neural Information Processing Systems. 5076--5084.Google Scholar
Haomiao Liu, Ruiping Wang, Shiguang Shan, and Xilin Chen. 2016. Deep supervised hashing for fast image retrieval. IEEE Conference on Computer Vision and Pattern Recognition. 2064--2072.Google ScholarCross Ref
Zhen Liu, Houqiang Li, Wengang Zhou, Ruizhen Zhao, and Qi Tian. 2014. Contextual hashing for large-scale image search. IEEE Transactions on Image Processing Vol. 23, 4 (2014), 1606--1614. Google ScholarDigital Library
Mohammad Norouzi and David J Fleet. 2013. Cartesian k-means IEEE Conference on Computer Vision and Pattern Recognition. 3017--3024. Google ScholarDigital Library
Pierre Sermanet, David Eigen, Xiang Zhang, Michaël Mathieu, Rob Fergus, and Yann LeCun. 2013. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229 (2013).Google Scholar
Andrea Vedaldi and Karel Lenc. 2015. Matconvnet: Convolutional neural networks for matlab ACM International Conference on Multimedia. ACM, 689--692. Google ScholarDigital Library
Jianfeng Wang, Jingdong Wang, Nenghai Yu, and Shipeng Li. 2013. Order preserving hashing for approximate nearest neighbor search ACM International Conference on Multimedia. 133--142. Google ScholarDigital Library
Min Wang, Wengang Zhou, Qi Tian, Zhengjun Zha, and Houqiang Li. 2016 b. Linear Distance Preserving Pseudo-Supervised and Unsupervised Hashing Proceedings of the 2016 ACM on Multimedia Conference. ACM, 1257--1266. Google ScholarDigital Library
Xiaojuan Wang, Ting Zhang, Guo-Jun Qi, Jinhui Tang, and Jingdong Wang. 2016 a. Supervised quantization for similarity search. In IEEE Conference on Computer Vision and Pattern Recognition. 2018--2026.Google ScholarCross Ref
Yair Weiss, Antonio Torralba, and Rob Fergus. 2009. Spectral hashing Advances in Neural Information Processing Systems. 1753--1760. Google ScholarDigital Library
Rongkai Xia, Yan Pan, Hanjiang Lai, Cong Liu, and Shuicheng Yan. 2014. Supervised Hashing for Image Retrieval via Image Representation Learning. Association for the Advancement of Artificial Intelligence, Vol. Vol. 1. 2. Google ScholarDigital Library
Lei Zhang, Yongdong Zhang, Jinhui Tang, Xiaoguang Gu, Jintao Li, and Qi Tian. 2013. Topology preserving hashing for similarity search. ACM International Conference on Multimedia. 123--132. Google ScholarDigital Library
Ruimao Zhang, Liang Lin, Rui Zhang, Wangmeng Zuo, and Lei Zhang. 2015. Bit-scalable deep hashing with regularized similarity learning for image retrieval and person re-identification. IEEE Transactions on Image Processing Vol. 24, 12 (2015), 4766--4779.Google ScholarDigital Library
Ting Zhang, Chao Du, and Jingdong Wang. 2014. Composite Quantization for Approximate Nearest Neighbor Search International Conference on Machine Learning. 838--846. Google ScholarDigital Library
Fang Zhao, Yongzhen Huang, Liang Wang, and Tieniu Tan. 2015. Deep semantic ranking based hashing for multi-label image retrieval IEEE Conference on Computer Vision and Pattern Recognition. 1556--1564.Google Scholar
Wengang Zhou, Houqiang Li, Richang Hong, Yijuan Lu, and Qi Tian. 2015. BSIFT: Toward data-independent codebook for large scale image search. IEEE Transactions on Image Processing Vol. 24, 3 (2015), 967--979.Google ScholarDigital Library
Wengang Zhou, Yijuan Lu, Houqiang Li, and Qi Tian. 2012. Scalar quantization for large scale image search. ACM International Conference on Multimedia. 169--178. Google ScholarDigital Library
Wengang Zhou, Ming Yang, Houqiang Li, Xiaoyu Wang, Yuanqing Lin, and Qi Tian. 2014. Towards codebook-free: Scalable cascaded hashing for mobile image search. IEEE Transactions on Multimedia Vol. 16, 3 (2014), 601--611. Google ScholarDigital Library
Wengang Zhou, Ming Yang, Xiaoyu Wang, Houqiang Li, Yuanqing Lin, and Qi Tian. 2016. Scalable feature matching by dual cascaded scalar quantization for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), Vol. 38, 1 (2016), 159--171. Google ScholarDigital Library

Index Terms

Deep Supervised Quantization by Self-Organizing Map
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Supervised learning by classification
2. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
      1. Top-k retrieval in databases
    2. Retrieval tasks and goals
      1. Clustering and classification

Recommendations

Deep Scalable Supervised Quantization by Self-Organizing Map

Approximate Nearest Neighbor (ANN) search is an important research topic in multimedia and computer vision fields. In this article, we propose a new deep supervised quantization method by Self-Organizing Map to address this problem. Our method ...
Read More
Conformal self-organizing map on curved seamless surface

This paper presents a new mapping to construct the self-organizing map on the curved seamless surface. This mapping is developed for the planar triangle surface derived from the conformal self-organizing map [C.-Y. Liou, Y.-T. Kuo, Conformal self-...
Read More
Supervised kernel self-organizing map
IScIDE'12: Proceedings of the third Sino-foreign-interchange conference on Intelligent Science and Intelligent Data Engineering

We generalize the traditional supervised self-organizing map to supervised kernel self-organizing map by incorporating the kernel function to further improve its capability of solving non-linear problems. The kernel function maps the low-dimensional ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MM '17: Proceedings of the 25th ACM international conference on Multimedia
October 2017
2028 pages
ISBN:9781450349062
DOI:10.1145/3123266
General Chairs:
Qiong Liu
FXPAL, USA
,
Rainer Lienhart
Universität Augsburg, Germany
,
Haohong Wang
TCL America, USA
,
Program Chairs:
Sheng-Wei "Kuan-Ta" Chen
Academia Sinica, Taiwan
,
Susanne Boll
University of Oldenburg, Germany
,
Phoebe Chen
La Trobe University, Australia
,
Gerald Friedland
Lawrence Livermore National Lab, USA
,
Jia Li
Google, USA
,
Shuicheng Yan
Qihoo 360, China
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 23 October 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
approximate nearest neighbour search
self-organizing map
supervised quantization
Qualifiers
- research-article
Conference

Acceptance Rates
MM '17 Paper Acceptance Rate189of684submissions,28%Overall Acceptance Rate995of4,171submissions,24%
More
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 12
  Total Citations
  View Citations
- 255
  Total Downloads
- Downloads (Last 12 months)6
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Deep Supervised Quantization by Self-Organizing Map

MM '17: Proceedings of the 25th ACM international conference on Multimedia

ABSTRACT

References

Cited By

Index Terms

Recommendations

Deep Scalable Supervised Quantization by Self-Organizing Map

Conformal self-organizing map on curved seamless surface

Supervised kernel self-organizing map