research-article

Scalable mining of small visual objects

Authors:

Pierre Letessier,

Olivier Buisson,

Alexis JolyAuthors Info & Claims

MM '12: Proceedings of the 20th ACM international conference on Multimedia

Pages 599 - 608

https://doi.org/10.1145/2393347.2393431

Published: 29 October 2012 Publication History

Abstract

This paper presents a scalable method for automatically discovering frequent visual objects in large multimedia collections even if their size is very small. It first formally revisits the problem of mining or discovering such objects, and then generalizes two kinds of existing methods for probing candidate object seeds: weighted adaptive sampling and hashing-based methods. The idea is that the collision frequencies obtained with hashing-based methods can actually be converted into a prior probability density function given as input to a weighted adaptive sampling algorithm. This allows for an evaluation of any hashing scheme effectiveness in a more generalized way, and a comparison with other priors, e.g. guided by visual saliency concerns. We then introduce a new hashing strategy, working first at the visual level, and then at the geometric level. This strategy allows us to integrate weak geometric constraints into the hashing phase itself and not only neighborhood constraints as in previous works. Experiments conducted on a new dataset introduced in this paper will show that using this new hashing-based prior allows a drastic reduction of the number of tentative probes required to discover small objects instantiated several times in a large dataset.

References

[1]

O. Chum, A. Mikulik, M. Perdoch, and J. Matas. Total recall ii: Query expansion revisited. In CVPR, pages 889--896, Colorado Springs, USA, 2011. IEEE.

Digital Library

[2]

O. Chum, M. Perdoch, and J. Matas. Geometric min-hashing: Finding a (thick) needle in a haystack. In CVPR, pages 17--24. IEEE, June 200

[3]

O. Chum, J. Philbin, J. Sivic, M. Isard, and A. Zisserman. Total recall: Automatic query expansion with a generative feature model for object retrieval. In ICCV. IEEE, October 2007.

[4]

M. Datar and P. Indyk. Locality-sensitive hashing scheme based on p-stable distributions. In SCG, pages 253--262. ACM Press, 2004.

Digital Library

[5]

M. A. Fischler and R. C. Bolles. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM}, 24(6):381--395, 1981.

Digital Library

[6]

H. Jégou, M. Douze, and C. Schmid. Hamming Embedding and Weak Geometry Consistency for Large Scale Image Search. Research Report RR-6709, INRIA, 2008.

[7]

A. Joly and O. Buisson. A Posteriori Multi-Probe Locality Sensitive Hashing. In ACM Multimedia, pages 209--218, Vancouver, Canada, oct 2008.

Digital Library

[8]

A. Joly and O. Buisson. Logo retrieval with a contrario visual query expansion. In ACM Multimedia, pages 581--584, Beijing, China, october 2009.

Digital Library

[9]

A. Joly and O. Buisson. Random Maximum Margin Hashing. In CVPR, Colorado Springs, 2011. IEEE.

Digital Library

[10]

Y. Kalantidis, L. Pueyo, M. Trevisiol, R. van Zwol, and Y. Avrithis. Scalable triangulation-based logo recognition. In ICMR, pages 20:1--20:7, Trento, Italy, 2011. ACM.

Digital Library

[11]

P. Letessier, O. Buisson, and A. Joly. Consistent visual words mining with adaptive sampling. In ICMR, pages 49:1--49:8. ACM, april 2011.

Digital Library

[12]

K. Ling and G. Wu. Frequency based locality sensitive hashing. In ICMT, pages 4929 --4932, july 2011.

[13]

D. Lowe. Object recognition from local scale-invariant features. In ICCV, page 1150, Kerkyra, 1999. IEEE.

Digital Library

[14]

F. Olken. Random Sampling from Databases. PhD thesis, University of California, 1993.

[15]

J. Philbin. Scalable Object Retrieval in Very Large Image Collections. PhD thesis, Univ. of Oxford, 2010.

[16]

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Lost in quantization: Improving particular object retrieval in large scale image databases. In CVPR}, pages 1--8, Anchorage, USA, june 2008. IEEE.

[17]

G. F. Pineda, H. Koga, and T. Watanabe. Object discovery by clustering correlated visual word sets. In ICPR, pages 750--753, Washington, USA, 2010. IEEE.

Digital Library

[18]

T. Tuytelaars, C. H. Lampert, M. B. Blaschko, and W. Buntine. Unsupervised object discovery: A comparison. IJCV, 88:284--302, June 2010.

Digital Library

[19]

T. Weyand and B. Leibe. Discovering favorite views of popular places with iconoid shift. In ICCV, pages 1132--1139. IEEE, Nov. 2011.

Digital Library

[20]

W. Zhao, X. Wu, and C.-W. Ngo. On the annotation of web videos by efficient near-duplicate search. IEEE Transactions on Multimedia, 12(5):448--461, 2010.

Digital Library

Cited By

Nag SRaychaudhuri DPaul SRoy-Chowdhury A(2023)Reconstruction Guided Meta-Learning for Few Shot Open Set RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.332073145:12(15394-15405)Online publication date: Dec-2023
https://doi.org/10.1109/TPAMI.2023.3320731
Kang JAhn S(2022)Variational Multi-Prototype Encoder for Object Recognition Using Multiple Prototype ImagesIEEE Access10.1109/ACCESS.2022.315185610(19586-19598)Online publication date: 2022
https://doi.org/10.1109/ACCESS.2022.3151856
Hou QMin WWang JHou SZheng YJiang SShen HZhuang YSmith JYang YCesar PMetze FPrabhakaran B(2021)FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling NetworkProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475289(4670-4679)Online publication date: 17-Oct-2021
https://dl.acm.org/doi/10.1145/3474085.3475289
Show More Cited By

Index Terms

Scalable mining of small visual objects
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing

Recommendations

DSH: data sensitive hashing for high-dimensional k-nnsearch
SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data

The need to locate the k-nearest data points with respect to a given query point in a multi- and high-dimensional space is common in many applications. Therefore, it is essential to provide efficient support for such a search. Locality Sensitive Hashing ...
An Analytical Study on Frequent Itemset Mining Algorithms
MIKE 2013: Proceedings of the First International Conference on Mining Intelligence and Knowledge Exploration - Volume 8284

Data mining is the process of collecting, extracting and analyzing large data set from different perspectives. Fundamental and important task of data mining is the mining of frequent itemsets. Frequent itemsets play an important role in association rule ...
Efficient algorithm for the extraction of association rules in data mining
ICCSA'06: Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part II

The problem of data mining is to discover the pattern or trend in huge volume of data. The problem is similar to knowledge discovery in artificial intelligence. Here our goal is to discover rules that reflect the pattern in the data. These rules are ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '12: Proceedings of the 20th ACM international conference on Multimedia

October 2012

1584 pages

ISBN:9781450310895

DOI:10.1145/2393347

General Chairs:
Noboru Babaguchi
Osaka University, Japan
,
Kiyoharu Aizawa
The University of Tokyo, Japan
,
John Smith
IBM, USA
,
Program Chairs:
Shin'ichi Satoh
National Institute of Informatics, Japan
,
Thomas Plagemann
University of Oslo, Norway
,
Xian-Sheng Hua
Microsoft, USA
,
Rong Yan
Facebook, USA

Copyright © 2012 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 October 2012

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

MM '12

Sponsor:

SIGMM

MM '12: ACM Multimedia Conference

October 29 - November 2, 2012

Nara, Japan

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

20
Total Citations
View Citations
228
Total Downloads

Downloads (Last 12 months)3
Downloads (Last 6 weeks)1

Reflects downloads up to 03 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Nag SRaychaudhuri DPaul SRoy-Chowdhury A(2023)Reconstruction Guided Meta-Learning for Few Shot Open Set RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.332073145:12(15394-15405)Online publication date: Dec-2023
https://doi.org/10.1109/TPAMI.2023.3320731
Kang JAhn S(2022)Variational Multi-Prototype Encoder for Object Recognition Using Multiple Prototype ImagesIEEE Access10.1109/ACCESS.2022.315185610(19586-19598)Online publication date: 2022
https://doi.org/10.1109/ACCESS.2022.3151856
Hou QMin WWang JHou SZheng YJiang SShen HZhuang YSmith JYang YCesar PMetze FPrabhakaran B(2021)FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling NetworkProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475289(4670-4679)Online publication date: 17-Oct-2021
https://dl.acm.org/doi/10.1145/3474085.3475289
Nguyen NNguyen TDo TNgo TLe D(2020)U15-Logos: Unconstrained Logo Dataset with Evaluation by Deep learning Methods2020 International Conference on Multimedia Analysis and Pattern Recognition (MAPR)10.1109/MAPR49794.2020.9237769(1-6)Online publication date: Oct-2020
https://doi.org/10.1109/MAPR49794.2020.9237769
Kim JOh TLee SPan FKweon I(2019)Variational Prototyping-Encoder: One-Shot Learning With Prototypical Images2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR.2019.00969(9454-9462)Online publication date: Jun-2019
https://doi.org/10.1109/CVPR.2019.00969
Gadeski EFard HLe Borgne H(2018)GPU deformable part model for object recognitionJournal of Real-Time Image Processing10.1007/s11554-014-0447-514:2(279-291)Online publication date: 1-Feb-2018
https://dl.acm.org/doi/10.1007/s11554-014-0447-5
Li WLi JWang CZhang LZhang B(2018)Visual instance mining from the graph perspectiveMultimedia Systems10.1007/s00530-016-0533-624:2(147-162)Online publication date: 1-Mar-2018
https://dl.acm.org/doi/10.1007/s00530-016-0533-6
Awad GKraaij WOver PSatoh S(2017)Instance search retrospective with focus on TRECVIDInternational Journal of Multimedia Information Retrieval10.1007/s13735-017-0121-36:1(1-29)Online publication date: 22-Feb-2017
https://doi.org/10.1007/s13735-017-0121-3
Qu BVallet FCarrive JGravier G(2017)Content-based unsupervised segmentation of recurrent TV programs using grammatical inferenceMultimedia Tools and Applications10.1007/s11042-017-4816-576:21(22569-22597)Online publication date: 1-Nov-2017
https://dl.acm.org/doi/10.1007/s11042-017-4816-5
Bhattacharjee SYuan JTan YDuan L(2016)Query-Adaptive Small Object Search Using Object Proposals and Shape-Aware DescriptorsIEEE Transactions on Multimedia10.1109/TMM.2016.253260118:4(726-737)Online publication date: Apr-2016
https://doi.org/10.1109/TMM.2016.2532601
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten