skip to main content
10.1145/2393347.2393431acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Scalable mining of small visual objects

Published: 29 October 2012 Publication History

Abstract

This paper presents a scalable method for automatically discovering frequent visual objects in large multimedia collections even if their size is very small. It first formally revisits the problem of mining or discovering such objects, and then generalizes two kinds of existing methods for probing candidate object seeds: weighted adaptive sampling and hashing-based methods. The idea is that the collision frequencies obtained with hashing-based methods can actually be converted into a prior probability density function given as input to a weighted adaptive sampling algorithm. This allows for an evaluation of any hashing scheme effectiveness in a more generalized way, and a comparison with other priors, e.g. guided by visual saliency concerns. We then introduce a new hashing strategy, working first at the visual level, and then at the geometric level. This strategy allows us to integrate weak geometric constraints into the hashing phase itself and not only neighborhood constraints as in previous works. Experiments conducted on a new dataset introduced in this paper will show that using this new hashing-based prior allows a drastic reduction of the number of tentative probes required to discover small objects instantiated several times in a large dataset.

References

[1]
O. Chum, A. Mikulik, M. Perdoch, and J. Matas. Total recall ii: Query expansion revisited. In CVPR, pages 889--896, Colorado Springs, USA, 2011. IEEE.
[2]
O. Chum, M. Perdoch, and J. Matas. Geometric min-hashing: Finding a (thick) needle in a haystack. In CVPR, pages 17--24. IEEE, June 200
[3]
O. Chum, J. Philbin, J. Sivic, M. Isard, and A. Zisserman. Total recall: Automatic query expansion with a generative feature model for object retrieval. In ICCV. IEEE, October 2007.
[4]
M. Datar and P. Indyk. Locality-sensitive hashing scheme based on p-stable distributions. In SCG, pages 253--262. ACM Press, 2004.
[5]
M. A. Fischler and R. C. Bolles. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM}, 24(6):381--395, 1981.
[6]
H. Jégou, M. Douze, and C. Schmid. Hamming Embedding and Weak Geometry Consistency for Large Scale Image Search. Research Report RR-6709, INRIA, 2008.
[7]
A. Joly and O. Buisson. A Posteriori Multi-Probe Locality Sensitive Hashing. In ACM Multimedia, pages 209--218, Vancouver, Canada, oct 2008.
[8]
A. Joly and O. Buisson. Logo retrieval with a contrario visual query expansion. In ACM Multimedia, pages 581--584, Beijing, China, october 2009.
[9]
A. Joly and O. Buisson. Random Maximum Margin Hashing. In CVPR, Colorado Springs, 2011. IEEE.
[10]
Y. Kalantidis, L. Pueyo, M. Trevisiol, R. van Zwol, and Y. Avrithis. Scalable triangulation-based logo recognition. In ICMR, pages 20:1--20:7, Trento, Italy, 2011. ACM.
[11]
P. Letessier, O. Buisson, and A. Joly. Consistent visual words mining with adaptive sampling. In ICMR, pages 49:1--49:8. ACM, april 2011.
[12]
K. Ling and G. Wu. Frequency based locality sensitive hashing. In ICMT, pages 4929 --4932, july 2011.
[13]
D. Lowe. Object recognition from local scale-invariant features. In ICCV, page 1150, Kerkyra, 1999. IEEE.
[14]
F. Olken. Random Sampling from Databases. PhD thesis, University of California, 1993.
[15]
J. Philbin. Scalable Object Retrieval in Very Large Image Collections. PhD thesis, Univ. of Oxford, 2010.
[16]
J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Lost in quantization: Improving particular object retrieval in large scale image databases. In CVPR}, pages 1--8, Anchorage, USA, june 2008. IEEE.
[17]
G. F. Pineda, H. Koga, and T. Watanabe. Object discovery by clustering correlated visual word sets. In ICPR, pages 750--753, Washington, USA, 2010. IEEE.
[18]
T. Tuytelaars, C. H. Lampert, M. B. Blaschko, and W. Buntine. Unsupervised object discovery: A comparison. IJCV, 88:284--302, June 2010.
[19]
T. Weyand and B. Leibe. Discovering favorite views of popular places with iconoid shift. In ICCV, pages 1132--1139. IEEE, Nov. 2011.
[20]
W. Zhao, X. Wu, and C.-W. Ngo. On the annotation of web videos by efficient near-duplicate search. IEEE Transactions on Multimedia, 12(5):448--461, 2010.

Cited By

View all
  • (2023)Reconstruction Guided Meta-Learning for Few Shot Open Set RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.332073145:12(15394-15405)Online publication date: Dec-2023
  • (2022)Variational Multi-Prototype Encoder for Object Recognition Using Multiple Prototype ImagesIEEE Access10.1109/ACCESS.2022.315185610(19586-19598)Online publication date: 2022
  • (2021)FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling NetworkProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475289(4670-4679)Online publication date: 17-Oct-2021
  • Show More Cited By

Index Terms

  1. Scalable mining of small visual objects

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '12: Proceedings of the 20th ACM international conference on Multimedia
    October 2012
    1584 pages
    ISBN:9781450310895
    DOI:10.1145/2393347
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 October 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. LSH
    2. RMMH
    3. computer vision
    4. discovery
    5. hashing
    6. mining
    7. scalable
    8. small objects
    9. weak geometry

    Qualifiers

    • Research-article

    Conference

    MM '12
    Sponsor:
    MM '12: ACM Multimedia Conference
    October 29 - November 2, 2012
    Nara, Japan

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Reconstruction Guided Meta-Learning for Few Shot Open Set RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.332073145:12(15394-15405)Online publication date: Dec-2023
    • (2022)Variational Multi-Prototype Encoder for Object Recognition Using Multiple Prototype ImagesIEEE Access10.1109/ACCESS.2022.315185610(19586-19598)Online publication date: 2022
    • (2021)FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling NetworkProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475289(4670-4679)Online publication date: 17-Oct-2021
    • (2020)U15-Logos: Unconstrained Logo Dataset with Evaluation by Deep learning Methods2020 International Conference on Multimedia Analysis and Pattern Recognition (MAPR)10.1109/MAPR49794.2020.9237769(1-6)Online publication date: Oct-2020
    • (2019)Variational Prototyping-Encoder: One-Shot Learning With Prototypical Images2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR.2019.00969(9454-9462)Online publication date: Jun-2019
    • (2018)GPU deformable part model for object recognitionJournal of Real-Time Image Processing10.1007/s11554-014-0447-514:2(279-291)Online publication date: 1-Feb-2018
    • (2018)Visual instance mining from the graph perspectiveMultimedia Systems10.1007/s00530-016-0533-624:2(147-162)Online publication date: 1-Mar-2018
    • (2017)Instance search retrospective with focus on TRECVIDInternational Journal of Multimedia Information Retrieval10.1007/s13735-017-0121-36:1(1-29)Online publication date: 22-Feb-2017
    • (2017)Content-based unsupervised segmentation of recurrent TV programs using grammatical inferenceMultimedia Tools and Applications10.1007/s11042-017-4816-576:21(22569-22597)Online publication date: 1-Nov-2017
    • (2016)Query-Adaptive Small Object Search Using Object Proposals and Shape-Aware DescriptorsIEEE Transactions on Multimedia10.1109/TMM.2016.253260118:4(726-737)Online publication date: Apr-2016
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media