skip to main content
10.1145/3078971.3078990acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Improving Small Object Proposals for Company Logo Detection

Published:06 June 2017Publication History

ABSTRACT

Many modern approaches for object detection are two-staged pipelines. The first stage identifies regions of interest which are then classified in the second stage. Faster R-CNN is such an approach for object detection which combines both stages into a single pipeline. In this paper we apply Faster R-CNN to the task of company logo detection. Motivated by its weak performance on small object instances, we examine in detail both the proposal and the classification stage with respect to a wide range of object sizes. We investigate the influence of feature map resolution on the performance of those stages.

Based on theoretical considerations, we introduce an improved scheme for generating anchor proposals and propose a modification to Faster R-CNN which leverages higher-resolution feature maps for small objects. We evaluate our approach on the FlickrLogos dataset improving the RPN performance from 0.52 to 0.71 (MABO) and the detection performance from 0.52 to $0.67$ (mAP).

References

  1. S. Bell, C. L. Zitnick, K. Bala, and R. Girshick. 2016. Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2874--2883.Google ScholarGoogle Scholar
  2. S. Bianco, M. Buzzelli, D. Mazzini, and R. Schettini. 2017. Deep Learning for Logo Recognition. CoRR abs/1701.02620 (2017). http://arxiv.org/abs/1701.02620 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Bernhard E. Boser, Isabelle M. Guyon, and Vladimir N. Vapnik. 1992. A Training Algorithm for Optimal Margin Classifiers. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory (COLT '92). ACM, New York, NY, USA, 144--152. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Deng, W. Dong, R. Socher, L. J. Li, Kai Li, and Li Fei-Fei. 2009. ImageNet: A large- scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition. 248--255.Google ScholarGoogle ScholarCross RefCross Ref
  5. C. Eggert, A. Winschel, D. Zecha, and R. Lienhart. 2016. Saliency-guided Selective Magnification for Company Logo Detection. In 2016 International Conference on Pattern Recognition (ICPR).Google ScholarGoogle Scholar
  6. M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. 2010. The Pascal Visual Object Classes (VOC) Challenge. International Journal of Computer Vision 88, 2 (June 2010), 303--338. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Girshick. 2015. Fast R-CNN. In IEEE International Conference on Computer Vision. 1440--1448. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Girshick, J. Donahue, T. Darrell, and J. Malik. 2014. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. 580--587. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. B. Hariharan, P. Arbelaez, R. Girshick, and J. Malik. 2016. Object Instance Segmen- tation and Fine-Grained Localization using Hypercolumns. IEEE Transactions on Pattern Analysis and Machine Intelligence PP, 99 (2016). Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Ioffe and C. Szegedy. 2015. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. In International Conference on Machine Learning. 448--465. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, and Serge Belongie. 2016. Feature Pyramid Networks for Object Detection. arXiv preprint arXiv:1612.03144 (2016).Google ScholarGoogle Scholar
  12. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C. Berg. 2016. SSD: Single Shot MultiBox Detector. (2016). http://arxiv.org/abs/1512.02325Google ScholarGoogle Scholar
  13. G. Oliveira, X. Frazao, A. Pimentel, and B. Ribeiro. 2016. Automatic graphic logo detection via Fast Region-based Convolutional Networks. In 2016 International Joint Conference on Neural Networks (IJCNN). 985--991.Google ScholarGoogle Scholar
  14. J. Redmon, S. Divvala, R. Girshick, and A. Farhadi. 2016. You Only Look Once: Unified, Real-Time Object Detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 779--788.Google ScholarGoogle Scholar
  15. S. Ren, K. He, R. Girshick, and J. Sun. 2016. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence (2016). Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Romberg, L. G. Pueyo, Lienhart R., and R. van Zwol. 2011. Scalable logo recognition in real-world images. In ACM International Conference on Multimedia Retrieval (ICMR '11). ACM, Article 25, 8 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. E. Shelhamer, J. Long, and T. Darrell. 2016. Fully Convolutional Networks for Semantic Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence PP, 99 (2016). Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. K. Simonyan and A. Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations.Google ScholarGoogle Scholar
  19. J. R. R. Uijlings, K. E. A. Sande, T. Gevers, and A. W. M. Smeulders. 2013. Selective Search for Object Recognition. International Journal of Computer Vision 104, 2 (2013), 154--171. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. L. Zitnick and P. Dollár. 2014. Edge Boxes: Locating Object Proposals from Edges. In ECCV. European Conference on Computer Vision.Google ScholarGoogle Scholar

Index Terms

  1. Improving Small Object Proposals for Company Logo Detection

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ICMR '17: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval
          June 2017
          524 pages
          ISBN:9781450347013
          DOI:10.1145/3078971
          • General Chairs:
          • Bogdan Ionescu,
          • Nicu Sebe,
          • Program Chairs:
          • Jiashi Feng,
          • Martha Larson,
          • Rainer Lienhart,
          • Cees Snoek

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 6 June 2017

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          ICMR '17 Paper Acceptance Rate33of95submissions,35%Overall Acceptance Rate254of830submissions,31%

          Upcoming Conference

          ICMR '24
          International Conference on Multimedia Retrieval
          June 10 - 14, 2024
          Phuket , Thailand

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader