skip to main content
10.1145/2024724.2024949acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article

EFFEX: an embedded processor for computer vision based feature extraction

Published:05 June 2011Publication History

ABSTRACT

The deployment of computer vision algorithms in mobile applications is growing at a rapid pace. A primary component of the computer vision software pipeline is feature extraction, which identifies and encodes relevant image features. We present an embedded heterogeneous multicore design named EFFEX that incorporates novel functional units and memory architecture support, making it capable of increasing mobile vision performance while balancing power and area. We demonstrate this architecture running three common feature extraction algorithms, and show that it is capable of providing significant speedups at low cost. Our simulations show a speedup of as much as 14x for feature extraction with a decrease in energy of 40x for memory accesses.

References

  1. ARM. Cortex-A5 Processor, 2010. http://www.arm.com/products/processors/cortex-a/.Google ScholarGoogle Scholar
  2. ARM. Cortex-A8 Processor, 2010. http://www.arm.com/products/processors/cortex-a/.Google ScholarGoogle Scholar
  3. D. G. R. Bradski and A. Kaehler. Learning OpenCV. O'Reilly Media, Inc., 2008.Google ScholarGoogle Scholar
  4. D. Brooks, V. Tiwari, and M. Martonosi. Wattch: a framework for architectural-level power analysis and optimizations. In ISCA, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. L. Chang and J. Hernández-Palancar. A Hardware Architecture for SIFT Candidate Keypoints Detection. In CIARP, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Google. Google Goggles For Android, September 2010. http://www.google.com/mobile/goggles/#text.Google ScholarGoogle Scholar
  8. S. Goyal. Object Detection using OpenCV II - Calculation of HoG Features, October 2009. http://smsoftdev-solutions.blogspot.com/2009/10/object-detection-using-opencv-ii.html.Google ScholarGoogle Scholar
  9. M. Gschwind, H. Hofstee, B. Flachs, M. Hopkin, Y. Watanabe, and T. Yamazaki. Synergistic Processing in Cell's Multicore Architecture. MICRO, 26(2), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. E. Guller, J. P. Singh, and A. Gputa. Parallel Computer Architecture: A Hardware/Software Approach. Morgan Kaufmann, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Hess. SIFT feature detector for OpenCV, February 2009. http://web.engr.oregonstate.edu/~hess.Google ScholarGoogle Scholar
  12. S. Jain, V. Erraguntla, S. Vangal, Y. Hoskote, N. Borkar, T. Mandepudi, and V. Karthik. A 90mW/GFlop 3.4GHz Reconfigurable Fused/Continuous Multiply-Accumulator for Floating-Point and Integer Operands in 65nm. In VLSID, Jan. 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Kadota, H. Sugano, M. Hiromoto, H. Ochi, R. Miyamoto, and Y. Nakamura. Hardware Architecture for HOG Feature Extraction. In IIH-MSP, sep. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Layar. Layar Reality Browser, September 2010. http://www.layar.com/.Google ScholarGoogle Scholar
  15. D. G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. I. T. Ltd. SGX Graphics IP Core Family, 2010.Google ScholarGoogle Scholar
  17. J. E. Miller, H. Kasture, G. Kurian, C. Gruenwald III, N. Beckmann, C. Celio, J. Eastep, and A. Agarwal. Graphite: A Distributed Parallel Simulator for Multicores. In HPCA, January 2010.Google ScholarGoogle ScholarCross RefCross Ref
  18. A. Prengler and K. Adi. A Reconfigurable SIMD-MIMD Processor Architecture for Embedded Vision Processing Applications. SAE World Congress, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  19. J. Qiu, Y. Lu, T. Huang, and T. Ikenaga. An FPGA-Based Real-Time Hardware Accelerator for Orientation Calculation Part in SIFT. IIH-MSP, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. E. Rosten and T. Drummond. Fusing points and lines for high performance tracking. In ICCV, volume 2, October 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. E. Rosten and T. Drummond. Machine learning for high-speed corner detection. In ECCV, May 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. B. Silpa, A. Patney, T. Krishna, P. Panda, and G. Visweswaran. Texture filter memory; a power-efficient and scalable texture memory architecture for mobile graphics processors. In ICCAD, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Skribanowitz, T. Knobloch, J. Schreiter, and A. Konig. VLSI implementation of an application-specific vision chip for overtake monitoring, real time eye tracking, and automated visual inspection. In MicroNeuro '99, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. P. Viola and M. Jones. Robust real-time object detection. IJCV, # 2002.Google ScholarGoogle Scholar
  25. J. Wortham. Customers Angered as iPhones Overload ATT. New York Times, September 2009.Google ScholarGoogle Scholar
  26. C. Wu. SIFTGPU, September 2010. http://www.cs.unc.edu/~ccwu/siftgpu/.Google ScholarGoogle Scholar

Index Terms

  1. EFFEX: an embedded processor for computer vision based feature extraction

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          DAC '11: Proceedings of the 48th Design Automation Conference
          June 2011
          1055 pages
          ISBN:9781450306362
          DOI:10.1145/2024724

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 5 June 2011

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate1,770of5,499submissions,32%

          Upcoming Conference

          DAC '24
          61st ACM/IEEE Design Automation Conference
          June 23 - 27, 2024
          San Francisco , CA , USA

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader