Abstract
Automated Vehicle Classification (AVC) based on vision sensors has received active attention from researchers, due to heightened security concerns in Intelligent Transportation Systems. In this work, we propose a categorization of AVC studies based on the granularity of classification, namely Vehicle Type Recognition, Vehicle Make Recognition, and Vehicle Make and Model Recognition. For each category of AVC systems, we present a comprehensive review and comparison of features extraction, global representation, and classification techniques. We also present the accuracy and speed-related performance metrics and discuss how they can be used to compare and evaluate different AVC works. The various datasets proposed over the years for AVC are also compared in light of the real-world challenges they represent, and those they do not. The major challenges involved in each category of AVC systems are presented, highlighting open problems in this area of research. Finally, we conclude by providing future directions of research in this area, paving the way toward efficient large-scale AVC systems. This survey shall help researchers interested in the area to analyze works completed so far in each category of AVC, focusing on techniques proposed for each module, and to chalk out strategies to enhance state-of-the-art technology.
- H. M. Abdelbaki, K. Hussain, and E. Gelenbe. 2001. A laser intensity image based automatic vehicle classification system. In Proceedings of the IEEE Conference on Intelligent Transportation Systems, 2001. 460--465. DOI:http://dx.doi.org/10.1109/ITSC.2001.948701 Google ScholarCross Ref
- M. AbdelMaseeh, I. Badreldin, M. F. Abdelkader, and M. El Saban. 2012. Car make and model recognition combining global and local cues. In Proceedings of the 21st International Conference on Pattern Recognition (ICPR’12). 910--913.Google Scholar
- A. M. Al-Ghaili, S. Mashohor, A. R. Ramli, and A. Ismail. 2013. Vertical-edge-based car-license-plate detection method. IEEE Trans. Vehic. Technol. 62, 1 (Jan 2013), 26--38. Google ScholarCross Ref
- R. P. Avery, Y. Wang, and G. Scott Rutherford. 2004. Length-based vehicle classification using images from uncalibrated video cameras. In Proceedings of the 7th International IEEE Conference on Intelligent Transportation Systems, 2004. 737--742. DOI:http://dx.doi.org/10.1109/ITSC.2004.1398994 Google ScholarCross Ref
- Pawel Badura and Maria Skotnicka. 2014. Automatic car make recognition in low-quality images. In Information Technologies in Biomedicine, Volume 3, Ewa Pietka, Jacek Kawa, and Wojciech Wieclawek (Eds.). Advances in Intelligent Systems and Computing, Vol. 283. Springer International Publishing, 235--246. DOI:http://dx.doi.org/10.1007/978-3-319-06593-9_21 Google ScholarCross Ref
- D. H. Ballard. 1981. Generalizing the hough transform to detect arbitrary shapes. Pattern Recogn. 13, 2 (1981), 111--122. DOI:http://dx.doi.org/10.1016/0031-3203(81)90009-1 Google ScholarCross Ref
- Remigiusz Baran, Andrzej Glowacz, and Andrzej Matiolanski. 2013. The efficient real- and non-real-time make and model recognition of cars. Multimedia Tools and Applications (2013), 1--20.Google Scholar
- P. N. Belhumeur, J. P. Hespanha, and D. Kriegman. 1997. Eigenfaces vs. fisherfaces: Recognition using class specific linear projection. IEEE Trans. Pattern Anal. Mach. Intell. 19, 7 (Jul 1997), 711--720. DOI:http://dx.doi.org/10.1109/34.598228 Google ScholarDigital Library
- S. Belongie, J. Malik, and J. Puzicha. 2002. Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 24, 4 (Apr 2002), 509--522. DOI:http://dx.doi.org/10.1109/34.993558 Google ScholarDigital Library
- Noppakun Boonsim and Simant Prakoonwit. 2016. Car make and model recognition under limited lighting conditions at night. Pattern Anal. Appl. (2016), 1--13. Google ScholarDigital Library
- Anna Bosch, Andrew Zisserman, and Xavier Muñoz. 2006. Scene classification via pLSA. In Proceedings of the 9th European Conference on Computer Vision—Volume Part IV (ECCV’06). Springer-Verlag, Berlin, 517--530. DOI:http://dx.doi.org/10.1007/11744085_40 Google ScholarDigital Library
- Anna Bosch, Andrew Zisserman, and Xavier Munoz. 2007. Representing shape with a spatial pyramid kernel. In Proceedings of the 6th ACM International Conference on Image and Video Retrieval (CIVR’07). ACM, New York, NY, 401--408. DOI:http://dx.doi.org/10.1145/1282280.1282340 Google ScholarDigital Library
- Norbert Buch, James Orwell, and Sergio A. Velastin. 2009. 3D extended histogram of oriented gradients (3DHOG) for classification of road users in urban scenes. In Proceedings of the British Machine Vision Conference, London, UK, September 7--10. 1--11. DOI:http://dx.doi.org/10.5244/C.23.15 Google ScholarCross Ref
- Jiewei Cao, Zi Huang, Peng Wang, Chao Li, Xiaoshuai Sun, and Heng Tao Shen. 2016. Quartet-net learning for visual instance retrieval. In Proceedings of the 2016 ACM on Multimedia Conference (MM’16). ACM, New York, NY, 456--460. DOI:http://dx.doi.org/10.1145/2964284.2967262 Google ScholarDigital Library
- Rich Caruana. 1997. Multitask learning. Mach. Learn. 28, 1 (July 1997), 41--75. DOI:http://dx.doi.org/10.1023/A:1007379606734 Google ScholarDigital Library
- Yuning Chai, V. Lempitsky, and A. Zisserman. 2011. BiCoS: A bi-level co-segmentation method for image classification. In Proceedings of the 2011 International Conference on Computer Vision. 2579--2586. DOI:http://dx.doi.org/10.1109/ICCV.2011.6126546 Google ScholarDigital Library
- Y. Chai, V. Lempitsky, and A. Zisserman. 2013. Symbiotic segmentation and part localization for fine-grained categorization. In Proceedings of the 2013 IEEE International Conference on Computer Vision. 321--328. DOI:http://dx.doi.org/10.1109/ICCV.2013.47 Google ScholarDigital Library
- Yuning Chai, Esa Rahtu, Victor Lempitsky, Luc Van Gool, and Andrew Zisserman. 2012. TriCoS: A Tri-level Class-Discriminative Co-segmentation Method for Image Classification. Springer, Berlin, 794--807. DOI:http://dx.doi.org/10.1007/978-3-642-33718-5_57 Google ScholarDigital Library
- W. C. Chang and C. W. Cho. 2008. Real-time side vehicle tracking using parts-based boosting. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (SMC’08). 3370--3375. 1062-922XDOI:http://dx.doi.org/10.1109/ICSMC.2008.4811818 Google ScholarCross Ref
- W. C. Chang and C. W. Cho. 2010. Online boosting for vehicle detection. IEEE Trans. Syst. Man. Cybernet., Part B (Cybernet.) 40, 3 (June 2010), 892--902. DOI:http://dx.doi.org/10.1109/TSMCB.2009.2032527 Google ScholarDigital Library
- A. Chayeb, N. Ouadah, Z. Tobal, M. Lakrouf, and O. Azouaoui. 2014. HOG based multi-object detection for urban navigation. In Proceedings of the 2014 IEEE 17th International Conference on Intelligent Transportation Systems (ITSC’14). 2962--2967. DOI:http://dx.doi.org/10.1109/ITSC.2014.6958165 Google ScholarCross Ref
- G. Chen, J. Yang, H. Jin, J. Brandt, E. Shechtman, A. Agarwala, and T. X. Han. 2014. Large-scale visual font recognition. In Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. 3598--3605. DOI:http://dx.doi.org/10.1109/CVPR.2014.460 Google ScholarDigital Library
- Li-Chih Chen, Jun-Wei Hsieh, Yilin Yan, and Duan-Yu Chen. 2013. Vehicle make and model recognition using sparse representation and symmetrical SURFs. In Proceedings of the 16th International IEEE Conference on Intelligent Transportation Systems. 1143--1148. DOI:http://dx.doi.org/10.1109/ITSC.2013.6728386 Google ScholarCross Ref
- Li-Chih Chen, Jun-Wei Hsieh, Yilin Yan, and Duan-Yu Chen. 2015. Vehicle make and model recognition using sparse representation and symmetrical SURFs. Pattern Recogn. 48, 6 (2015), 1979--1998. Google ScholarDigital Library
- Z. Chen and T. Ellis. 2011. Self-adaptive gaussian mixture model for urban traffic monitoring system. In Proceedings of the 2011 IEEE International Conference on Computer Vision Workshops (ICCV’11). 1769--1776. DOI:http://dx.doi.org/10.1109/ICCVW.2011.6130463 Google ScholarCross Ref
- Zezhi Chen, T. Ellis, and S. A. Velastin. 2011. Vehicle type categorization: A comparison of classification schemes. In Proceedings of the 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC’11). 74--79. Google ScholarCross Ref
- Zezhi Chen, T. Ellis, and S. A. Velastin. 2012a. Vehicle detection, tracking and classification in urban traffic. In Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems. 951--956. DOI:http://dx.doi.org/10.1109/ITSC.2012.6338852Google Scholar
- Z. Chen, T. Ellis, and S. A. Velastin. 2012b. Vehicle detection, tracking and classification in urban traffic. In Proceedings of the 2012 15th International IEEE Conference on Intelligent Transportation Systems (ITSC’12). 951--956. DOI:http://dx.doi.org/10.1109/ITSC.2012.6338852 Google ScholarCross Ref
- Zezhi Chen, Nick Pears, Michael Freeman, and Jim Austin. 2009. Advances in Visual Computing: Proceedings of the 5th International Symposium (ISVC’09), Las Vegas, NV, November 30--December 2, 2009. Proceedings, Part II. Springer, Berlin, Chapter Background Subtraction in Video Using Recursive Mixture Models, Spatio-Temporal Filtering and Shadow Removal, 1141--1150. DOI:http://dx.doi.org/10.1007/978-3-642-10520-3_109 Google ScholarDigital Library
- Minkyu Cheon, Wonju Lee, Changyong Yoon, and Mignon Park. 2012. Vision-based vehicle detection system with consideration of the detecting location. IEEE Trans. Intell. Transport. Syst. 13, 3 (Sept 2012), 1243--1252. DOI:http://dx.doi.org/10.1109/TITS.2012.2188630 Google ScholarDigital Library
- Xavier Clady, Pablo Negri, Maurice Milgram, and Raphael Poulenard. 2008. Multi-class vehicle type recognition system. In Artificial Neural Networks in Pattern Recognition, Lionel Prevost, Simone Marinai, and Friedhelm Schwenker (Eds.). Lecture Notes in Computer Science, Vol. 5064. Springer Berlin, 228--239. Retrieved from http://dx.doi.org/10.1007/978-3-540-69939-2_22 Google ScholarDigital Library
- Gabriella Csurka, Christopher R. Dance, Lixin Fan, Jutta Willamowski, and Cedric Bray. 2004. Visual categorization with bags of keypoints. In Proceedings of the Workshop on Statistical Learning in Computer Vision (ECCV’04). 1--22.Google Scholar
- N. Dalal and B. Triggs. 2005. Histograms of oriented gradients for human detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Vol. 1. 886--893. DOI:http://dx.doi.org/10.1109/CVPR.2005.177 Google ScholarDigital Library
- F. M. de S. Matos and R. M. C. R. de Souza. 2012. An image vehicle classification method based on edge and PCA applied to blocks. In Proceedings of the 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC’12). 1688--1693. DOI:http://dx.doi.org/10.1109/ICSMC.2012.6377980 Google ScholarCross Ref
- Fabrzia Medeiros de S. Matos and Renata Maria Cardoso R. de Souza. 2013. Hierarchical classification of vehicle images using NN with conditional adaptive distance. In Neural Information Processing, Minho Lee, Akira Hirose, Zeng-Guang Hou, and Rhee Man Kil (Eds.). Lecture Notes in Computer Science, Vol. 8227. Springer, Berlin, 745--752. DOI:http://dx.doi.org/10.1007/978-3-642-42042-9_92 Google ScholarDigital Library
- Louka Dlagnekov. 2005. Video-based Car Surveillance: License Plate, Make, and Model recognition. Master’s thesis. University of California, San Diego.Google Scholar
- Zhen Dong, Mingtao Pei, Yang He, Ting Liu, Yanmei Dong, and Yunde Jia. 2014. Vehicle type classification using unsupervised convolutional neural network. In Proceedings of the 22nd International Conference on Pattern Recognition (ICPR’14). 172--177. Google ScholarDigital Library
- Z. Dong, Y. Wu, M. Pei, and Y. Jia. 2015. Vehicle type classification using a semisupervised convolutional neural network. IEEE Trans. Intell. Transport. Syst. 99 (2015), 1--10. Google ScholarDigital Library
- Richard O. Duda, Peer E. Hart, and David G. Stork. 2000. Pattern Classification, 2nd ed. Wiley-Interscience, New York, NY.Google ScholarDigital Library
- A. Farhadi, D. Hoiem, D. Forsyth, and I. Endres. 2009. Describing objects by their attributes. Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR’09). 1778--1785. DOI:http://dx.doi.org/doi.ieeecomputersociety.org/10.1109/CVPRW.2009.5206772Google Scholar
- P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan. 2010. Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32, 9 (Sept 2010), 1627--1645. DOI:http://dx.doi.org/10.1109/TPAMI.2009.167 Google ScholarDigital Library
- C. Garcia and M. Delakis. 2004. Convolutional face finder: A neural architecture for fast and robust face detection. IEEE Trans. Pattern Anal. Mach. Intell. 26, 11 (Nov 2004), 1408--1423. DOI:http://dx.doi.org/10.1109/TPAMI.2004.97 Google ScholarDigital Library
- R. Girshick, J. Donahue, T. Darrell, and J. Malik. 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. 580--587. DOI:http://dx.doi.org/10.1109/CVPR.2014.81 Google ScholarDigital Library
- R. C. Gonzalez and R. E. Woods. 2002. Digital Image Processing, 2nd ed. Pearsons, Singapore.Google Scholar
- W. Eric L. Grimson. 1990. Object Recognition by Computer: The Role of Geometric Constraints. MIT Press, Cambridge, MA.Google Scholar
- Matthieu Guillaumin, Daniel Küttel, and Vittorio Ferrari. 2014. ImageNet auto-annotation with segmentation propagation. Int. J. Comput. Vision 110, 3 (2014), 328--348. DOI:http://dx.doi.org/10.1007/s11263-014-0713-9 Google ScholarDigital Library
- S. Gupte, O. Masoud, R. F. K. Martin, and N. P. Papanikolopoulos. 2002. Detection and classification of vehicles. IEEE Trans. Intell. Transport. Syst. 3, 1 (Mar 2002), 37--47. DOI:http://dx.doi.org/10.1109/6979.994794 Google ScholarDigital Library
- S. Gupte, O. Masoud, and N. P. Papanikolopoulos. 2000. Vision-based vehicle classification. In Proceedings of the IEEE 2000 Conference on Intelligent Transportation Systems. 46--51. DOI:http://dx.doi.org/10.1109/ITSC.2000.881016 Google ScholarCross Ref
- Lykele Hazelhoff, Ivo Creusen, Dennis van de Wouw, and Peter H. N. de With. 2012. Large-scale classification of traffic signs under real-world conditions. In Proceedings of the SPIE 2012 Conference on Multimedia on Mobile Devices and Multimedia Content Access: Algorithms and Systems, Vol. 8304. Google ScholarCross Ref
- Dongmei He, Congyan Lang, Songhe Feng, Xuetao Du, and Chen Zhang. 2015a. Vehicle detection and classification based on convolutional neural network. In Proceedings of the 7th International Conference on Internet Multimedia Computing and Service (ICIMCS’15). ACM, New York, NY, Article 3, 5 pages. DOI:http://dx.doi.org/10.1145/2808492.2808495 Google ScholarDigital Library
- H. He, Z. Shao, and J. Tan. 2015b. Recognition of car makes and models from a single traffic-camera image. IEEE Trans. Intell.t Transport. Syst. 99 (2015), 1--11. DOI:http://dx.doi.org/10.1109/TITS.2015.2437998 Google ScholarDigital Library
- Berthold K. P. Horn. 1986. Robot Vision. MIT Press.Google ScholarDigital Library
- Jun-Wei Hsieh, Li-Chih Chen, and Duan-Yu Chen. 2014. Symmetrical SURF and its applications to vehicle detection and vehicle make and model recognition. IEEE Trans. Intell. Transport. Syst. 15, 1 (Feb 2014), 6--20. DOI:http://dx.doi.org/10.1109/TITS.2013.2294646 Google ScholarDigital Library
- Gee-Sern Hsu, Jiun-Chang Chen, and Yu-Zu Chung. 2013. Application-oriented license plate recognition. IEEE Trans. Vehic. Technol. 62, 2 (Feb 2013), 552--561. DOI:http://dx.doi.org/10.1109/TVT.2012.2226218 Google ScholarCross Ref
- Chung-Lin Huang and Wen-Chieh Liao. 2004. A vision-based vehicle identification system. In Proceedings of the 17th International Conference on Pattern Recognition (ICPR’04). Vol. 4, 364--367. DOI:http://dx.doi.org/10.1109/ICPR.2004.1333778 Google ScholarCross Ref
- Y. Huang, R. Wu, Y. Sun, W. Wang, and X. Ding. 2015. Vehicle logo recognition system based on convolutional neural networks with a pretraining strategy. IEEE Trans. Intell. Transport. Syst. 16, 4 (Aug 2015), 1951--1960. DOI:http://dx.doi.org/10.1109/TITS.2014.2387069 Google ScholarDigital Library
- Brody Huval, Tao Wang, Sameep Tandon, Jeff Kiske, Will Song, Joel Pazhayampallil, Mykhaylo Andriluka, Pranav Rajpurkar, Toki Migimatsu, Royce Cheng-Yue, Fernando Mujica, Adam Coates, and Andrew Y. Ng. 2015. An empirical evaluation of deep learning on highway driving. CoRR abs/1504.01716 (2015). http://arxiv.org/abs/1504.01716Google Scholar
- Asif Iqbal, Carlos Busso, and Nicholas R. Gans. 2015. Adjacent vehicle collision warning system using image sensor and inertial measurement unit. In Proceedings of the 2015 ACM on International Conference on Multimodal Interaction (ICMI’15). ACM, New York, NY, 291--298. DOI:http://dx.doi.org/10.1145/2818346.2820741 Google ScholarDigital Library
- ITS Canada. 2012a. Intelligent Transportation (2012). Retrieved from https://www.itscanada.ca/it/index.html. Last accessed on 11 August 2015.Google Scholar
- ITS Canada. 2012b. ITS in Soceity (2012). Retrieved from https://www.itscanada.ca/it/society/index.html. Last accessed on 11 August 2015.Google Scholar
- D. M. Jang and M. Turk. 2011. Car-rec: A real time car recognition system. In Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV’11). 599--605. DOI:http://dx.doi.org/10.1109/WACV.2011.5711559 Google ScholarDigital Library
- Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross B. Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Caffe: Convolutional architecture for fast feature embedding. CoRR abs/1408.5093 (2014). Retrieved from http://arxiv.org/abs/1408.5093.Google ScholarDigital Library
- M. Kafai and B. Bhanu. 2012. Dynamic bayesian networks for vehicle classification in video. IEEE Trans. Industr. Info. 8, 1 (Feb 2012), 100--109. Google ScholarCross Ref
- Jisu Kim, Jeonghyun Baek, and Euntai Kim. 2015. A novel on-road vehicle detection method using HOG. IEEE Trans. Intell. Transport. Syst. 16, 6 (Dec 2015), 3414--3429. DOI:http://dx.doi.org/10.1109/TITS.2015.2465296 Google ScholarDigital Library
- J. Krause, H. Jin, J. Yang, and L. Fei-Fei. 2015. Fine-grained recognition without part annotations. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’15). 5546--5555. DOI:http://dx.doi.org/10.1109/CVPR.2015.7299194 Google ScholarCross Ref
- J. Krause, M. Stark, J. Deng, and L. Fei-Fei. 2013. 3D object representations for fine-grained categorization. In Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops. 554--561. DOI:http://dx.doi.org/10.1109/ICCVW.2013.77 Google ScholarDigital Library
- Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems.Google ScholarDigital Library
- C. H. Lampert, H. Nickisch, and S. Harmeling. 2009. Learning to detect unseen object classes by between-class attribute transfer. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. 951--958. DOI:http://dx.doi.org/10.1109/CVPR.2009.5206594 Google ScholarCross Ref
- S. Lazebnik, C. Schmid, and J. Ponce. 2006. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Vol. 2. 2169--2178. DOI:http://dx.doi.org/10.1109/CVPR.2006.68 Google ScholarDigital Library
- HyoJong Lee. 2006. Neural network approach to identify model of vehicles. In Advances in Neural Networks—ISNN 2006, Jun Wang, Zhang Yi, JacekM. Zurada, Bao-Liang Lu, and Hujun Yin (Eds.). Lecture Notes in Computer Science, Vol. 3973. Springer, Berlin, 66--72. DOI:http://dx.doi.org/10.1007/11760191_10 Google ScholarDigital Library
- K. Levi and Y. Weiss. 2004. Learning object detection from a small number of examples: The importance of good features. In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’04). Vol. 2. DOI:http://dx.doi.org/10.1109/CVPR.2004.1315144 Google ScholarCross Ref
- S. Z. Li and A. K. Jain (Eds.). 2005. Handbook of Face Recognition. Springer-Verlag, New York.Google Scholar
- Xudong Li, Mao Ye, Min Fu, Pei Xu, and Tao Li. 2015. Domain adaption of vehicle detector based on convolutional neural networks. Int. J. Control Automat. Syst. 13, 4 (2015), 1020--1031. DOI:http://dx.doi.org/10.1007/s12555-014-0119-z Google ScholarCross Ref
- D. F. Llorca, R. Arroyo, and M. A. Sotelo. 2013. Vehicle logo recognition in traffic images using HOG features and SVM. In Proceedings of the 2013 16th International IEEE Conference on Intelligent Transportation Systems (ITSC’13). 2229--2234. DOI:http://dx.doi.org/10.1109/ITSC.2013.6728559 Google ScholarCross Ref
- D. F. Llorca, D. Colas, I. G. Daza, I. Parra, and M. A. Sotelo. 2014. Vehicle model recognition using geometry and appearance of car emblems from rear view images. In Proceedings of the 17th IEEE International Conference on Intelligent Transportation Systems. 3094--3099. DOI:http://dx.doi.org/10.1109/ITSC.2014.6958187 Google ScholarCross Ref
- D. G. Lowe. 1999. Object recognition from local scale-invariant features. In Proceedings of the 7th IEEE International Conference on Computer Vision, Vol. 2. 1150--1157. DOI:http://dx.doi.org/10.1109/ICCV.1999.790410 Google ScholarCross Ref
- M. Fraz, E. A. Edirisinghe, and M. S. Sarfraz. 2014. Mid-level-representation based lexicon for vehicle make and model recognition. In Proceedings of the 22nd International Conference on Pattern Recognition (ICPR). 393--398. DOI:http://dx.doi.org/10.1109/ICPR.2014.76 Google ScholarDigital Library
- Xiaoxu Ma and W. E. L. Grimson. 2005. Edge-based rich representation for vehicle classification. In Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV’05). Vol. 2, 1185--1192. DOI:http://dx.doi.org/10.1109/ICCV.2005.80 Google ScholarDigital Library
- A. Mammeri, E.-H. Khiari, and A. Boukerche. 2014a. Road-sign text recognition architecture for intelligent transportation systems. In Proceedings of the 80th IEEE Vehicular Technology Conference (VTC’14). 1--5. Google ScholarCross Ref
- A. Mammeri, Depu Zhou, A. Boukerche, and M. Almulla. 2014b. An efficient animal detection system for smart cars using cascaded classifiers. In Proceedings of the IEEE International Conference on Communications (ICC’14). 1854--1859. Google ScholarCross Ref
- A. M. Martinez. 2002. Recognizing imprecisely localized, partially occluded, and expression variant faces from a single sample per class. IEEE Trans. Pattern Anal. Mach. Intell. 24, 6 (Jun 2002), 748--763. DOI:http://dx.doi.org/10.1109/TPAMI.2002.1008382 Google ScholarDigital Library
- N. C. Mithun, N. U. Rashid, and S. M. M. Rahman. 2012. Detection and classification of vehicles from video using multiple time-spatial images. IEEE Trans. Intell. Transport. Syst. 13, 3 (Sept 2012), 1215--1225. DOI:http://dx.doi.org/10.1109/TITS.2012.2186128 Google ScholarDigital Library
- B. T. Morris and M. M. Trivedi. 2008. Learning, modeling, and classification of vehicle track patterns from live video. IEEE Trans. Intell. Transport. Syst. 9, 3 (Sept 2008), 425--437. DOI:http://dx.doi.org/10.1109/TITS.2008.922970 Google ScholarDigital Library
- A. Mukhtar, Likun Xia, and Tong Boon Tang. 2015. Vehicle detection techniques for collision avoidance systems: A review. IEEE Trans. Intell. Transport. Syst. 16, 5 (Oct 2015), 2318--2338. DOI:http://dx.doi.org/10.1109/TITS.2015.2409109 Google ScholarDigital Library
- D. T. Munroe and M. G. Madden. 2005. Multi-class and single-class classification approaches to vehicle model recognition from images. In Proceedings of the 16th Irish Conference on Artificial Intelligence and Cognitive Science. 93--104.Google Scholar
- T. Ojala, M. Pietikainen, and T. Maenpaa. 2002. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24, 7 (Jul 2002), 971--987. DOI:http://dx.doi.org/10.1109/TPAMI.2002.1017623 Google ScholarDigital Library
- Yuanchang Ou, Huicheng Zheng, Shuyue Chen, and Jiangtao Chen. 2014. Vehicle logo recognition based on a weighted spatial pyramid framework. In Proceedings of the IEEE 17th International Conference on Intelligent Transportation Systems (ITSC’14). 1238--1244. DOI:http://dx.doi.org/10.1109/ITSC.2014.6957857 Google ScholarCross Ref
- Dong Kwon Park, Yoon Seok Jeon, and Chee Sun Won. 2000. Efficient use of local edge histogram descriptor. In Proceedings of the 2000 ACM Workshops on Multimedia (MULTIMEDIA’00). ACM, New York, NY, 51--54. Google ScholarDigital Library
- G. Pearce and N. Pears. 2011. Automatic make and model recognition from frontal images of cars. In Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS’11). 373--378. DOI:http://dx.doi.org/10.1109/AVSS.2011.6027353 Google ScholarDigital Library
- Haoyu Peng, Xun Wang, Huiyan Wang, and Wenwu Yang. 2015. Recognition of low-resolution logos in vehicle images based on statistical random sparse distribution. IEEE Trans. Intell. Transport. Syst. 16, 2 (April 2015), 681--691. DOI:http://dx.doi.org/10.1109/TITS.2014.2336675 Google ScholarDigital Library
- Yu Peng, J. S. Jin, Suhuai Luo, Min Xu, and Yue Cui. 2012. Vehicle type classification using PCA with self-clustering. In Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops (ICMEW’12). 384--389. DOI:http://dx.doi.org/10.1109/ICMEW.2012.73 Google ScholarDigital Library
- Yishu Peng, Yunhui Yan, Wenjie Zhu, and Jiuliang Zhao. 2014a. Binary coding-based vehicle image classification. In Proceedings of the 2014 12th International Conference on Signal Processing (ICSP’14). 918--921. DOI:http://dx.doi.org/10.1109/ICOSP.2014.7015138 Google ScholarCross Ref
- Yishu Peng, Yunhui Yan, Wenjie Zhu, and Jiuliang Zhao. 2014b. Vehicle classification using sparse coding and spatial pyramid matching. In Proceedings of the 2014 IEEE 17th International Conference on Intelligent Transportation Systems (ITSC’14). 259--263. DOI:http://dx.doi.org/10.1109/ITSC.2014.6957701 Google ScholarCross Ref
- Florent Perronnin, Jorge Snchez, and Thomas Mensink. 2010. Improving the fisher kernel for large-scale image classification. In Proceedings of the European Conference on Computer Vision (ECCV’10), Kostas Daniilidis, Petros Maragos, and Nikos Paragios (Eds.). Lecture Notes in Computer Science, Vol. 6314. Springer, Berlin, 143--156. DOI:http://dx.doi.org/10.1007/978-3-642-15561-1_11 Google ScholarCross Ref
- V. S. Petrovic and T. Cootes. 2004a. Analysis of features for rigid structure vehicle type recognition. In Proceedings of the British Machine Vision Conference. BMVA Press, 61.1--61.10. doi:10.5244/C.18.61. Google ScholarCross Ref
- V. S. Petrovic and T. F. Cootes. 2004b. Vehicle type recognition with match refinement. In Proceedings of the 17th IEEE International Conference on Pattern Recognition, Vol. 3. 95--98. Google ScholarCross Ref
- J. Prokaj and G. Medioni. 2009. 3-D model based vehicle recognition. In Proceedings of the Workshop on Applications of Computer Vision (WACV’09). 1--7.Google Scholar
- A. Psyllos, C. N. Anagnostopoulos, and E. Kayafas. 2011. Vehicle model recognition from frontal view image measurements. Comput. Stand. Interfaces 33, 2 (2011), 142--151. DOI:http://dx.doi.org/10.1016/j.csi.2010.06.005 XVI IMEKO TC4 Symposium Ëxploring New Frontiers of Instrumentation and Methods for Electrical and Electronic Measurementsänd XIII International Workshop on ADC Modelling and Testing. Google ScholarDigital Library
- A. P. Psyllos, C.-N. E. Anagnostopoulos, and E. Kayafas. 2010. Vehicle logo recognition using a SIFT-based enhanced matching scheme. IEEE Trans. Intell. Transport. Syst. 11, 2 (June 2010), 322--328. DOI:http://dx.doi.org/10.1109/TITS.2010.2042714 Google ScholarDigital Library
- K. Ramnath, S. N. Sinha, R. Szeliski, and E. Hsiao. 2014. Car make and model recognition using 3D curve alignment. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision. 285--292. Google ScholarCross Ref
- J. Redmon, S. Divvala, R. Girshick, and A. Farhadi. 2016. You only look once: Unified, real-time object detection. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16). 779--788. DOI:http://dx.doi.org/10.1109/CVPR.2016.91 Google ScholarCross Ref
- Joseph Redmon, Santosh Kumar Divvala, Ross B. Girshick, and Ali Farhadi. 2015. You only look once: Unified, real-time object detection. CoRR abs/1506.02640 (2015). Retrieved from http://arxiv.org/abs/1506.02640.Google Scholar
- Xiaofeng Ren and D. Ramanan. 2013. Histograms of sparse codes for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’13). 3246--3253. DOI:http://dx.doi.org/10.1109/CVPR.2013.417 Google ScholarDigital Library
- E. Rosten, R. Porter, and T. Drummond. 2010. Faster and better: A machine learning approach to corner detection. IEEE Trans. Pattern Anal. Mach. Intell. 32, 1 (Jan 2010), 105--119. DOI:http://dx.doi.org/10.1109/TPAMI.2008.275 Google ScholarDigital Library
- Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. 2015. ImageNet large scale visual recognition challenge. Int. J. Comput. Vision (IJCV’15) 115, 3 (2015), 211--252. DOI:http://dx.doi.org/10.1007/s11263-015-0816-y Google ScholarDigital Library
- Phillipe Salembier and Thomas Sikora. 2002. Introduction to MPEG-7: Multimedia Content Description Interface. John Wiley 8 Sons, Inc., New York, NY.Google Scholar
- Jorge Sánchez, Florent Perronnin, Thomas Mensink, and Jakob Verbeek. 2013. Image classification with the fisher vector: Theory and practice. Int. J. Comput. Vision 105, 3 (Dec. 2013), 222--245. DOI:http://dx.doi.org/10.1007/s11263-013-0636-x Google ScholarDigital Library
- H. Sandhawalia, J. A. Rodriguez-Serrano, H. Poirier, and G. Csurka. 2013. Vehicle type classification from laser scanner profiles: A benchmark of feature descriptors. In Proceedings of the 16th International IEEE Conference on Intelligent Transportation Systems (ITSC’13). 517--522. DOI:http://dx.doi.org/10.1109/ITSC.2013.6728283 Google ScholarCross Ref
- S. Saravi and E. A. Edirisinghe. 2013. Vehicle make and model recognition in CCTV footage. In Proceedings of the 2013 18th International Conference on Digital Signal Processing (DSP’13). 1--6. Google ScholarCross Ref
- M. S. Sarfraz, A. Shahzad, Muhammad, A. Elahi, M. Fraz, I. Zafar, and E. A. Edirisinghe. 2013. Real-time automatic license plate recognition for CCTV forensic applications. J. Real-Time Image Process. 8, 3 (2013), 285--295. DOI:http://dx.doi.org/10.1007/s11554-011-0232-7 Google ScholarDigital Library
- Xu Shen, Xinmei Tian, Anfeng He, Shaoyan Sun, and Dacheng Tao. 2016. Transform-invariant convolutional neural networks for image classification and search. In Proceedings of the 2016 ACM on Multimedia Conference (MM’16). ACM, New York, NY, 1345--1354. DOI:http://dx.doi.org/10.1145/2964284.2964316 Google ScholarDigital Library
- Abdul Jabbar Siddiqui. 2015. A Robust Vehicle Make and Model Recognition System for ITS Applications. Master’s thesis. University of Ottawa.Google Scholar
- A. J. Siddiqui, A. Mammeri, and A. Boukerche. 2015. Towards efficient vehicle classification in intelligent transportation systems. In Proceedings of the 5th ACM Symposium on Design and Analysis of Intelligent Vehicular Networks and Applications (DIVANet’15). 1--1. DOI:http://dx.doi.org/10.1145/2815347.2815354 Google ScholarDigital Library
- A. J. Siddiqui, A. Mammeri, and A. Boukerche. 2016. Real-time vehicle make and model recognition based on a bag of SURF features. IEEE Trans. Intell. Transport. Syst. 17, 11 (Nov 2016), 3205--3219. DOI:http://dx.doi.org/10.1109/TITS.2016.2545640 Google ScholarDigital Library
- Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. CoRR abs/1409.1556 (2014). Retrieved from http://arxiv.org/abs/1409.1556.Google Scholar
- S. Sivaraman and M. M. Trivedi. 2013. Looking at vehicles on the road: A survey of vision-based vehicle detection, tracking, and behavior analysis. IEEE Trans. Intell. Transport. Syst. 14, 4 (Dec 2013), 1773--1795. DOI:http://dx.doi.org/10.1109/TITS.2013.2266661 Google ScholarDigital Library
- S. Sivaraman and M. M. Trivedi. 2010. A general active-learning framework for on-road vehicle recognition and tracking. IEEE Trans. Intell. Transport. Syst. 11, 2 (June 2010), 267--276. DOI:http://dx.doi.org/10.1109/TITS.2010.2040177 Google ScholarDigital Library
- S. Sivaraman and M. M. Trivedi. 2012. Real-time vehicle detection using parts at intersections. In Proceedings of the 2012 15th International IEEE Conference on Intelligent Transportation Systems (ITSC’12). 1519--1524. DOI:http://dx.doi.org/10.1109/ITSC.2012.6338886 Google ScholarCross Ref
- J. Sivic and A. Zisserman. 2003. Video Google: A text retrieval approach to object matching in videos. In Proceedings of the 9th IEEE International Conference on Computer Vision. 1470--1477 vol. 2. DOI:http://dx.doi.org/10.1109/ICCV.2003.1238663 Google ScholarCross Ref
- J. Sochor, A. Herout, and J. Havel. 2016. BoxCars: 3D boxes as CNN input for improved fine-grained vehicle recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16). 3006--3015. DOI:http://dx.doi.org/10.1109/CVPR.2016.328 Google ScholarCross Ref
- C. Stauffer and W. E. L. Grimson. 1999. Adaptive background mixture models for real-time tracking. In Proceedings of the 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 2. 252. DOI:http://dx.doi.org/10.1109/CVPR.1999.784637 Google ScholarCross Ref
- Chris Stauffer and W. Eric L. Grimson. 2000. Learning patterns of activity using real-time tracking. IEEE Trans. Pattern Anal. Mach. Intell. 22, 8 (Aug. 2000), 747--757. DOI:http://dx.doi.org/10.1109/34.868677 Google ScholarDigital Library
- Y. Sun, X. Wang, and X. Tang. 2014. Deep learning face representation from predicting 10,000 classes. In Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. 1891--1898. DOI:http://dx.doi.org/10.1109/CVPR.2014.244 Google ScholarDigital Library
- Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott E. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2014. Going deeper with convolutions. CoRR abs/1409.4842 (2014). Retrieved from http://arxiv.org/abs/1409.4842.Google Scholar
- S. Taghvaeeyan and R. Rajamani. 2014. Portable roadside sensors for vehicle counting, classification, and speed measurement. IEEE Intell. Transport. Syst. 15, 1 (Feb 2014), 73--83. DOI:http://dx.doi.org/10.1109/TITS.2013.2273876 Google ScholarDigital Library
- Bin Tian, Ye Li, Bo Li, and Ding Wen. 2014. Rear-view vehicle detection and tracking by combining multiple parts for complex urban surveillance. IEEE Trans. Intell. Transport. Syst. 15, 2 (April 2014), 597--606. DOI:http://dx.doi.org/10.1109/TITS.2013.2283302 Google ScholarCross Ref
- K. A. Toh and H. L. Eng. 2008. Between classification-error approximation and weighted least-squares learning. IEEE Trans. Pattern Anal. Mach. Intell. 30, 4 (April 2008), 658--669. DOI:http://dx.doi.org/10.1109/TPAMI.2007.70730 Google ScholarDigital Library
- Kar-Ann Toh, Quoc-Long Tran, and D. Srinivasan. 2004. Benchmarking a reduced multivariate polynomial pattern classifier. IEEE Trans. Pattern Anal. Mach. Intell. 26, 6 (June 2004), 740--755. DOI:http://dx.doi.org/10.1109/TPAMI.2004.3 Google ScholarDigital Library
- V. Varjas and A. Tanacs. 2013. Car recognition from frontal images in mobile environment. In Proceedings of the 2013 8th International Symposium on Image and Signal Processing and Analysis (ISPA’13). 819--823. DOI:http://dx.doi.org/10.1109/ISPA.2013.6703849 Google ScholarCross Ref
- P. Viola and M. Jones. 2001. Rapid object detection using a boosted cascade of simple features. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’01). Vol. 1. I--511--I--518. DOI:http://dx.doi.org/10.1109/CVPR.2001.990517 Google ScholarCross Ref
- Jinjun Wang, Jianchao Yang, Kai Yu, Fengjun Lv, T. Huang, and Yihong Gong. 2010. Locality-constrained linear coding for image classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’10). 3360--3367. DOI:http://dx.doi.org/10.1109/CVPR.2010.5540018 Google ScholarCross Ref
- Rui Wang, Lei Zhang, Kejiang Xiao, Rongli Sun, and Li Cui. 2014. EasiSee: Real-time vehicle classification and counting via low-cost collaborative sensing. IEEE Trans. Intell. Transport. Syst. 15, 1 (Feb 2014), 414--424. Google ScholarDigital Library
- Yang Wang and Greg Mori. 2010. A discriminative latent model of object classes and attributes. In Proceedings of the 11th European Conference on Computer Vision: Part V (ECCV’10). Springer-Verlag, Berlin, 155--168. Retrieved from http://dl.acm.org/citation.cfm?id=1888150.1888163. Google ScholarCross Ref
- S. Winder, G. Hua, and M. Brown. 2009. Picking the best DAISY. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’09). 178--185. DOI:http://dx.doi.org/10.1109/CVPR.2009.5206839 Google ScholarCross Ref
- Aree Witoelar, Michael Biehl, Anarta Ghosh, and Barbara Hammer. 2008. Learning dynamics and robustness of vector quantization and neural gas. Neurocomputing 71, 79 (2008), 1210--1219. DOI:http://dx.doi.org/10.1016/j.neucom.2007.11.022 Progress in Modeling, Theory, and Application of Computational Intelligence, 15th European Symposium on Artificial Neural Networks 2007. Google ScholarDigital Library
- Jianli Xiao, Wenshu Xiang, and Yuncai Liu. 2015. Vehicle logo recognition by weighted multi-class support vector machine ensembles based on sharpness histogram features. Image Process. IET 9, 7 (2015), 527--534. Google ScholarCross Ref
- S. Xie, T. Yang, Xiaoyu Wang, and Yuanqing Lin. 2015. Hyper-class augmented and regularized deep learning for fine-grained image classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’15). 2645--2654. DOI:http://dx.doi.org/10.1109/CVPR.2015.7298880 Google ScholarCross Ref
- L. Yang, P. Luo, C. C. Loy, and X. Tang. 2015. A large-scale car dataset for fine-grained categorization and verification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’15). 3973--3981. DOI:http://dx.doi.org/10.1109/CVPR.2015.7299023 Google ScholarCross Ref
- Shuyuan Yu, Shibao Zheng, Hua Yang, and Longfei Liang. 2013. Vehicle logo recognition based on bag-of-words. In Proceedings of the 10th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS’13). 353--358. DOI:http://dx.doi.org/10.1109/AVSS.2013.6636665 Google ScholarCross Ref
- I. Zafar, E. A. Edirisinghe, and B. S. Acar. 2009. Localized contourlet features in vehicle make and model recognition. In Proceedings of the SPIE Conference on Image Processing: Machine Vision Applications II, Vol. 7251. Google ScholarCross Ref
- I. Zafar, E. A. Edirisinghe, S. Acar, and H. E. Bez. 2007. Two-dimensional statistical linear discriminant analysis for real-time robust vehicle-type recognition. Proceedings of the SPIE International Conference on Real-Time Image Processing 6496 (2007), 649602--649602--8.Google Scholar
- Matthew D. Zeiler and Rob Fergus. 2014. Visualizing and Understanding Convolutional Networks. Springer International Publishing, Cham, 818--833. DOI:http://dx.doi.org/10.1007/978-3-319-10590-1_53 Google ScholarCross Ref
- Bailing Zhang. 2013. Reliable classification of vehicle types based on cascade classifier ensembles. IEEE Trans. Intell. Transport. Syst. 14, 1 (March 2013), 322--332. Google ScholarDigital Library
- Bailing Zhang, Yifan Zhou, Hao Pan, and Tammam Tillo. 2013. Hybrid model of clustering and kernel autoassociator for reliable vehicle type classification. Mach. Vision Appl. 25, 2 (2013), 437--450. DOI:http://dx.doi.org/10.1007/s00138-013-0588-8 Google ScholarDigital Library
- Haihong Zhang, Weimin Huang, Zhiyong Huang, and Bailing Zhang. 2004. Kernel autoassociator with applications to visual classification. In Proceedings of the 17th International Conference on Pattern Recognition (ICPR’04)., Vol. 2. 443--446. DOI:http://dx.doi.org/10.1109/ICPR.2004.1334252 Google ScholarCross Ref
- Haihong Zhang, Weimin Huang, Zhiyong Huang, and Bailing Zhang. 2005. A kernel autoassociator approach to pattern classification. IEEE Trans. Syst. Man Cybernet., Part B: Cybernet. 35, 3 (June 2005), 593--606. DOI:http://dx.doi.org/10.1109/TSMCB.2005.843980 Google ScholarDigital Library
- Lun Zhang, S. Z. Li, Xiaotong Yuan, and Shiming Xiang. 2007. Real-time object classification in video surveillance based on appearance learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1--8. Google ScholarCross Ref
- T. Zhang, W. Zheng, Z. Cui, Y. Zong, J. Yan, and K. Yan. 2016. A deep neural network-driven feature learning method for multi-view facial expression recognition. IEEE Trans. Multimedia 18, 12 (Dec 2016), 2528--2536. DOI:http://dx.doi.org/10.1109/TMM.2016.2598092 Google ScholarDigital Library
- Yiren Zhou and Ngai-Man Cheung. 2016. Vehicle classification using transferable deep neural network features. CoRR abs/1601.01145v2 (2016). Retrieved from http://arxiv.org/abs/1601.01145v2.Google Scholar
- Zoran Zivkovic and Ferdinand van der Heijden. 2006. Efficient adaptive density estimation per image pixel for the task of background subtraction. Pattern Recogn. Lett. 27, 7 (May 2006), 773--780. DOI:http://dx.doi.org/10.1016/j.patrec.2005.11.005 Google ScholarDigital Library
Index Terms
- Automated Vehicle Detection and Classification: Models, Methods, and Techniques
Recommendations
Vision-based Autonomous Vehicle Recognition: A New Challenge for Deep Learning-based Systems
Vision-based Automated Vehicle Recognition (VAVR) has attracted considerable attention recently. Particularly given the reliance on emerging deep learning methods, which have powerful feature extraction and pattern learning abilities, vehicle ...
Vehicle Detection and Neural Network Application for Vehicle Classification
CICN '11: Proceedings of the 2011 International Conference on Computational Intelligence and Communication NetworksIn recent years, video processing have been widely used in traffic management. Vehicle detection and vehicle classification using neural network (NN), can be achieved by video monitoring systems. In most vehicle detection methods in the literature, only ...
A vehicle classification system based on hierarchical multi-SVMs in crowded traffic scenes
Automatic vehicle classification is very important for video surveillance, especially for intelligent transportation system. Currently, some approaches have been proposed. However, almost all of these methods cannot play well in the practical crowded ...
Comments