skip to main content
10.1145/3055635.3056579acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlcConference Proceedingsconference-collections
research-article

Siamese Network for Underwater Multiple Object Tracking

Authors Info & Claims
Published:24 February 2017Publication History

ABSTRACT

For underwater videos, the performance of object tracking is greatly affected by illumination changes, background disturbances and occlusion. Hence, there is a need to have a robust function that computes image similarity, to accurately track the moving object. In this work, a hybrid model that incorporates the Kalman Filter, a Siamese neural network and a miniature neural network has been developed for object tracking. It was observed that the usage of the Siamese network to compute image similarity significantly improved the robustness of the tracker. Although the model was developed for underwater videos, it was found that it performs well for both underwater and human surveillance videos. A metric has been defined for analyzing detections-to-tracks mapping accuracy. Tracking results have been analyzed using Multiple Object Tracking Accuracy (MOTA) and Multiple Object Tracking Precision (MOTP)metrics.

References

  1. Yilmaz, Alper and Javed, Omar and Shah, Mubarak, "Object Tracking: A Survey", ACM Comput. Surv., Vol. 38, No. 4, 2006 Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Yu Xiang, Alexandre Alahi and Silvio Savarese, "Learning to Track: Online Multi-Object Tracking by Decision Making", IEEE International Conference on Computer Vision, 2015, pp. 4705--4713 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Takala, Valtteri, and MattiPietikainen, "Multi object tracking using color, texture and motion". IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  4. Han, Mei, et al. "A detection-based multiple object tracking method". Image Processing, 2004. ICIP'04. 2004 International Conference on. Vol. 5. IEEE, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  5. Ondruska, Peter, and Ingmar Posner. "Deep tracking: Seeing beyond seeing using recurrent neural networks". arXiv preprint arXiv:1602.00991, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Huang, Phoenix X., Bastiaan J. Boom, and Robert B. Fisher, "Underwater live fish recognition using a balance-guaranteed optimized tree." Asian Conference on Computer Vision. Springer Berlin Heidelberg, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Fish-For-Knowledge @http://groups.inf.ed.ac.uk/f4k/Google ScholarGoogle Scholar
  8. Hemavathy R, Rashmi G N, Pooja Varambally, and Shobha G, "Segmenting of Object in postprocessing stage in Underwater Video Sequences Under Dynamic condition" in Proc IEEE Int. Advance Computing Conference(IACC), Jun2015, PgNo. 111--115Google ScholarGoogle Scholar
  9. Milan, Anton, et al., "MOT16: A Benchmark for Multi Object Tracking", arXiv preprint:1603.00831 (2016).Google ScholarGoogle Scholar
  10. Chopra, Sumit, Raia Hadsell, and Yann LeCun, "Learning a similarity metric discriminatively, with application to face verification", 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05). Vol. 1. IEEE, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Jia, Yangqing, et al. "Caffe: Convolutional architecture for fast feature embedding" Proceedings of the 22nd ACM international conference on Multimedia, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Yamaguchi, Kota https://github.com/kyamagu/mexopencvGoogle ScholarGoogle Scholar
  13. Hemavathy R, Rashmi G N, Shobha G, "MovingObject Segmentation from Underwater Videos Using Adaptive Collective Background Learning Approach", International Journal of Applied Engineering Research, Vol 11, No.9, pp. 6651--6654.Google ScholarGoogle Scholar
  14. Vondrick, Carl, Donald Patterson, and Deva Ramanan, "Efficiently scaling up crowdsourced video annotation", International Journal of Computer Vision 101.1, 2013, pp. 184--204. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICMLC '17: Proceedings of the 9th International Conference on Machine Learning and Computing
    February 2017
    545 pages
    ISBN:9781450348171
    DOI:10.1145/3055635

    Copyright © 2017 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 24 February 2017

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader