skip to main content
research-article

MultiFusion: A boosting approach for multimedia fusion

Published:26 November 2010Publication History
Skip Abstract Section

Abstract

The multimodal data usually contain complementary, correlated and redundant information. Thus, multimodal fusion is useful for many multisensor applications. Here, a novel multimodal fusion algorithm is proposed, which is referred to as “MultiFusion.” The approach adopts a boosting structure where the atomic event is considered as the fusion unit. The correlation of multimodal data is used to form an overall classifier in each iteration. Moreover, by adopting the Adaboost-like structure, the overall fusion performance is improved. Both the simulation experiment and the real application show the effectiveness of the MultiFusion approach. Our approach can be applied in different multimodal applications to exploit the multimedia data characteristics and improve the performance.

References

  1. Aly, R. and Hiemstra, D. 2009. Concept detectors: how good is good enough? In Procedings of the ACM International Conference on Multimedia. 233--242. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Amores, J., Sebe, N., Radeva, P., Gevers, T., and Smeulders, A. W. M. 2004. Boosting contextual information in content-based image retrieval. In Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval. 31--38. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Atrey, P. K., Kankanhalli, M. S., and Jain, R. 2006. Information assimilation framework for event detection in multimedia surveillance systems. Multimedia Syst. 12, 3, 239--253.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Barbu, C., Iqbal, R. T., and Peng, J. 2005. Classifier fusion using shared sampling distribution for boosting. In Proceedings of the International Conference on Data Mining. 34--41. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Contreras, F. M., Urunuela, C. O., and del Rincon, J. M. 2009. Adaboost multiple feature selection and combination for face recognition. In Proceedings of the Iberian Conference on Pattern Recognition and Image Analysis. 338--345. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Dasarathy, B. V. 1994. Decision Fusion. Computer Society Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Freund and Schapire. 1997. A decision-theoretic generalization of on-line learning and an application to boosting. J. Compu. Syst. Sci. 55. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Freund, Y. and Schapire, R. E. 1996. Experiments with a new boosting algorithm. In Proceedings of the International Conference on Machine Learning. 148--156.Google ScholarGoogle Scholar
  9. Guo, G. and Zhang, H. 2001. Boosting for fast face recognition. Tech. rep. MSR-TR-2001-16, Microsoft Research.Google ScholarGoogle Scholar
  10. Guo, W., Qi, Q., Jiang, L., and Zhang, P. 2008. A new method of sar image target recognition based on adaboost algorithm. In Proceedings of the IEEE International Geoscience and Remote Sensing Symposium.Google ScholarGoogle Scholar
  11. Iyengar, G., Nock, H., and Neti, C. 2003. Audio-visual synchrony for detection of monologues in video archives. In Proceedings of the IEEE International Conference on Multimedia and Expo 1, 329--332. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Joshi, D., Naphade, M. R., and Natsev, A. 2007. Semantics reinforcement and fusion learning for multimedia streams. In Proceedings of the International Conference on Image and Video Retrieval. 309--316. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Maghooli, K. and Moin, M. S. 2004. A new approach on multimodal biometrics based on combining neural networks using adaboost. In Biometric Authentication. Lecture Notes in Computer Science Vol. 3087 332--341.Google ScholarGoogle Scholar
  14. Parikh, D., Kim, M. T., Oagaro, J., Mandayam, S., and Polikar, R. 2004. Combining classifiers for multisensor data fusion. In Proceedings of the International Conference on Systems, Man and Cybernetics. 1232--1237.Google ScholarGoogle Scholar
  15. Pickering, M. J., Ruger, S. M., and Sinclair, D. 2002. Video retrieval by feature learning in key frames. In Proceedings of the International Conference on Image and Video Retrieval. 309--317. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Polikar, R. 2006. Ensemble based systems in decision making. IEEE Circu. Syst. Maga. 6, 3, 21--45.Google ScholarGoogle ScholarCross RefCross Ref
  17. Polikar, R., Upda, L., Upda, S. S., and Honavar, V. 2001. Learn++: an incremental learning algorithm for supervised neural networks. IEEE Trans. Syst. Man, Cyber. Part C 31, 4, 497--508. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Schapire, R. E. 1990. The strength of weak learnability. Mach. Learn. 5, 2. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Tieu, K. and Viola, P. 2004. Boosting image retrieval. Int. J. Comput. Vis. 56, 1--2, 17--36. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Xue, F. and Ding, X. 2006. 3D+2D face localization using boosting in multi-modal feature space. In Proceedings of the International Conference on Pattern Recognition. III: 499--502. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Zhang, L., Li, M. J., and Zhang, H. J. 2002. Boosting image orientation detection with indoor vs. outdoor classification. In Proceedings of the IEEE Workshop on Applications of Computer Vision. 95--99. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Zhang, T. and Yu, B. 2005. Boosting with early stopping: Convergence and consistency. Ann. Stat. 33, 1538--1579.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. MultiFusion: A boosting approach for multimedia fusion

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Multimedia Computing, Communications, and Applications
        ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 6, Issue 4
        November 2010
        159 pages
        ISSN:1551-6857
        EISSN:1551-6865
        DOI:10.1145/1865106
        Issue’s Table of Contents

        Copyright © 2010 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 26 November 2010
        • Accepted: 1 June 2010
        • Revised: 1 May 2010
        • Received: 1 January 2010
        Published in tomm Volume 6, Issue 4

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader