skip to main content
10.1145/2020408.2020592acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
poster

Mining partially annotated images

Published:21 August 2011Publication History

ABSTRACT

In this paper, we study the problem of mining partially annotated images. We first define what the problem of mining partially annotated images is, and argue that in many real-world applications annotated images are typically partially annotated and thus that the problem of mining partially annotated images exists in many situations. We then propose an effective solution to this problem based on a statistical model we have developed called the Semi-Supervised Correspondence Hierarchical Dirichlet Process (SSCHDP). The main idea of this model lies in exploiting the information pertaining to partially annotated images or even unannotated images to achieve semi-supervised learning under the HDP structure. We apply this model to completing the annotations appropriately for partially annotated images in the training data and then to predicting the annotations appropriately and completely for all the unannotated images either in the training data or in any unseen data beyond the training process. Experiments show that SSC-HDP is superior to the peer models from the recent literature when they are applied to solving the problem of mining partially annotated images.

References

  1. http://www.fruitfly.org/.Google ScholarGoogle Scholar
  2. K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. M. Blei, and M. I. Jordan. Matching words and pictures. Journal of Machine Learning Research, 3:1107--1135, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. M. Blei and M. I. Jordan. Modeling annotated data. In Proceedings of the 26th International ACM SIGIR Conference, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. R. Boutell, J. Luo, X. Shen, and C. M. Brown. Learning multi-label scene classification. Pattern Recognition, 37(9):1757--1771, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  6. T. S. Chua, J. Tang, R. Hong, H. Li, Z. Luo, and Y. Zheng. Nus-wide: a real-world web image database from national university of singapore. In Proceedings of the ACM International Conference on Image and Video Retrieval, pages 1--9, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Datta, D. Joshi, J. Li, and J. Z. Wang. Image retrieval: Ideas, influences, and trends of the new age. ACM Computing Surveys, 40:1--60, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Everingham, L. V. Gool, C. Williams, C. K. I., J. Winn, and A. Zisserman. The pascal visual object classes (voc) challenge. International Journal of Computer Vision, 88(2):303--338, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Feng, R. Manmatha, and V. Lavrenko. Multiple bernoulli relevance models for image and video annotation. In Proceedings of International Conference on Computer Vision and Pattern Recognition, pages 1002--1009, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Z. Guo, Z. Zhang, E. P. Xing, and C. Faloutsos. Enhanced max margin learning on multimodal data mining in a multimedia database. In Proceedings of 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. H. Ishwaran and L. F. James. Gibbs sampling methods for stick-breaking priors. Journal of the American Statistical Association, 96(453):161--173, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  12. J. Jeon, V. Lavrenko, and R. Manmatha. Automatic image annotation and retrieval using cross-media relevance models. In Proceedings of ACM Special Interest Group on Information Retrieval, pages 119--126, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. V. Lavrenko, R. Manmatha, and J. Jeon. A model for learning the semantics of pictures. In Proceedings of Neural Information Processing Systems, 2003.Google ScholarGoogle Scholar
  14. L.-J. Li and L. Fei-Fei. What, where and who? classifying events by scene and object recognition. In Proceedings of International Conference Computer Vision, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  15. W. Li and M. Sun. Semi-supervised learning for image annotation based on conditional random fields. In Proceedings of ACM International Conference on Image and Video Retrieval, pages 463--472, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Z. Li, J. Liu, X. Zhu, T. Liu, and H. Lu. Image annotation using multi-correlation probabilistic matrix factorization. In Proceedings of ACM international conference on Multimedia, pages 1187--1190, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. P. Liang, Petrov, M. I. Jordan, and D. Klein. The infinite pcfg using hierarchical dirichlet processes. In Proceedings of Empirical Methods in Natural Language Processing, pages 688--697, 2007.Google ScholarGoogle Scholar
  18. B. Liu, W. S. Lee, P. S. Yu, and X. Li. Partially supervised classification of text documents. In Proceedings of the 19th International Conference on Machine Learning, pages 387--394, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. N. Loeff, A. Farhadi, I. Endres, and D. A. Forsyth. Unlabeled data improves word prediction. In Proceedings of International Conference Computer Vision, 2009.Google ScholarGoogle Scholar
  20. D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91--110, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. D. G. Luenberger and Y. Ye. Linear and Nonlinear Programming. Springer, third edition, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  22. J. Sethuraman. A constructive definition of dirichlet priors. Statistica Sinica, 4:639--650, 1994.Google ScholarGoogle Scholar
  23. A. Sharma, G. Hua, Z. Liu, and Z. Zhang. Meta-tag propagation by co-training an ensemble classifier for improving image search relevance. In Computer Vision and Pattern Recognition Workshop, pages 1--6, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  24. Y.-Y. Sun, Y. Zhang, and Z.-H. Zhou. Multi-label learning with weak label. In Proceedings of Association for the Advancement of Artificial Intelligence, pages 593--598, 2010.Google ScholarGoogle Scholar
  25. Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei. Hierarchical dirichlet processes. Journal of the American Statistical Association, 101:1566--1581, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  26. O. Yakhnenko and V. Honavar. Annotating images and image objects using a hierarchical dirichlet process model. In Proceedings of the 9th International Workshop on Multimedia Data Mining, pages 1--7, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. S.-H. Yang, H. Zha, and B.-G. Hu. Dirichlet-bernoulli alignment: A generative model for multi-class multi-label multi-instance corpora. In Proceedings of Neural Information Processing Systems, pages 2143--2150, 2009.Google ScholarGoogle Scholar
  28. R. Zhang, Z. Zhang, M. Li, W.-Y. Ma, and H.-J. Zhang. A probabilistic semantic model for image annotation and multi-modal image retrieval. In Proceedings of International Conference Computer Vision, pages 846--851, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Z.-H. Zhou and M.-L. Zhang. Multi-instance multilabel learning with application to scene classification. In Proccedings of Neural Information Processing Systems, pages 1609--1616, 2007.Google ScholarGoogle Scholar
  30. X. Zhu. Semi-supervised learning literature survey. Technical report, Computer Sciences TR 1530, University of Wisconsin-Madison, 2005.Google ScholarGoogle Scholar

Index Terms

  1. Mining partially annotated images

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          KDD '11: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
          August 2011
          1446 pages
          ISBN:9781450308137
          DOI:10.1145/2020408

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 21 August 2011

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • poster

          Acceptance Rates

          Overall Acceptance Rate1,133of8,635submissions,13%

          Upcoming Conference

          KDD '24

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader