skip to main content
10.1145/2487575.2487610acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Open Access

Multi-label relational neighbor classification using social context features

Published:11 August 2013Publication History

ABSTRACT

Networked data, extracted from social media, web pages, and bibliographic databases, can contain entities of multiple classes, interconnected through different types of links. In this paper, we focus on the problem of performing multi-label classification on networked data, where the instances in the network can be assigned multiple labels. In contrast to traditional content-only classification methods, relational learning succeeds in improving classification performance by leveraging the correlation of the labels between linked instances. However, instances in a network can be linked for various causal reasons, hence treating all links in a homogeneous way can limit the performance of relational classifiers.

In this paper, we propose a multi-label iterative relational neighbor classifier that employs social context features (SCRN). Our classifier incorporates a class propagation probability distribution obtained from instances' social features, which are in turn extracted from the network topology. This class-propagation probability captures the node's intrinsic likelihood of belonging to each class, and serves as a prior weight for each class when aggregating the neighbors' class labels in the collective inference procedure. Experiments on several real-world datasets demonstrate that our proposed classifier boosts classification performance over common benchmarks on networked multi-label data.

References

  1. Bhagat, S., Cormode, G., and Muthukrishnan, S. Node classification in social networks. Computing Research Repository (CoRR) abs/1101.3291 (2011).Google ScholarGoogle Scholar
  2. Boughorbely, S., Tarel, J.-P., and Boujemaa, N. Generalized histogram intersection kernel for image recognition. In IEEE International Conference on Image Processing (2005).Google ScholarGoogle ScholarCross RefCross Ref
  3. Chakrabarti, S., Dom, B., , and Indyk, P. Enhanced hypertext categorization using hyperlinks. In Proceedings of the ACM International Conference on Management of Data (SIGMOD) (1998), pp. 307--318. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Fan, R., and Lin, C. A study on threshold selection for multi-label classification. Tech. rep., National Taiwan University, 2007.Google ScholarGoogle Scholar
  5. Fan, Y., and Shelton, C. R. Learning continuous-time social network dynamics. In Proceedings of Conference on Uncertainty in Artificial Intelligence (UAI) (2009), pp. 161--168. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Getoor, L., and Taskar, B. Introduction to Statistical Relational Learning. The MIT Press, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Goldberg, A., Zhu, X., and Wright, S. Dissimilarity in graph-based semi-supervised classification. In Eleventh International Conference on Artificial Intelligence and Statistics (AISTATS) (2007).Google ScholarGoogle Scholar
  8. Guo, Y., and Gu, S. Multi-label classification using conditional dependency networks. In Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI) (2011), pp. 1300--1305. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Heatherly, R., Kantarcioglu, M., and Li, X. Social network classification incorporating link type. In Proceedings of IEEE Intelligence and Security Informatics (ISI) (2009), pp. 19--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Ji, M., Han, J., and Danilevsky, M. Ranking-based classification of heterogeneous information networks. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2011), pp. 1298--1306. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Lewis, D. D., Yang, Y., Rose, T. G., and Li, F. RCV1: A new benchmark collection for text categorization research. The Journal of Machine Learning Research 5 (Dec 2004), 361--397. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Lu, Q., and Getoor, L. Link-based classification. In Proceedings of 20th International Conference on Machine Learning (ICML) (2003), pp. 496--503.Google ScholarGoogle Scholar
  13. Macskassy, S. A., and Provost, F. A simple relational classifier. In Proceedings of the Second Workshop on Multi-Relational Data Mining (MRDM) at KDD 2003 (2003), pp. 64--76.Google ScholarGoogle ScholarCross RefCross Ref
  14. Macskassy, S. A., and Provost, F. Classification in networked data: a toolkit and a univariate case study. Journal of Machine Learning 8 (2007), 935--983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. McPherson, M., Smith-Lovin, L., and Cook, J. M. Birds of a feather: Homophily in social networks. Annual Review of Sociology 27, 1 (2001), 415--444.Google ScholarGoogle ScholarCross RefCross Ref
  16. Neville, J., Gallagher, B., Eliassi-Rad, T., and Wang, T. Correcting evaluation bias of relational classifiers with network cross validation. Knowledge and Information Systems (Jan 2011), 1--25. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Neville, J., and Jensen, D. Iterative classification in relational data. In Proceedings of the AAAI Workshop on Learning Statistical Models from Relational Data (2000), pp. 42--49.Google ScholarGoogle Scholar
  18. Neville, J., Jensen, D., Friedland, L., and Hay, M. Learning relational probability trees. In Proceedings of the ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD) (2003), pp. 625--630. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Newman, M. Networks: An Introduction. Oxford University Press, 2010. Google ScholarGoogle ScholarCross RefCross Ref
  20. Sen, P., Namata, G., Bilgic, M., Getoor, L., Gallagher, B., and Eliassi-Rad, T. Collective classification in network data. AI Magazine (2008), 93--106.Google ScholarGoogle Scholar
  21. Singh, A., and Gordon, G. A Bayesian matrix factorization model for relational data. In Proceedings of Conference on Uncertainty in Artificial Intelligence (UAI) (2010), pp. 556--563.Google ScholarGoogle Scholar
  22. Tang, L., and Liu, H. Relational learning via latent social dimensions. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2009), KDD '09, pp. 817--826. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Tang, L., and Liu, H. Scalable learning of collective behavior based on sparse social dimensions. In Proceedings of International Conference on Information and Knowledge Management (CIKM) (2009). Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Tang, L., and Liu, H. Leveraging social media networks for classification. Data Mining and Knowledge Discovery (DMKD 2011) 23, 3 (Nov. 2011), 447--478. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Taskar, B., Abbeel, P., and Koller, D. Discriminative probabilistic models for relational data. In Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI) (2002), pp. 895--902. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Wang, X., and Sukthankar, G. Extracting social dimensions using Fiedler embedding. In Proceedings of IEEE International Confernece on Social Computing (2011), pp. 824--829.Google ScholarGoogle ScholarCross RefCross Ref
  27. Yedidia, J. S., Freeman, W. T., and Weiss, Y. Constructing free energy approximations and generalized belief propagation algorithms. IEEE Transactions on Information Theory 51 (2005), 2282--2312. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Zhang, X., Yuan, Q., Zhao, S., Fan, W., Zheng, W., and Wang, Z. Multi-label classification without the multi-label cost. In Proceedings of SIAM International Conference on Data Mining (Apr. 2010).Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Multi-label relational neighbor classification using social context features

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        KDD '13: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
        August 2013
        1534 pages
        ISBN:9781450321747
        DOI:10.1145/2487575

        Copyright © 2013 Owner/Author

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 11 August 2013

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        KDD '13 Paper Acceptance Rate125of726submissions,17%Overall Acceptance Rate1,133of8,635submissions,13%

        Upcoming Conference

        KDD '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader