skip to main content
10.1145/2661829.2661896acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Customized Organization of Social Media Contents using Focused Topic Hierarchy

Authors Info & Claims
Published:03 November 2014Publication History

ABSTRACT

With the popularity of social media platforms such as Facebook and Twitter, the amount of useful data in these sources is rapidly increasing, making them promising places for information acquisition. This research aims at the customized organization of a social media corpus using focused topic hierarchy. It organizes the contents into different structures to meet with users' different information needs (e.g., "iPhone 5 problem" or "iPhone 5 camera"). To this end, we introduce a novel function to measure the likelihood of a topic hierarchy, by which the users' information need can be incorporated into the process of topic hierarchy construction. Using the structure information within the generated topic hierarchy, we then develop a probability based model to identify the representative contents for topics to assist users in document retrieval on the hierarchy. Experimental results on real world data illustrate the effectiveness of our method and its superiority over state-of-the-art methods for both information organization and retrieval tasks.

References

  1. K. Bade and A. Nürnberger. Creating a cluster hierarchy under constraints of a partially known hierarchy. In SDM'08, pages 13--24. SIAM.Google ScholarGoogle Scholar
  2. D. M. Blei, T. L. Griffiths, M. I. Jordan, and J. B. Tenenbaum. Hierarchical topic models and the nested chinese restaurant process. In NIPS'03.Google ScholarGoogle Scholar
  3. D. Chakrabarti, R. Kumar, and A. Tomkins. Evolutionary clustering. In SIGKDD'06, pages 554--560. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S.-L. Chuang and L.-F. Chien. A practical web-based approach to generating topic hierarchy for text segments. In CIKM '04, pages 127--136. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Danilevsky, C. Wang, F. Tao, S. Nguyen, G. Chen, N. Desai, L. Wang, and J. Han. Amethyst: A system for mining and exploring topical hierarchies of heterogeneous data. In KDD '13, pages 1458--1461. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. I. Davidson and S. Ravi. Agglomerative hierarchical clustering with constraints: Theoretical and empirical results. In PKDD'05, pages 59--70. Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Edmonds. Optimum branchings. Journal of Research of the National Bureau of Standards B, 71:233--240, 1967.Google ScholarGoogle ScholarCross RefCross Ref
  8. R. Fu, J. Guo, B. Qin, W. Che, H. Wang, and T. Liu. Learning semantic hierarchies via word embeddings. In ACL'14, volume 1.Google ScholarGoogle Scholar
  9. B. C. Fung, K. Wang, and M. Ester. Hierarchical document clustering using frequent itemsets. In SDM'03, volume 3, pages 59--70. SIAM.Google ScholarGoogle Scholar
  10. X. Han and J. Zhao. Structural semantic relatedness: a knowledge-based method to named entity disambiguation. In ACL'10, pages 50--59. ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. He, V. Hollink, and A. de Vries. Combining implicit and explicit topic representations for result diversification. In SIGIR'12, pages 851--860. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. C. König and E. Brill. Reducing the human overhead in text categorization. In KDD '06, pages 598--603. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. X. Liu, Y. Song, S. Liu, and H. Wang. Automatic taxonomy construction from keywords. In SIGKDD'12, pages 1433--1441. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. O. Medelyan, S. Manion, J. Broekstra, A. Divoli, A.-L. Huang, and I. Witten. Constructing a focused taxonomy from a document collection. In P. Cimiano, O. Corcho, V. Presutti, L. Hollink, and S. Rudolph, editors, ESWC'13, volume 7882, pages 367--381. Springer Berlin Heidelberg.Google ScholarGoogle Scholar
  15. D. Mimno, W. Li, and A. McCallum. Mixtures of hierarchical topics with pachinko allocation. In ICML'07, pages 633--640. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Z.-Y. Ming, K. Wang, and T.-S. Chua. Prototype hierarchy based clustering for the categorization and navigation of web collections. In SIGIR '10, pages 2--9. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Navigli, P. Velardi, and S. Faralli. A graph-based algorithm for inducing lexical taxonomies from scratch. In IJCAI'11, pages 1872--1877. AAAI Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, M. Gatford, et al. Okapi at trec-3. NIST SPECIAL PUBLICATION SP, pages 109--109, 1995.Google ScholarGoogle Scholar
  19. M. Sanderson and B. Croft. Deriving concept hierarchies from text. In SIGIR '99, pages 206--213. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. U. Scaiella, P. Ferragina, A. Marino, and M. Ciaramita. Topical clustering of search results. In WSDM'12, pages 223--232. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. Snow, D. Jurafsky, and A. Y. Ng. Semantic taxonomy induction from heterogenous evidence. In ACL'06, pages 801--808. ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. C. Wang, M. Danilevsky, N. Desai, Y. Zhang, P. Nguyen, T. Taula, and J. Han. A phrase mining framework for recursive construction of a topical hierarchy. In SIGKDD' 13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Wang, C. Kang, Y. Chang, and J. Han. A hierarchical dirichlet model for taxonomy expansion for search engines. Urbana, 51:61801.Google ScholarGoogle Scholar
  24. X. Wang, S. Liu, Y. Song, and B. Guo. Mining evolutionary multi-branch trees from text streams. In KDD '13, pages 722--730. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. H. Yang and J. Callan. A metric-based framework for automatic taxonomy induction. In ACL'09, pages 271--279. ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. J. Yu, Z.-J. Zha, M. Wang, K. Wang, and T.-S. Chua. Domain-assisted product aspect hierarchy generation: towards hierarchical organization of unstructured consumer reviews. In EMNLP '11, pages 140--150. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. H.-J. Zeng, Q.-C. He, Z. Chen, W.-Y. Ma, and J. Ma. Learning to cluster web search results. In SIGIR'04, pages 210--217. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Y. Zhao and G. Karypis. Evaluation of hierarchical clustering algorithms for document datasets. In CIKM'02, pages 515--524. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. X. Zhu, Z.-Y. Ming, X. Zhu, and T.-S. Chua. Topic hierarchy construction for the organization of multi-source user generated contents. In SIGIR '13, pages 233--242. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Customized Organization of Social Media Contents using Focused Topic Hierarchy

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management
      November 2014
      2152 pages
      ISBN:9781450325981
      DOI:10.1145/2661829

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 November 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      CIKM '14 Paper Acceptance Rate175of838submissions,21%Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader