DOI: 10.1145/1555400.1555411
short-paper

Finding topic trends in digital libraries

Published: 15 June 2009

ABSTRACT

We propose a generative model based on latent Dirichlet allocation for mining distinct topics in document collections by integrating the temporal ordering of documents into the generative process. The document collection is divided into time segments, where the topics discovered in each segment are propagated to influence topic discovery in subsequent time segments. We conduct experiments on a collection of academic papers from the CiteSeer repository. We augment the text corpus with user queries and tags, and integrate the citation graph to boost the weight of topical terms. The experimental results show that the segmented topic model can effectively detect distinct topics and their evolution over time.
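
The segment-to-segment propagation described in the abstract can be illustrated with a minimal sketch. The abstract does not say how topics are propagated, so this sketch makes an assumption: the topic-word counts learned in one segment are scaled by a hypothetical `carry` factor and added to the Dirichlet prior of the next segment. The sampler is a plain collapsed Gibbs sampler for latent Dirichlet allocation, and all function and parameter names (`gibbs_lda`, `segmented_lda`, `carry`, `alpha`, `beta`) are illustrative rather than taken from the paper. The query, tag, and citation-graph weighting is not modeled here.

```python
import random


def gibbs_lda(docs, vocab_size, num_topics, prior, alpha=0.1, iters=50, seed=0):
    """Collapsed Gibbs sampling for LDA with a per-topic, per-word prior.

    docs  : list of documents, each a list of integer word ids
    prior : prior[k][w] is the Dirichlet pseudo-count for word w in topic k
    Returns the topic-word count matrix n_kw (num_topics x vocab_size).
    """
    rng = random.Random(seed)
    n_kw = [[0] * vocab_size for _ in range(num_topics)]  # topic-word counts
    n_k = [0] * num_topics                                # tokens per topic
    n_dk = [[0] * num_topics for _ in docs]               # doc-topic counts
    prior_sum = [sum(row) for row in prior]

    # Random initial topic assignment for every token.
    z = []
    for d, doc in enumerate(docs):
        zd = []
        for w in doc:
            k = rng.randrange(num_topics)
            zd.append(k)
            n_kw[k][w] += 1
            n_k[k] += 1
            n_dk[d][k] += 1
        z.append(zd)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = z[d][i]
                n_kw[k][w] -= 1
                n_k[k] -= 1
                n_dk[d][k] -= 1
                # Conditional probability of each topic for this token.
                weights = [
                    (n_dk[d][t] + alpha)
                    * (n_kw[t][w] + prior[t][w])
                    / (n_k[t] + prior_sum[t])
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                z[d][i] = k
                n_kw[k][w] += 1
                n_k[k] += 1
                n_dk[d][k] += 1
    return n_kw


def segmented_lda(segments, vocab_size, num_topics, beta=0.01, carry=0.5, **kw):
    """Run LDA on time-ordered segments, carrying topics forward.

    ASSUMPTION: propagation is modeled by adding `carry` times the previous
    segment's topic-word counts to a symmetric base prior `beta`.
    """
    prior = [[beta] * vocab_size for _ in range(num_topics)]
    history = []
    for docs in segments:
        n_kw = gibbs_lda(docs, vocab_size, num_topics, prior, **kw)
        history.append(n_kw)
        prior = [[beta + carry * c for c in row] for row in n_kw]
    return history


# Two toy time segments over a four-word vocabulary.
segments = [
    [[0, 1, 0, 1], [2, 3, 2, 3]],
    [[0, 1, 1], [2, 3, 3]],
]
history = segmented_lda(segments, vocab_size=4, num_topics=2, iters=20, seed=1)
```

Each entry of `history` holds the topic-word counts for one segment, so comparing the same topic index across entries gives a rough view of how that topic's vocabulary shifts over time. A larger `carry` ties later segments more tightly to earlier topics, while `carry=0` reduces the procedure to independent LDA runs per segment.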


    • Published in

      JCDL '09: Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries
      June 2009
      502 pages
      ISBN:9781605583228
      DOI:10.1145/1555400

      Copyright © 2009 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States



      Acceptance Rates

      Overall Acceptance Rate: 415 of 1,482 submissions, 28%
