skip to main content
10.1145/1980422.1980455acmotherconferencesArticle/Chapter ViewAbstractPublication PagescomputeConference Proceedingsconference-collections
short-paper

A novel approach to keyphrase extraction using augmented transition networks and statistical tools

Published: 25 March 2011 Publication History

Abstract

We present a novel approach to extract keyphrases based on Augmented Transition Networks (abbreviated as ATNs) followed by statistical methods from any given article, notes on a particular subject, or any other document source. The use of ATNs has completely ruled out the need of background corpora in identifying the potential keywords and keyphrases. Moreover, the use of ATNs has greatly reduced the search space for the statistical methods. We have devised two new methods namely, relaxed statistical analysis and stringent statistical analysis to identify the separability of phrases into sub phrases. In this paper, the two tier process is discussed in detail and illustrated with examples. We have also discussed the applications of this process briefly.

References

[1]
Woods, William A (1970). Transition Network Grammars for Natural Language Analysis. Communications of the ACM 13 (10): 591--606.
[2]
Eibe Frank, Gordon W. Paynter, Ian H. Witten, Carl Gutwin, and Craig G. Nevill-Manning. 1999. Domain-specific keyphrase extraction. In IJCAI, pages 668--673.
[3]
Peter D. Turney. 2000. Learning algorithms for keyphrase extraction. Information Retrieval, 2(4):303--336.
[4]
Alan L. Tharp. Augmented Transition Networks As A Design Tool For Personalized Database Systems. Computer Science Department, North Carolina State University, Raleigh, North Carolina.
[5]
Fei Liu, Feifan Liu, and Yang Liu. A Supervised Framework for Keyword Extraction From Meeting Transcripts. IEEE transactions on Audio, Speech, and Language Processing.
[6]
Bilkent University, Department of Computer Science, Turkey. Yasin Uzun. http://www.cs.bilkent.edu.tr/~guvenir/courses/cs550/Workshop/Yasin_Uzun.pdf
[7]
MSc. Dipl.-Inf. Elena Demidova, Erklärung http://www.l3s.de/~demidova/students/thesis_oelze.pdf
[8]
Kristina Toutanova and Christopher D. Manning. 2000. Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP/VLC-2000), pp. 63--70.
[9]
Kristina Toutanova, Dan Klein, Christopher Manning, and Yoram Singer. 2003. Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network. In Proceedings of HLT-NAACL 2003, pp. 252--259.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
COMPUTE '11: Proceedings of the Fourth Annual ACM Bangalore Conference
March 2011
194 pages
ISBN:9781450307505
DOI:10.1145/1980422
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • ACM Bangalore chapter

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 March 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. augmented transition networks
  2. keyphrase extraction
  3. keyphrase mining
  4. statistical analysis
  5. text mining
  6. word collocation

Qualifiers

  • Short-paper

Conference

Compute'11
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 103
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media