DOI: 10.1145/355214.355249

Research on extracting subject from Chinese text (poster session)

Published: 1 November 2000

ABSTRACT

Because natural language is flexible and diverse, extracting the subject of a text is one of the most difficult but important tasks in natural language processing (NLP). Owing to the distinctive linguistic and grammatical structure of Chinese, we can currently adopt only non-semantic approaches to extract the subject from Chinese text. This paper presents three such approaches: the first is based on a component-word dictionary, the second on a subject-word dictionary, and the third on a statistical method. We describe the process of each approach. To test them, we developed three independent systems and designed a comparison experiment. The experimental results are instructive: every system can extract a text's subject to some extent, but the approaches may need to be combined to obtain a better one.
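
The abstract does not describe how any of the three systems works internally, so the following is only a minimal sketch of what a subject-word dictionary lookup could look like, not the authors' method. The dictionary entries, the subject labels, and the function name `extract_subject` are all hypothetical. Matching is done by substring counting, an assumption chosen here because Chinese text has no whitespace between words.

```python
from collections import Counter

# Hypothetical subject-word dictionary: each word maps to a subject label.
# These entries are illustrative assumptions, not data from the paper.
SUBJECT_WORDS = {
    "股票": "finance",
    "银行": "finance",
    "比赛": "sports",
    "球队": "sports",
}


def extract_subject(text, dictionary=SUBJECT_WORDS):
    """Return the subject whose dictionary words occur most often in text.

    Returns None when no dictionary word appears in the text.
    """
    scores = Counter()
    for word, subject in dictionary.items():
        # Substring counting sidesteps word segmentation entirely.
        scores[subject] += text.count(word)
    if not scores or max(scores.values()) == 0:
        return None
    return scores.most_common(1)[0][0]
```

For example, `extract_subject("这支球队赢得了比赛")` matches two sports words and no finance words. A purely lexical lookup like this fails on any text whose vocabulary is missing from the dictionary, which is consistent with the abstract's remark that the approaches may need to be combined.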

Published in

IRAL '00: Proceedings of the fifth international workshop on Information retrieval with Asian languages
November 2000
220 pages
ISBN: 1581133006
DOI: 10.1145/355214
Chairmen: Kam-Fai Wong, Dik L. Lee, Jong-Hyeok Lee

          Copyright © 2000 ACM

Publisher: Association for Computing Machinery, New York, NY, United States
