DOI: 10.1145/3011141.3011205
short-paper

GeTCo: an ontology-based approach for patent classification search

Published: 28 November 2016

ABSTRACT

The main contribution of this paper is a method for creating a Graph-Embedded-Tree-based ontology that uses domain knowledge from a patent classification scheme to support patent classification. Our contribution is twofold. First, we propose a novel definition of the GeTCo ontology, which consists of four types of concept: Class, Document, Phrase, and Term. For each pair of concepts, we further define semantic information based on their relationship, giving our classifier better reasoning capability whenever semantic ambiguity occurs. Second, we propose a novel method for constructing the ontology from the United States Patent Classification (USPC) scheme without relying on rule-based concept extraction, which avoids the intensive manual effort of traditional ontology construction. We developed a prototype application on top of the Rocchio classifier, called the GeTCo-enabled Rocchio classifier, to evaluate the proposed ontology. Our experiments on a filtered set of 9,703 single-class patents showed that the GeTCo-enabled Rocchio classifier, backed by our proposed directed-graph ontology, yields a higher F1-score (+7%) than the original Rocchio classifier without GeTCo support.


    • Published in

      iiWAS '16: Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services
      November 2016
      528 pages
      ISBN: 9781450348072
      DOI: 10.1145/3011141

      Copyright © 2016 ACM


      Publisher

      Association for Computing Machinery

      New York, NY, United States
