skip to main content
research-article

Cognitive canonicalization of natural language queries using semantic strata

Published:03 January 2014Publication History
Skip Abstract Section

Abstract

Natural language search relies strongly on perceiving semantics in a query sentence. Semantics is captured by the relationship among the query words, represented as a network (graph). Such a network of words can be fed into larger ontologies, like DBpedia or Google Knowledge Graph, where they appear as subgraphs— fashioning the name subnetworks (subnets). Thus, subnet is a canonical form for interfacing a natural language query to a graph database and is an integral step for graph-based searching. In this article, we present a novel standalone NLP technique that leverages the cognitive psychology notion of semantic strata for semantic subnetwork extraction from natural language queries. The cognitive model describes some of the fundamental structures employed by the human cognition to construct semantic information in the brain, called semantic strata. We propose a computational model based on conditional random fields to capture the cognitive abstraction provided by semantic strata, facilitating cognitive canonicalization of the query. Our results, conducted on approximately 5000 queries, suggest that the cognitive canonicals based on semantic strata are capable of significantly improving parsing and role labeling performance beyond pure lexical approaches, such as parts-of-speech based techniques. We also find that cognitive canonicalized subnets are more semantically coherent compared to syntax trees when explored in graph ontologies like DBpedia and improve ranking of retrieved documents.

References

  1. Allemang, D. and Hendler, J. 2011. Semantic Web for the Working Ontologist: Effective Modeling in RDFS and OWL. Morgan Kaufmann, San Fransisco, LA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., and Ives, Z. 2007. DBpedia: A nucleus for a web of open data. In 6th International Semantic Web Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Booth, J., Di Eugenio, B., Cruz, I. F., and Wolfson, O. 2009. Query sentences as semantic (sub) networks. In Proceedings of the IEEE International Conference on Semantic Computing (ICSC'09). 89--94. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Broccolo, D., Marcon, L., Nardini, F. M., Perego, R., and Silvestri, F. 2012. Generating suggestions for queries in the long tail with an inverted index. Inf. Process. Manag. 48, 2, 326--339. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Bruce, R. and Wiebe, J. 1994. Word-sense disambiguation using decomposable models. In Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics (ACL'94). 139--146. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Cardona, G., Rossello, F., and Valiente, G. 2009. Comparison of tree-child phylogenetic networks. IEEE/ACM Trans. Comput. Biol. Bioinf. 6, 4, 552--569. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Carpineto, C. and Romano, G. 2012. A survey of automatic query expansion in information retrieval. ACM Comput. Surv. 44, 1. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Carroll, J. B. 1993. Human Cognitive Abilities. Cambridge University Press.Google ScholarGoogle Scholar
  9. Clarke, C. A., Craswell, N., Soboroff, I., and Voorhees, E. M. 2011. Overview of the trec 2011 web track. In Proceedings of the 20th Text Retrieval Conference.Google ScholarGoogle Scholar
  10. Chater, N., Tenenbaum, J. B., and Yuille, A. 2006. Probabilistic models of cognition: Conceptual foundations. Trends Cogn. Sci. 10, 7, 287--291.Google ScholarGoogle ScholarCross RefCross Ref
  11. Clifford, J. 1990. Formal Semantics and Pragmatics for Natural Language Querying: Cambridge Tracks in Theoretical Computer Science. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., and KUKSA, P. 2011. Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493--2537. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Coursey, K. and Mihalcea, R. 2009. Topic identification using wikipedia graph centrality. In Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers (NAACL-Short'09). 117--120. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Croce, D., Giannone, C., Annesi, P., and Basili, R. 2010. Towards open-domain semantic role labeling. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. 237--246. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Croft, W. and Cruse, D. A. 2004. Cognitive Linguistics. Cognitive Canonicalization of Natural Language Queries using Semantic Strata. Cambridge University Press.Google ScholarGoogle Scholar
  16. Cucerzan, S. 2007. Large-scale named entity disambiguation based on wikipedia data. In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 708--716.Google ScholarGoogle Scholar
  17. Cui, H., Wen, J.-R., Nie, J.-Y., and Ma, W.-Y. 2002. Probabilistic query expansion using query logs. In Proceedings of the 11th International Conference on World Wide Web (WWW'02). ACM Press, New York, 325--332. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Dangalchev, C. 2006. Residual closeness in networks. Phisica A 365, 2, 556--564.Google ScholarGoogle ScholarCross RefCross Ref
  19. Finkel, J. R. and Manning, C. D. 2009. Joint parsing and named entity recognition. In Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL'09). 326--334. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Gabrilovich, E. and Markovitch, S. 2007. Computing semantic relatedness using wikipedia-based explicit semantic analysis. In Proceedings of the International Joint Conference on Artificial Intelligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Guilford, J. P. 1977. Way Beyond the IQ. Creative Education Foundation.Google ScholarGoogle Scholar
  22. Guo, J., Xu, G., Cheng, X., and Li, H. 2009. Named entity recognition in query. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'09). ACM Press, New York, 267--274. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Gurevych, I., Malaka, R., Porzel, R., and Zorn, H.-P. 2003. Semantic coherence scoring using an ontology. In Proceedings of Human Language Technology: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL'03). Vol. 1. 9--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Hatcher, E., Gospodnetic, O., and Mccandless, M. 2004. Lucene in Action. Manning Publications Company. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Herdagdelen, A., Ciaramita, M., Mahler, D., Holmqvist, M., Hall, K., and Riezler, S. 2010. Generalized syntactic and semantic models of query reformulation. In Proceedings of the 33rd ACM SIGIR International Conference on Research and Development in Information Retrieval (SIGIR'10). 283--290. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Hu, J. J., Wang, G., Lochovsky, F., Sun, J.-T., and Chen, Z. 2009. Understanding user's query intent with wikipedia. In Proceedings of the 18th International Conference on World Wide Web (WWW'09). ACM Press, New York, 471--480. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Huang, M. and Haralick, R. M. 2009. Identifying patterns in texts. In Proceedings of the IEEE International Conference on Semantic Computing (ICSC'09). 59--64. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Kaufmann, E., Bernstein, A., and Fisher, L. 2007. NLP-reduce: A “naïve” but domain-independent natural language interface for querying ontologies. In Proceedings of the 4th European Semantic Web Conference (ESWC'07).Google ScholarGoogle Scholar
  29. Korf, R. E. 1985. Depth-first iterative-deepening: An optimal admissible tree search. Artif. Intell. 27, 1, 97--109. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Lafferty, J. D., McCallum, A., and Pereira, F. C. N. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the 18th International Conference on Machine Learning (ICML'01). 282--289. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Manning, C. D. and Schuetze, H. 1999. Foundations of Statistical Natural Language Processing. MIT Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. McCallum, A., and Li, W. 2003. Early results for named entity recognition with conditional random fields feature induction and web-enhanced lexicons. In Proceedings of the 7th Conference on Natural Language Learning at HLT-NAACL. Vol. 4, Association for Computational Linguistics, 188--191. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. McCallum, A., Freitag, D., and Pereira, F. C. N. 2000. Maximum entropy markov models for information extraction and segmentation. In Proceedings of the 17th International Conference on Machine Learning (ICML'00). 591--598. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. McClosky, D., Surdeanu, M., and Manning, C. D. 2011. Event extraction as dependency parsing. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (HLT'11). Vol. 1, 1626--1635. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. MSQA. 2008. http://research.microsoft.com/en-us/downloads/88c0021c-328a-4148-a158-a42d7331c6cf/.Google ScholarGoogle Scholar
  36. Navigli, R. and Lapata, M. 2010. An experimental study of graph connectivity for unsupervised word sense disambiguation. IEEE Trans. Pattern Anal. Mach. Intell. 32, 4, 678--692. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Opsahl, T., Agneessens, F., and Skvoretz, J. 2010. Node centrality in weighted networks: Generalizing degree and shortest paths. Social Netw. 32, 3, 245--251.Google ScholarGoogle ScholarCross RefCross Ref
  38. Popescu, A.-M., Etzioni, O., and Kautz, H. 2003. Towards a theory of natural language interfaces to databases. In Proceedings of the 8th International Conference on Intelligent User Interfaces (IUI'03). ACM Press, New York, 149--157. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Punyakanok, V., Roth, D., and Yih, W. 2008. The importance of syntactic parsing and inference in semantic role labeling. Comput. Linguist. 34, 2, 257--287. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Richardson, M. and Domingos, P. 2001. The intelligent surfer: Probabilistic combination of link and content information in pagerank. In Proceedings of the Conference on Advances in Neural Information Processing Systems.Google ScholarGoogle Scholar
  41. Rinaldi, A. M. 2009. An ontology-driven approach for semantic information retrieval on the web. ACM Trans. Internet Technol. 9, 3, 10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Robertson, N. and Seymour, P. D. 1985. Graph minors—A survey. Surv. Combinatorics 103, 153--171.Google ScholarGoogle Scholar
  43. Roy, S. D. and Zeng, W. 2012. A computational cognitive model for semantics sub-network extraction from natural language queries. In Proceedings of the International Conference on Computational Linguistics (COLING'12).Google ScholarGoogle Scholar
  44. Sha, F. and Pereira, F. 2003. Shallow parsing with conditional random fields. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (NAACL'03). Vol. 1. 134--141. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Steyvers, M. and Tenenbaum, J. B. 2010. The large-scale structure of semantic networks: Statistical analyses and a model of semantic growth. Cogn. Sci. 29, 1, 41--78.Google ScholarGoogle ScholarCross RefCross Ref
  46. Traugott, E. C. 2010. From polysemy to semantic change. Linguist. Typol. 14, 2--3, 292--299.Google ScholarGoogle Scholar
  47. Wang, Y., Wang, L., Li, Y., He, D., Chen, W., and Liu, T.-Y. 2013. A theoretical analysis of ndcg ranking measures. In Proceedings of the 26th Annual Conference on Learning Theory.Google ScholarGoogle Scholar
  48. Ye, N., Li, W. S., Chieu, H. L., and Wu, D. 2009. Conditional random fields with high-order features for sequence labeling. In Advances in Neural Information Processing Systems 22, 2196--2204.Google ScholarGoogle Scholar
  49. Zadeh, L. A. 1998. Some reflections on soft computing, granular computing and their roles in the conception, design and utilization of information/intelligent systems. Soft Comput. 2, 1, 23--25.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Cognitive canonicalization of natural language queries using semantic strata

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Speech and Language Processing
        ACM Transactions on Speech and Language Processing   Volume 10, Issue 4
        December 2013
        206 pages
        ISSN:1550-4875
        EISSN:1550-4883
        DOI:10.1145/2560566
        Issue’s Table of Contents

        Copyright © 2014 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 3 January 2014
        • Revised: 1 November 2013
        • Accepted: 1 November 2013
        • Received: 1 April 2013
        Published in tslp Volume 10, Issue 4

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader