ABSTRACT
We describe an approach to retrieval of documents that contain of both free text and semantically enriched markup. In particular, we present the design and implementation prototype of a framework in which both documents and queries can be marked up with statements in the DAML+OIL semantic web language. These statements provide both structured and semi-structured information about the documents and their content. We claim that indexing text and semantic markup together will significantly improve retrieval performance. Our approach allows inferencing to be done over this information at several points: when a document is indexed, when a query is processed and when query results are evaluated.
- S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. Wiener. The lorel query language for semistructured data. International Journal on Digital Libraries 1, pages 68--88, April 1997.]]Google ScholarCross Ref
- G. Arocena and A. Mendelzon. Weboql: Restructuring documents, databases and webs. In International Conference on Data Engineering, pages 24--33. IEEE Computer Society, 1998.]] Google ScholarDigital Library
- Askjeeves. http://www.askjeeves.com.]]Google Scholar
- Z. Bar-Yossef, Y. Kanza, Y. Kogan, W. Nutt, and Y. Sagiv. Quest: Querying semantically tagged documents on the world wide web. In In Proc. of the 4th Workshop on Next Generation Information Technologies and Systems, volume NGITS'99, Zikhron-Yaakov(Isreal), July 1999.]] Google ScholarDigital Library
- T. Berners-Lee and M. Fischetti. Weaving the web: The original design and ultimate destiny of the World Wide Web by its inventor. Harper, San Francisco.]] Google ScholarDigital Library
- T. Berners-Lee, J. Hendler, and O. Lassila. The Semantic Web. Scientific American, May 2001.]]Google ScholarCross Ref
- T. Bray, J. Paoli, and C. Sperberg-McQueen. Extensible markup language (xml). W3C (Worldwide Web Consortium), 1998. http://www.w3.org/TR/1998/REC-xml-19980210.html.]]Google Scholar
- T. Chinenyanga and N. Kushmerick. Elixir: An expressive and efficient language for xml information retrieval. In SIGIR Workshop on XML and Information Retrieval, 2001.]]Google Scholar
- R. Cost. Wondir, word or n-gram based dynamic information retrieval engine.]]Google Scholar
- R. Cost, T. Finin, A. Joshi, Y. Peng, C. Nicholas, H. Chen, L. Kagal, F. Perich, Y. Zou, and S. Tolia. ITTALKS: A Case Study in the Semantic Web and DAML. In International Semantic Web Working Symposium (SWWS), July 2001.]]Google Scholar
- Darpa agent markup language, 2001. http://www.daml.org.]]Google Scholar
- DAML+OIL Design Rationale, 2001. www.cs.man.ac.uk/horrocks/Slides/index.html.]]Google Scholar
- A. Deutsch, M. Fernandez, D. Florescu, A. Levy, and D. Suciu. Xml-ql: A query language for xml. In In Proc. 8th Int. World Wide Web Conference, 1999.]] Google ScholarDigital Library
- D. Egnor and R. Lord. Structured information retrieval using xml. XYZFind Corporation, Washington, USA.]]Google Scholar
- C. Forgy. Rete: A fast algorithm for the many object pattern match problem. Artificial Intelligence, 19:17--37, 1982.]]Google ScholarDigital Library
- E. Friedman-Hill. Jess, the java expert system shell, 2000. http:/herzberg.ca.sandia.gov/jess/.]]Google Scholar
- N. Fuhr and K. Grojohann. Xirql: An extension of xql for information retireval. In SIGIR Workshop on XML and Information Retrieval, 2000.]] Google ScholarDigital Library
- J. Heflin, J. Hendler, and S. Luke. Shoe: A prototype language for the semantic webs. Linkvping Electronic Articles in Computer and Information Science, 6 2001. http://www.ep.liu.se/ea/cis/1997/013/.]]Google Scholar
- B. Katz. From sentence processing to information access on the world wide web. In Natural Language Processing for the World Wide Web, pages 77--94, 1997. Papers from the 1997 AAAI Spring Symposium.]]Google Scholar
- J. Kopena. http://plan.mcs.drexel.edu/projects/legorobots/design/software/damljesskb/.]]Google Scholar
- C. Kwok, O. Etzioni, and D. Weld. Scaling question answering to the web. In Proceedings of WWW10, Hong Kong, 2001.]] Google ScholarDigital Library
- O. Lassila and S. R. R. (eds). Resource description framework (rdf) model and syntax specification. W3C Recommendation, February 1999. http://www.w3.org/TR/1999/REC-rdf-syntax-19990222/.]]Google Scholar
- P. Martin and P. Eklund. Embedding knowledge in web documents. In Proceedings of World Wide Web Conference (WWW8), Toronto, Canada, 1999.]] Google ScholarDigital Library
- J. Mayfield, P. McNamee, and C. Piatko. The jhu/apl haircut system at trec-8. The Eighth Text Retrieval Conference (TREC-8), pages 445--452, November 1999.]]Google Scholar
- W. V. Quine. Naming, Necessity and Natural Kinds, chapter Natural Kinds. University Press, 1977.]]Google Scholar
- Resource Description Framework (RDF) Model and Syntax Specification, February 1999. www.w3.org/tr/rec-rdf-syntax.]]Google Scholar
- M. Sintek and S. Decker. Triple-an rdf query, inference, and transformation language. DDLP, October 2001. Japan.]]Google Scholar
- S. Staab, M. Erdmann, A. Maedche, and S. Decker. An extensible approach for modeling ontologies in rdf(s). Technical Report 401, AIFB, University of Karlsruhe, March 2000.]]Google Scholar
Information retrieval on the semantic web
Recommendations
Semantic web reasoners and languages
Semantic web reasoners and languages enable the semantic web to function. Some of the latest reasoning models developed in the last few years are: DLP, FaCT, RACER, Pellet, MSPASS, CEL, Cerebra Engine, QuOnto, KAON2, HermiT and others. Some software ...
RDF, Jena, SparQL and the 'Semantic Web'
SIGUCCS '09: Proceedings of the 37th annual ACM SIGUCCS fall conference: communication and collaborationThe Resource Description Format (RDF) is used to represent information modeled as a "graph": a set of individual objects, along with a set of connections among those objects. In that role, RDF is one of the pillars of the so-called Semantic Web. This ...
A semantic retrieval of web documents using domain ontology
The Semantic Web vision offers the potential to express queries in a more semantic way. However, the unstructured nature of existing web documents, which lack semantics, proves to be a difficult task for such a query. To support this, the semantic ...
Comments