ABSTRACT
Recent work on information integration has yielded novel and efficient solutions for gathering data from the World Wide Web. However, little attention has been given to the problem of providing information management capabilities that closely model how people interact with the web in productive ways: not only collecting information, but also monitoring web sites for new or updated data, sending notifications based on the results, building reports, creating local repositories of information, and so on. These needs stem from the dynamic nature of information in a networked environment. In this paper, we describe Theseus, an efficient plan execution system for information management agents. Through its plan language, Theseus supports a number of capabilities that enable practical information management, including repeated and periodic query execution, conditional plan declarations, query result aggregation, and flexible communication of results. The Theseus executor focuses on efficiency, with support for data pipelining and dataflow-based, event-driven parallel execution. With Theseus, users can automate the complex but practical ways in which they interact with the web, for both information gathering and information management.
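To make the ideas of pipelined operators and repeated monitoring queries concrete, the following is a minimal sketch in Python. It is not the Theseus plan language or executor, which this abstract does not specify. The names `select`, `new_since`, `run_once`, and `monitor`, the row fields `id` and `price`, and the price threshold are hypothetical and chosen only for illustration. Each operator is a generator, so a row flows to the next operator as soon as it is produced, and repeated runs share state so that only new items trigger a notification.

```python
from typing import Callable, Dict, Iterable, Iterator, List, Set

Row = Dict[str, int]  # hypothetical row shape: {"id": ..., "price": ...}


def select(rows: Iterable[Row], predicate: Callable[[Row], bool]) -> Iterator[Row]:
    """Pipelined filter: passes each matching row on as soon as it arrives."""
    for row in rows:
        if predicate(row):
            yield row


def new_since(rows: Iterable[Row], seen: Set[int], key: str) -> Iterator[Row]:
    """Yield only rows whose key was not seen in an earlier run, and record them."""
    for row in rows:
        if row[key] not in seen:
            seen.add(row[key])
            yield row


def run_once(fetch: Callable[[], Iterable[Row]],
             seen: Set[int],
             notify: Callable[[Row], None]) -> int:
    """Execute one pass of the plan: fetch -> select -> new_since -> notify."""
    cheap = select(fetch(), lambda r: r["price"] < 100)  # assumed threshold
    fresh = new_since(cheap, seen, "id")
    count = 0
    for row in fresh:  # rows are pulled through the whole pipeline one at a time
        notify(row)
        count += 1
    return count


def monitor(fetch: Callable[[], Iterable[Row]],
            notify: Callable[[Row], None],
            rounds: int) -> List[int]:
    """Repeat the plan a fixed number of times, remembering what was already reported."""
    seen: Set[int] = set()
    return [run_once(fetch, seen, notify) for _ in range(rounds)]
```

In a real agent, `fetch` would wrap a web source and the rounds would be spaced by a timer. Here both are left to the caller so that the sketch stays self-contained and deterministic.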