ABSTRACT
An abundance of geospatial information is flourishing on the Internet but mining and disseminating these data is a daunting task. With anything published on the web available to the public it has become a grand repository of volunteered geographic information (VGI). Internet users often provide location information for videos, pictures, travel destinations, or other events. All of these data can be gathered by a web crawling geospatial agent that later performs geospatial data mining. The discovered geoinformation can be stored, analyzed, queried, and visualized as the agent creates a data repository of what it discovered. This paper presents the design and prototypical implementation of the GEDMWA (Geospatial Exploratory Data Mining Web Agent). It reads webpage data and follows links to acquire knowledge in order to add value to geoinformation usable in a GIS. The agent creates a database of webpage text, mines it for location information, and then converts it to proper geospatial data format. The data is quickly visualized and analyzed after GEDMWA converts it into proper GIS and virtual globe formats. This provides diverse user communities a tool that utilizes a variety of distributed sources to discover additional knowledge about their fields of interest.
- Goodchild, M. 2007. Citizens as sensors: the world of volunteered geography. GeoJournal 69, 211--221.Google ScholarCross Ref
- Grossner, K., Clarke, K. and Goodchild, M. 2008. Defining a Digital Earth System. Transactions in GIS 12, 145--160.Google ScholarCross Ref
- Shi, W., Fisher, P. and Goodchild, M. (Eds.) 2002. Spatial Data Quality. Taylor & Francis, New York.Google Scholar
- Larson, K. and Craig, D. 2006. Digiscoping vouchers for diet studies in bill load holding birds. Waterbirds 11, 110--112.Google Scholar
- Russell, S. and Norvig, P. 2002. Artificial Intelligence: A Modern Approach. Prentice Hall. Google ScholarDigital Library
- Wooldridge, M. 1999. Intelligent Agents. In Multiagent Systems - A Modern Approach to Distributed Artificial Intelligence, G. Weiss Ed. MIT Press, Cambridge, MA, 27--77. Google ScholarDigital Library
- Batty, M., Desyllas, J. and Duxbury, E. 2003. The discrete dynamics of small-scale spatial events: agent-based models of mobility in carnivals and street parades. International Journal of Geographical Information Science 17, 673--697.Google ScholarCross Ref
- Benenson, I. and Torrens, P. 2004. Geosimulation - Automata-based modeling of urban phenomena. Wiley, Chicester, England.Google Scholar
- Sengupta, R. and Sieber, R. 2007. Geospatial Agents, Agents Everywhere.. Transactions in GIS 11, 483--506.Google ScholarCross Ref
- Goodchild, M. 2007. Citizens as voluntary sensors: spatial data infrastructure in the world of Web 2.0. International Journal of Spatial Data Infrastructures Research 2, 24--32.Google Scholar
- Jones, C., Purves, R., Clough, P. and Joho, H. 2008. Modelling Vague Places with Knowledge from the Web. International Journal of Geographical Information Science, in press. Google ScholarDigital Library
- Han, J. and Kamber, M. 2006. Data Mining: Concepts and Techniques, Second Edition. Morgan Kaufmann, San Francisco. Google ScholarDigital Library
- Miller, H. 2007. Geographic data mining and knowledge discovery. In Handbook of Geographic Information Science, J. Wilson and A. Fotheringham Eds. Blackwell.Google Scholar
- Chang, G., Healey, M., McHugh, J. and Wang, J. 2001. Mining the World Wide Web: An Information Search Approach. Kluwer, Norwell, MA. Google ScholarDigital Library
- Frank, A. 2003. Pragmatic Information Content - How to Measure the Information in a Route Description. In Perspectives on Geographic Information Science, M. Goodchild, M. Duckham and M. Worboys Eds. Taylor & Francis, London.Google Scholar
Index Terms
- GEDMWA: geospatial exploratory data mining web agent
Recommendations
Discovering geographic knowledge in data rich environments: a report on a specialist meeting
On 18--20 March 1999, a Specialist Meeting on "Discovering geographic knowledge in data-rich environments" was convened under the auspices of the Varenius Project of the National Center for Geographic Information and Analysis (NCGIA). This, workshop ...
Towards Big GeoData Mining and Processing
The proliferation of advanced technologies and the rapid adoption of connected devices and mobiles were behind the tremendous growth of geographic data introducing the concept of Big GeoData. The extraction of useful patterns is an interesting process, ...
A Simple Tags Categorization Framework Using Spatial Coverage to Discover Geospatial Semantics
WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide WebThere exist many popular crowdsourcing and social services (Volunteered Geographic Information (VGI)) to share information and documents such as Flickr, Foursquare, Twitter , Facebook, etc. They all use metadata, folksonomy and more importantly a ...
Comments