|
ABSTRACT
Using visualization techniques to assist conventional data mining tasks has attracted considerable interest in recent years. This paper addresses a challenging issue in the use of visualization for data mining: choosing appropriate parameters for spatial data cleaning methods. On one hand, algorithm performance is improved through visualization. On the other hand, characteristics and properties of methods and features of data are visualized as feedbacks to the user. A 3-D visualization model, called Waterfall, is proposed to assist spatial data cleaning in four important aspects: dimension-independent data visualization, visualization of data quality, algorithm parameter selection, and measurement of noise removing methods on parameter sensitiveness.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Mihael Ankerst , Markus M. Breunig , Hans-Peter Kriegel , Jörg Sander, OPTICS: ordering points to identify the clustering structure, Proceedings of the 1999 ACM SIGMOD international conference on Management of data, p.49-60, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
| |
2
|
|
| |
3
|
Ertoz, L., Steinbach, M., and Kumar, V., Finding clusters of different sizes, shapes, and densities in noisy, high dimensional data, In Proc. of SIAM DM'03.
|
| |
4
|
Ester, M., Kriegel, H. P., Sander, J., and Xu, X., A density-based algorithm for discovering clusters in large spatial databases with noise, in Proc. of 2nd Int. Conf. on Knowledge Discovery and Data Mining (KDD-96), AAAI Press, 1996, pp. 226--231.
|
| |
5
|
|
| |
6
|
|
 |
7
|
|
| |
8
|
Han, J., Kamber, M., and Tung, A. K. H., Spatial clustering methods in data mining: A survey, H. Miller and J. Han (eds.), Geographic Data Mining and Knowledge Discovery, Taylor and Francis, 2001.
|
 |
9
|
|
| |
10
|
Hinneburg, A., and Keim, D. A., An efficient approach to clustering in large multimedia databases with noise. In Proc. 1998 Int. Conf. Knowledge Discovery and Data Mining, pp. 58--65.
|
| |
11
|
|
 |
12
|
|
 |
13
|
|
| |
14
|
Seidman, S. B., Network structure and minimum degree, Social Networks, 5, 1983, pp. 269--287.
|
| |
15
|
Shekhar, S., Schrater, P. R., Vatsavai, R. R., Wu, W., and Chawla, S., Spatial contextual classification and prediction models for mining geospatial data, IEEE Trans. on Multimedia, Vol. 4, No. 2, pp. 174--188.
|
| |
16
|
Ward, M. O. and Zheng, J., Visualization of spatio-temporal data quality, in Proc. of GIS/LIS, 1993, pp. 727--737.
|
 |
17
|
Tian Zhang , Raghu Ramakrishnan , Miron Livny, BIRCH: an efficient data clustering method for very large databases, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.103-114, June 04-06, 1996, Montreal, Quebec, Canada
|
|