ACM Home Page
Please provide us with feedback. Feedback
The role of visualization in effective data cleaning
Full text PdfPdf (325 KB)
Source Symposium on Applied Computing archive
Proceedings of the 2005 ACM symposium on Applied computing table of contents
Santa Fe, New Mexico
SESSION: Multimedia and visualization (MV) table of contents
Pages: 1239 - 1243  
Year of Publication: 2005
ISBN:1-58113-964-0
Authors
Yu Qian  The University of Texas at Dallas, Richardson, TX
Kang Zhang  The University of Texas at Dallas, Richardson, TX
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 119,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1066677.1066955
What is a DOI?

ABSTRACT

Using visualization techniques to assist conventional data mining tasks has attracted considerable interest in recent years. This paper addresses a challenging issue in the use of visualization for data mining: choosing appropriate parameters for spatial data cleaning methods. On one hand, algorithm performance is improved through visualization. On the other hand, characteristics and properties of methods and features of data are visualized as feedbacks to the user. A 3-D visualization model, called Waterfall, is proposed to assist spatial data cleaning in four important aspects: dimension-independent data visualization, visualization of data quality, algorithm parameter selection, and measurement of noise removing methods on parameter sensitiveness.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
Ertoz, L., Steinbach, M., and Kumar, V., Finding clusters of different sizes, shapes, and densities in noisy, high dimensional data, In Proc. of SIAM DM'03.
 
4
Ester, M., Kriegel, H. P., Sander, J., and Xu, X., A density-based algorithm for discovering clusters in large spatial databases with noise, in Proc. of 2nd Int. Conf. on Knowledge Discovery and Data Mining (KDD-96), AAAI Press, 1996, pp. 226--231.
 
5
 
6
7
 
8
Han, J., Kamber, M., and Tung, A. K. H., Spatial clustering methods in data mining: A survey, H. Miller and J. Han (eds.), Geographic Data Mining and Knowledge Discovery, Taylor and Francis, 2001.
9
 
10
Hinneburg, A., and Keim, D. A., An efficient approach to clustering in large multimedia databases with noise. In Proc. 1998 Int. Conf. Knowledge Discovery and Data Mining, pp. 58--65.
 
11
12
13
 
14
Seidman, S. B., Network structure and minimum degree, Social Networks, 5, 1983, pp. 269--287.
 
15
Shekhar, S., Schrater, P. R., Vatsavai, R. R., Wu, W., and Chawla, S., Spatial contextual classification and prediction models for mining geospatial data, IEEE Trans. on Multimedia, Vol. 4, No. 2, pp. 174--188.
 
16
Ward, M. O. and Zheng, J., Visualization of spatio-temporal data quality, in Proc. of GIS/LIS, 1993, pp. 727--737.
17