|
ABSTRACT
Data mining techniques frequently find a large number of patterns or rules, which make it very difficult for a human analyst to interpret the results and to find the truly interesting and actionable rules. Due to the subjective nature of "interestingness", human involvement in the analysis process is crucial. In this paper, we propose a novel visual data mining framework for the purpose of identifying actionable knowledge quickly and easily from discovered rules and data. This framework is called the Opportunity Map. It is inspired by some interesting ideas from Quality Engineering, in particular Quality Function Deployment (QFD) and the House of Quality. It associates summarized data or discovered rules with the application objective using an interactive matrix, which enables the user to quickly identify where the opportunities are. The proposed system can be used to visually analyze discovered rules, and other statistical properties of the data. The user can also interactively group actionable attributes and values, and see how they affect the targets of interest. Combined with drill-down and comparative analysis, the user can analyze rules and data at different levels of detail. The proposed visualization framework thus represents a systematic and yet flexible method of rule analysis. Applications of the system to large-scale data sets from our industrial partner have yielded promising results.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Adomavicius, G. and Tuzhilin, A. Discovery of actionable patterns in databases: the action hierarchy approach. KDD-97, 1997.
|
| |
2
|
|
 |
3
|
Roberto J. Bayardo, Jr. , Rakesh Agrawal, Mining the most interesting rules, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.145-154, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312219]
|
| |
4
|
Blanchard J, Guillet F, Briand H. Exploratory Visualization for Association Rule Rummaging. KDD-03 Workshop on Multimedia Data Mining (MDM-03), 2003.
|
| |
5
|
Bruzzese D., Davino C. Visual Post-Analysis of Association Rules. ECML/PKDD02 VDM Workshop, 2002.
|
| |
6
|
Cohen L. Quality Function Deployment. Prentice Hall, 1995.
|
 |
7
|
|
| |
8
|
Han, J., Fu, Y., Wang, W., Koperski, K. and Zaiane, O. DMQL: a data mining query language for relational databases. SIGMOD Workshop on DMKD, 1996.
|
 |
9
|
Heike Hofmann , Arno P. J. M. Siebes , Adalbert F. X. Wilhelm, Visualizing association rules with interactive mosaic plots, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, p.227-235, August 20-23, 2000, Boston, Massachusetts, United States
[doi> 10.1145/347090.347133]
|
| |
10
|
|
 |
11
|
|
| |
12
|
Jorge A., Pocas J., Azevedo P. Post-processing environment for browsing large sets of association rules. PKDD-02 VDM Workshop, 2002.
|
| |
13
|
|
| |
14
|
Liu, B., Hsu, W., and Chen, S. Using general impressions to analyze discovered classification rules. KDD-97, 1997.
|
| |
15
|
Liu B., Hsu W, Ma Y. Integrating Classification and Association Rule Mining. KDD-98, 1998.
|
 |
16
|
Bing Liu , Wynne Hsu , Yiming Ma, Mining association rules with multiple minimum supports, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.337-341, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312274]
|
 |
17
|
|
| |
18
|
Ma S., Hellerstein J. Ordering Categorical Data to Improve Visualization. INFOVIS-99, 1999.
|
| |
19
|
|
| |
20
|
Ong K-H, Ong K-L, Ng W-K, Lim E-P. CrystalClear: Active Visualization of Association Rules. ICDM-02 Workshop on Active Mining (AM-02), 2002.
|
| |
21
|
Padmanabhan, B. and Tuzhilin, A. A belief-driven method for discovering unexpected patterns. KDD-98, 1998.
|
| |
22
|
Piatesky-Shapiro, G., and Matheus, C. The interestingness of deviations. KDD-94, 1994.
|
| |
23
|
|
| |
24
|
Rasmussen M, Karypis G. gCLUTO - an interactive clustering, visualization, and analysis system. Technical Report # 04-021, University of Minnesota, 2004.
|
| |
25
|
Shah, D., Lakshmanan, L.V.S, Ramamritham, K., and Sudarshan, S., 1999. Interestingness and Pruning of Mined Patterns. DMKD-99, 1999.
|
| |
26
|
|
| |
27
|
Suzuki, E. Autonomous discovery of reliable exception rules. KDD-97, 1997.
|
| |
28
|
Terninko J. Step-by-Step QFD: Customer-Driven Product Design. Saint Lucie Press. 1997.
|
 |
29
|
|
 |
30
|
|
| |
31
|
|
| |
32
|
|
| |
33
|
Zhao K. Liu B. Visual Analysis of the Behavior of Discovered Rules. KDD-2001 Workshop on Visual DM.
|
 |
34
|
Kaidi Zhao , Bing Liu , Thomas M. Tirpak , Andreas Schaller, V-Miner: using enhanced parallel coordinates to mine product design and test data, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
[doi> 10.1145/1014052.1014108]
|
|