ABSTRACT
The tremendous growth in volume of web usage data results in the boost of web mining research with focus on discovering potentially useful knowledge from web usage data.
This paper presents a new web usage mining process for finding sequential patterns in web usage data which can be used for predicting the possible next move in browsing sessions for web personalization. This process consists of three main stages: preprocessing web access sequences from the web server log, mining preprocessed web log access sequences by a tree-based algorithm, and predicting web access sequences by using a dynamic clustering-based model. It is designed based on the integration of the dynamic clustering-based Markov model with the Pre-Order Linked WAP-Tree Mining (PLWAP) algorithm to enhance mining performance. The proposed mining process is verified by experiments with promising results.
- Pierrakakos, D., Paliouras, G., Papatheodorou, C., and Spyropoulos, C. D. 2003. Web Usage Mining as a Tool for Personalization: A Survey. User Modelling and User-Adapted Interaction. 13, 4, 311--372. DOI=http://dx.doi.org/10.1023/A:1026238916441. Google ScholarDigital Library
- Chen, L., Bhowmick, S. S., and Li, J. 2006. COWES: Clustering Web Users Based on Historical Web Sessions. In Database Systems for Advanced Applications, Springer Berlin / Heidelberg, 541--556. DOI=10.1007/11733836 Google ScholarDigital Library
- Zhu, J., Hong, J., and Hughes, J. G. 2004. PageCluster: Mining Conceptual Link Hierarchies from Web Log Files for Adaptive Web Site Navigation. ACM Transactions on Internet Technology. 4, 185--208. DOI=http://doi.acm.org/10.1145/990301.990305. Google ScholarDigital Library
- Ezeife, C. I., and Lu, Y. 2005. Mining Web Log Sequential Patterns with Position Coded Pre-Order Linked WAP-Tree. Data Mining and Knowledge Discovery. 10, 1, 5--38. DOI=10.1007/s10618-005-0248-3. Google ScholarDigital Library
- Liu, Y., Huang, X., and An, A. 2007. Personalized Recommendation with Adaptive Mixture of Markov Models. The American Society for Information Science and Technology. 58, 12, 1851--1870. DOI=10.1002/asi.20631. Google ScholarDigital Library
- Khalil, F. 2008 Combining Web Data Mining Techniques for Web Page Access Prediction. Doctoral thesis. University of Southern Queensland.Google Scholar
- Borges, J., and Levene, M. 2004 A Dynamic Clustering-Based Markov Model for Web Usage Mining. Technical Report. Available online at http://xxx.arxiv.org/abs/cs.IR/0406032.Google Scholar
- Bhaumik, R., Burke, R., and Mobasher, B. 2007. Effectiveness of Crawling Attacks Against Web-based Recommender Systems. In: Proceedings of the 5th workshop on intelligent techniques for web personalization (ITWP-07)Google Scholar
- Mobasher, B., Burke, R., Bhaumik, R., and Williams, C. 2007. Toward Trustworthy Recommender Systems: An Analysis of Attack Models and Algorithm Robustness. ACM Transactions on Internet Technology. 7, 4. DOI=10.1145/1278366.1278372. Google ScholarDigital Library
- Jalali, M., Mustapha, N., Mamat, A., and Sulaiman, M. N. B. 2008. A New Clustering Approach based on Graph Partitioning for Navigation Patterns Mining. Proc. ICPR 2008. IEEE. pp. 1--4.Google Scholar
- Mobasher, B. 2007. Data Mining for Web Personalization. In The Adaptive Web, P. Brusilovsky, A. K., and W. Nejdl, Springer Berlin / Heidelberg, 90--135. DOI=10.1007/978-3-540-72079-9_3 Google ScholarDigital Library
Index Terms
- Efficient web usage mining process for sequential patterns
Recommendations
Web usage mining: discovery and applications of usage patterns from Web data
Web usage mining is the application of data mining techniques to discover usage patterns from Web data, in order to understand and better serve the needs of Web-based applications. Web usage mining consists of three phases, namely preprocessing, pattern ...
Mining Web Log Sequential Patterns with Position Coded Pre-Order Linked WAP-Tree
Sequential mining is the process of applying data mining techniques to a sequential database for the purposes of discovering the correlation relationships that exist among an ordered list of events. An important application of sequential mining ...
Effective database transformation and efficient support computation for mining sequential patterns
AbstractIn this paper, we propose a novel algorithm for mining frequent sequences from transaction databases. The transactions of the same customers form a set of customer sequences. A sequence (an ordered list of itemsets) is frequent if the number of ...
Comments