ABSTRACT
Finding Contiguous Sequential Patterns (CSPs) is an important problem in Web usage mining. In this paper we propose a new data structure, the UpDown Tree, for CSP mining. An UpDown Tree combines a suffix tree and a prefix tree to efficiently store all the sequences that contain a given item. The special structure of the UpDown Tree ensures efficient detection of CSPs. Experiments show that the UpDown Tree improves CSP mining in terms of both time and memory usage compared to previous approaches.
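To make the CSP mining task concrete, here is a minimal sketch of a naive baseline that enumerates every contiguous subsequence of each session and counts how many sessions contain it. This is not the UpDown Tree described in the abstract, which is not specified here. The function name `mine_csp`, the `min_support` and `max_len` parameters, and the sample sessions are all assumptions made for illustration.

```python
from collections import Counter


def mine_csp(sessions, min_support, max_len=None):
    """Naive baseline (NOT the UpDown Tree): return {pattern: support}
    for every contiguous pattern found in at least min_support sessions."""
    counts = Counter()
    for session in sessions:
        seen = set()  # count each pattern at most once per session
        n = len(session)
        for start in range(n):
            limit = n if max_len is None else min(n, start + max_len)
            for end in range(start + 1, limit + 1):
                seen.add(tuple(session[start:end]))
        counts.update(seen)
    return {p: c for p, c in counts.items() if c >= min_support}


# Hypothetical page-visit sessions, invented for this example.
sessions = [
    ["A", "B", "C"],
    ["A", "B", "D"],
    ["B", "C", "A", "B"],
]
patterns = mine_csp(sessions, min_support=2)
for pattern, support in sorted(patterns.items()):
    print(pattern, support)
```

Because this baseline materializes all contiguous subsequences, its cost grows quadratically with session length, which is the kind of overhead a dedicated index structure aims to avoid.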
Index Terms
- Mining contiguous sequential patterns from web logs