计算机科学
前缀
特里亚
集合(抽象数据类型)
约束(计算机辅助设计)
编码(内存)
序列模式挖掘
树(集合论)
序列(生物学)
数据挖掘
模式识别(心理学)
算法
序列数据库
人工智能
数学
数据结构
数学分析
哲学
语言学
遗传学
几何学
生物
程序设计语言
生物化学
化学
基因
作者
Thi-Thiet Pham,Thuy-Duong Thi Vu,Tai-Du Nguyen,Bao Huynh,Trang Van
标识
DOI:10.1142/s219688882350001x
摘要
The purpose of mining sequential patterns problem with weighted constraints is to find high-valued patterns, including infrequent patterns but having items which appear in the pattern of high importance in the sequence database (SD). Therefore, weighted sequential pattern mining will collect a set of more complete patterns with items of low support but of high importance. This paper proposes a new algorithm called WSPM_PreTree to find highly weighted sequential patterns. To collect a set of complete sequential patterns with the stricter weighted constraints of sequential patterns, the proposed algorithm uses both the minimum support constraint and the actual values of items appearing in the SD. To increase the performance of the finding weighted sequential patterns process, the algorithm uses the parent–child relationship on the prefix tree structure to create candidates and combines the weighted mean of the sequential 1-patterns that is calculated from the actual value of items in the SD as conditions to find the weighted sequential patterns. Experimental results show that the proposed algorithm is more efficient than sequential patterns mining with weight constraint (SPMW) algorithm [Ref. 20 ] in the runtime.
科研通智能强力驱动
Strongly Powered by AbleSci AI