Top-k Self-Adaptive Contrast Sequential Pattern Mining

可解释性 符号 判别式 计算机科学 序列(生物学) 选择(遗传算法) 财产(哲学) 人工智能 数据挖掘 对比度(视觉) 数学 模式识别(心理学) 算术 生物 遗传学 认识论 哲学
作者
Youxi Wu,Wang Yue-hua,Yan Li,Xingquan Zhu,Xindong Wu
出处
期刊:IEEE transactions on cybernetics [Institute of Electrical and Electronics Engineers]
卷期号:52 (11): 11819-11833 被引量:21
标识
DOI:10.1109/tcyb.2021.3082114
摘要

For sequence classification, an important issue is to find discriminative features, where sequential pattern mining (SPM) is often used to find frequent patterns from sequences as features. To improve classification accuracy and pattern interpretability, contrast pattern mining emerges to discover patterns with high-contrast rates between different categories. To date, existing contrast SPM methods face many challenges, including excessive parameter selection and inefficient occurrences counting. To tackle these issues, this article proposes a top- $k$ self-adaptive contrast SPM, which adaptively adjusts the gap constraints to find top- $k$ self-adaptive contrast patterns (SCPs) from positive and negative sequences. One of the key tasks of the mining problem is to calculate the support (the number of occurrences) of a pattern in each sequence. To support efficient counting, we store all occurrences of a pattern in a special array in a Nettree, an extended tree structure with multiple roots and multiple parents. We employ the array to calculate the occurrences of all its superpatterns with one-way scanning to avoid redundant calculation. Meanwhile, because the contrast SPM problem does not satisfy the Apriori property, we propose Zero and Less strategies to prune candidate patterns and a Contrast-first mining strategy to select patterns with the highest contrast rate as the prefix subpattern and calculate the contrast rate of all its superpatterns. Experiments validate the efficiency of the proposed algorithm and show that contrast patterns significantly outperform frequent patterns for sequence classification. The algorithms and datasets can be downloaded from https://github.com/wuc567/Pattern-Mining/tree/master/SCP-Miner .
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
魔幻凝云发布了新的文献求助10
刚刚
3秒前
qifunongsuo1213完成签到,获得积分10
3秒前
北望完成签到,获得积分10
3秒前
今后应助西扬采纳,获得10
3秒前
3秒前
Pshan发布了新的文献求助30
3秒前
小马甲应助魔幻凝云采纳,获得10
3秒前
拉长的大侠完成签到 ,获得积分10
3秒前
4秒前
小李完成签到,获得积分10
4秒前
从心开始完成签到 ,获得积分10
4秒前
xixixiu72完成签到,获得积分10
4秒前
芋泥完成签到,获得积分20
5秒前
平常无颜发布了新的文献求助10
7秒前
7秒前
7秒前
北望发布了新的文献求助10
7秒前
Ostfang完成签到,获得积分10
9秒前
9秒前
张张发布了新的文献求助20
10秒前
田家溢发布了新的文献求助10
10秒前
11秒前
小小完成签到,获得积分10
12秒前
张泽崇应助Mitsuha采纳,获得10
13秒前
jiangcai发布了新的文献求助10
13秒前
Weinn完成签到,获得积分20
13秒前
Chun发布了新的文献求助20
13秒前
Eton完成签到,获得积分10
14秒前
14秒前
Aiden发布了新的文献求助10
14秒前
桐桐应助XD0026采纳,获得10
14秒前
bingmeishi发布了新的文献求助10
14秒前
15秒前
DongWei95完成签到,获得积分10
15秒前
15秒前
镜花水月完成签到,获得积分10
16秒前
17秒前
TK完成签到 ,获得积分20
17秒前
bkagyin应助angzhang采纳,获得10
17秒前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Teaching Social and Emotional Learning in Physical Education 900
Edestus (Chondrichthyes, Elasmobranchii) from the Upper Carboniferous of Xinjiang, China 500
Chinese-English Translation Lexicon Version 3.0 500
Electronic Structure Calculations and Structure-Property Relationships on Aromatic Nitro Compounds 500
マンネンタケ科植物由来メロテルペノイド類の網羅的全合成/Collective Synthesis of Meroterpenoids Derived from Ganoderma Family 500
[Lambert-Eaton syndrome without calcium channel autoantibodies] 440
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2381667
求助须知:如何正确求助?哪些是违规求助? 2088907
关于积分的说明 5247436
捐赠科研通 1815660
什么是DOI,文献DOI怎么找? 905908
版权声明 558834
科研通“疑难数据库(出版商)”最低求助积分说明 483772