计算机科学
修剪
集团
数据挖掘
导线
光学(聚焦)
散列函数
先验与后验
哈希表
地理
生物
农学
组合数学
认识论
光学
大地测量学
计算机安全
数学
物理
哲学
作者
Junyi Li,Lizhen Wang,Vanha Tran,Junyi Li,Xiwen Jiang
标识
DOI:10.1007/978-3-031-32910-4_13
摘要
Traditional prevalent co-location pattern mining (PCPM) methods generate complete table instances (TIs) of all candidates, which is both time and space consuming. Existing Apriori-like methods focus on improving the efficiency of TI generation, while existing Clique-based methods still contain many repeated traversing processes, and therefore neither of them can very well detect the overlap in TIs. To address these challenges, this paper first proposes the concept of extended maximal cliques (EMCs) to detect instance overlap situations, and designs a hash-based storage structure SSHT to reduce mining consumption. Second, a novel approach PCPM-EMC is introduced to detect all prevalent co-location patterns (PCPs), which uses the proposed P-BKp,d algorithm to generate EMCs, and adopts a bidirectional pruning strategy for PCP detection. Lastly, extensive experiments on both real-world and synthetic datasets show that the proposed approach is efficient, and reducing more over 80% space consumption and more than 50% time consumption than existing methods, especially in dense datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI