清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

PathSim

计算机科学 相似性(几何) 路径(计算) 理论计算机科学 语义学(计算机科学) 语义相似性 对象(语法) 数据挖掘 情报检索 人工智能 图像(数学) 程序设计语言
作者
Yizhou Sun,Jiawei Han,Xifeng Yan,Philip S. Yu,Tianyi Wu
出处
期刊:Proceedings of the VLDB Endowment [Association for Computing Machinery]
卷期号:4 (11): 992-1003 被引量:1654
标识
DOI:10.14778/3402707.3402736
摘要

Similarity search is a primitive operation in database and Web search engines. With the advent of large-scale heterogeneous information networks that consist of multi-typed, interconnected objects, such as the bibliographic networks and social media networks, it is important to study similarity search in such networks. Intuitively, two objects are similar if they are linked by many paths in the network. However, most existing similarity measures are defined for homogeneous networks. Different semantic meanings behind paths are not taken into consideration. Thus they cannot be directly applied to heterogeneous networks. In this paper, we study similarity search that is defined among the same type of objects in heterogeneous networks. Moreover, by considering different linkage paths in a network, one could derive various similarity semantics. Therefore, we introduce the concept of meta path-based similarity , where a meta path is a path consisting of a sequence of relations defined between different object types ( i.e. , structural paths at the meta level). No matter whether a user would like to explicitly specify a path combination given sufficient domain knowledge, or choose the best path by experimental trials, or simply provide training examples to learn it, meta path forms a common base for a network-based similarity search engine. In particular, under the meta path framework we define a novel similarity measure called PathSim that is able to find peer objects in the network ( e.g. , find authors in the similar field and with similar reputation), which turns out to be more meaningful in many scenarios compared with random-walk based similarity measures. In order to support fast online query processing for PathSim queries, we develop an efficient solution that partially materializes short meta paths and then concatenates them online to compute top- k results. Experiments on real data sets demonstrate the effectiveness and efficiency of our proposed paradigm.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
1秒前
森森发布了新的文献求助10
4秒前
zgx完成签到 ,获得积分10
17秒前
紫熊发布了新的文献求助20
18秒前
25秒前
在水一方应助森森采纳,获得10
27秒前
研友_nxw2xL完成签到,获得积分10
1分钟前
muriel完成签到,获得积分10
1分钟前
cadcae完成签到,获得积分10
1分钟前
yi完成签到,获得积分10
1分钟前
曾经的背包完成签到 ,获得积分10
2分钟前
k001boyxw完成签到,获得积分10
2分钟前
一见憘完成签到 ,获得积分10
2分钟前
王磊完成签到 ,获得积分10
2分钟前
乐观海云完成签到 ,获得积分10
3分钟前
jjj完成签到 ,获得积分10
3分钟前
3分钟前
3分钟前
BLUK发布了新的文献求助10
3分钟前
森森发布了新的文献求助10
3分钟前
完美世界应助zyq111111采纳,获得10
3分钟前
3分钟前
zyq111111发布了新的文献求助10
3分钟前
森森完成签到,获得积分10
4分钟前
taku完成签到 ,获得积分10
4分钟前
zyq111111完成签到,获得积分10
4分钟前
Gary完成签到 ,获得积分10
5分钟前
搜集达人应助科研通管家采纳,获得10
5分钟前
Wang完成签到 ,获得积分20
5分钟前
科研通AI5应助几钱采纳,获得20
5分钟前
糟糕的翅膀完成签到,获得积分10
6分钟前
qq完成签到 ,获得积分10
6分钟前
6分钟前
几钱发布了新的文献求助20
6分钟前
6分钟前
觅海发布了新的文献求助10
7分钟前
naczx完成签到,获得积分0
7分钟前
7分钟前
觅海完成签到,获得积分10
7分钟前
高分求助中
中华人民共和国出版史料 4 1000
Applied Survey Data Analysis (第三版, 2025) 800
Narcissistic Personality Disorder 700
Assessing and Diagnosing Young Children with Neurodevelopmental Disorders (2nd Edition) 700
Handbook of Experimental Social Psychology 500
The Martian climate revisited: atmosphere and environment of a desert planet 500
建国初期十七年翻译活动的实证研究. 建国初期十七年翻译活动的实证研究 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3845559
求助须知:如何正确求助?哪些是违规求助? 3387836
关于积分的说明 10550653
捐赠科研通 3108452
什么是DOI,文献DOI怎么找? 1712813
邀请新用户注册赠送积分活动 824508
科研通“疑难数据库(出版商)”最低求助积分说明 774877