余弦相似度
三角函数
维数(图论)
相似性(几何)
度量(数据仓库)
矢量化(数学)
相似性度量
算法
欧几里德距离
离散余弦变换
数学
距离测量
计算机科学
距离测量
人工智能
模式识别(心理学)
图像(数学)
组合数学
几何学
数据挖掘
并行计算
作者
Jianqiang Zhang,Fangxu Wang,Futan Ma,Guoxing Song
标识
DOI:10.1109/icedcs57360.2022.00015
摘要
In the text similarity calculation algorithm, the distance algorithm is widely used to measure the distance difference between two text vectors, and the cosine distance algorithm is fully verified. However, the measurement Angle of cosine distance focuses on the included Angle after text vectorization, that is, the direction of vector, without considering the speed of change of each vector. The improved algorithm proposed in this paper not only considers the direction dimension of the vector, but also studies the influence of the change of each dimension on the text similarity of cosine distance measure. Experiments show that compared with traditional cosine similarity, the improved algorithm in this paper has obvious improvement in accuracy and efficiency.
科研通智能强力驱动
Strongly Powered by AbleSci AI