计算机科学
自然语言处理
语义相似性
人工智能
相似性(几何)
文字嵌入
特征(语言学)
词(群论)
文字蕴涵
皮尔逊积矩相关系数
任务(项目管理)
情报检索
嵌入
语言学
逻辑后果
数学
统计
图像(数学)
管理
经济
哲学
作者
Marwah Alian,Arafat Awajan
标识
DOI:10.1142/s0219649220500331
摘要
Semantic similarity is the task of measuring relations between sentences or words to determine the degree of similarity or resemblance. Several applications of natural language processing require semantic similarity measurement to achieve good results; these applications include plagiarism detection, text entailment, text summarisation, paraphrasing identification, and information extraction. Many researchers have proposed new methods to measure the semantic similarity of Arabic and English texts. In this research, these methods are reviewed and compared. Results show that the precision of the corpus-based approach exceeds 0.70. The precision of the descriptive feature-based technique is between 0.670 and 0.86, with a Pearson correlation coefficient of over 0.70. Meanwhile, the word embedding technique has a correlation of 0.67, and its accuracy is in the range 0.76–0.80. The best results are achieved by the feature-based approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI