Arabic Information Retrieval

阿拉伯语 情报检索 计算机科学 自然语言处理 人工智能 语言学 哲学
作者
Kareem Darwish,Walid Magdy
出处
期刊:Foundations and Trends in Information Retrieval [Now Publishers]
卷期号:7 (4): 239-342 被引量:76
标识
DOI:10.1561/1500000031
摘要

Arabic is ranked as the seventh largest language on the Internet but it has also been the fastest growing language in the last decade in terms of users. At this rate of growth, Arabic users should have the fourth largest user population on the Internet by 2020. Given these facts, it is not surprising that Arabic Information Retrieval (IR) has garnered significant attention. The main research interests have focused on retrieval of formal language, mostly in the news domain, with ad hoc retrieval, OCR document retrieval, and cross-language retrieval. The literature on other aspects of retrieval continues to be sparse or non-existent, though some of these aspects have been investigated by industry. Others aspects of Arabic retrieval that have received attention include document image retrieval, speech search, filtering, and social media and web search. However, efforts within different aspects of Arabic retrieval continue to be deficient and severely lacking behind efforts in other languages. Arabic Information Retrieval reviews Arabic IR including the nature of the Arabic language, the techniques used for pre-processing the language, the latest research in Arabic IR in different domains, and the open areas in Arabic IR. It covers general properties of the Arabic language, aspects of Arabic that affect retrieval, Arabic processing necessary for effective Arabic retrieval, Arabic retrieval in public IR evaluations, Arabic IR and NLP resources, and specialized retrieval problems such as Arabic-English CLIR, Arabic Document Image Retrieval, Arabic Social Search, Arabic Web Search, Question Answering, Image retrieval, and Arabic Speech Search. Lastly, it also discusses open IR problems that require further attention.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
天天发布了新的文献求助10
2秒前
4秒前
4秒前
5秒前
煎蛋公主关注了科研通微信公众号
7秒前
7秒前
Anthony_潇完成签到,获得积分10
8秒前
8秒前
10秒前
壹拾柒发布了新的文献求助10
11秒前
Tao发布了新的文献求助10
12秒前
寻道图强应助庄严采纳,获得20
13秒前
13秒前
韦老虎发布了新的文献求助10
14秒前
星河完成签到,获得积分10
15秒前
16秒前
16秒前
ssp发布了新的文献求助10
17秒前
18秒前
zZ少年发布了新的文献求助10
18秒前
煎蛋公主发布了新的文献求助10
20秒前
20秒前
窦鞅发布了新的文献求助10
21秒前
Ava应助HXH采纳,获得10
22秒前
22秒前
的谷秋完成签到,获得积分20
24秒前
X先生发布了新的文献求助30
26秒前
窦鞅完成签到,获得积分10
26秒前
哈哈哈哈完成签到 ,获得积分10
27秒前
FashionBoy应助星河采纳,获得10
27秒前
Leon发布了新的文献求助10
27秒前
29秒前
小蘑菇应助PPP采纳,获得10
29秒前
积木发布了新的文献求助10
29秒前
31秒前
CipherSage应助character577采纳,获得30
33秒前
十斤辰完成签到,获得积分10
35秒前
我爱学习发布了新的文献求助10
36秒前
阿喜发布了新的文献求助10
37秒前
37秒前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Sport in der Antike 800
De arte gymnastica. The art of gymnastics 600
Berns Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
Stephen R. Mackinnon - Chen Hansheng: China’s Last Romantic Revolutionary (2023) 500
Sport in der Antike Hardcover – March 1, 2015 500
Boris Pesce - Gli impiegati della Fiat dal 1955 al 1999 un percorso nella memoria 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2422983
求助须知:如何正确求助?哪些是违规求助? 2111892
关于积分的说明 5347271
捐赠科研通 1839354
什么是DOI,文献DOI怎么找? 915625
版权声明 561230
科研通“疑难数据库(出版商)”最低求助积分说明 489747