已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

Logistic Regression Matching Pursuit algorithm for text classification

匹配追踪 算法 逻辑回归 计算机科学 水准点(测量) 人工智能 统计分类 分类器(UML) 残余物 模式识别(心理学) 数学 机器学习 大地测量学 压缩传感 地理
作者
Qing Li,Shuai Zhao,Shancheng Zhao,Jinming Wen
出处
期刊:Knowledge Based Systems [Elsevier]
卷期号:277: 110761-110761 被引量:4
标识
DOI:10.1016/j.knosys.2023.110761
摘要

Text classification is a challenging problem due to the high dimensionality of the text, which can limit classification performance. The orthogonal matching pursuit (OMP) algorithm is one of the most popular sparse recovery algorithms. An OMP based text classification algorithm, called the Logistic-OMP algorithm, was recently proposed by Skianis et al.. Simulation tests indicate that Logistic-OMP has excellent performance in text dimensionality reduction. This paper optimizes the Logistic-OMP algorithm, and proposes a new text classification algorithm called the Logistic Regression Matching Pursuit (LRMP) algorithm. The LRMP algorithm defines a new loss function and residual update function. It requires only one iteration to solve the negative log likelihood minimization problem, and its classification performance is guaranteed by the strong Wolfe condition, which makes it optimizes the classification accuracy while significantly speeding up the training speed. Simulation tests on topic classification and sentiment analysis from 20Newsgroups, Amazon product reviews, and movie reviews datasets show that the LRMP algorithm has a shorter computation time of a single iteration than the Logistic-OMP algorithm, with a total training time of 8.08%–21.16% shorter than that of the Logistic-OMP algorithm, and the memory usage is 3.20%–6.21% lower than that of the Logistic-OMP algorithm. Furthermore, the average Accuracy and F1-Score of the LRMP algorithm are improved by 1.61%–26.55% and 1.83%–25.46%, respectively, compared with the benchmark classifier. Compared with the advanced classifiers (including Logistic-OMP), the average Accuracy and F1-Score of the LRMP algorithm are improved by 0.46%–8.97% and 0.57%–9.40%, respectively.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
搜集达人应助shzshz采纳,获得10
2秒前
小爽发布了新的文献求助10
3秒前
逆风飞扬发布了新的文献求助10
3秒前
潇洒一曲完成签到,获得积分10
5秒前
HGalong应助Hemp采纳,获得10
6秒前
英姑应助金城武采纳,获得10
6秒前
9秒前
东方傲儿发布了新的文献求助20
9秒前
CodeCraft应助逆风飞扬采纳,获得10
10秒前
领导范儿应助逆风飞扬采纳,获得10
10秒前
JamesPei应助清逸之风采纳,获得10
10秒前
魏少爷完成签到 ,获得积分10
12秒前
13秒前
sunow77完成签到,获得积分10
13秒前
16秒前
16秒前
小懒猪发布了新的文献求助10
22秒前
jjjmsekk完成签到,获得积分20
23秒前
SOLOMON应助tsttst采纳,获得10
28秒前
28秒前
但行好事完成签到 ,获得积分10
30秒前
传奇3应助禹代秋采纳,获得10
31秒前
33秒前
Zcz发布了新的文献求助10
33秒前
shinysparrow应助xiu-er采纳,获得10
36秒前
99giddens应助傻傻的思远采纳,获得50
36秒前
36秒前
Hello应助烧烤店在逃花肉采纳,获得10
37秒前
情怀应助无情的宛儿采纳,获得10
39秒前
沉默的友安完成签到,获得积分10
39秒前
东方立轩发布了新的文献求助10
39秒前
爆米花应助krystian11采纳,获得10
42秒前
橘子不甜发布了新的文献求助10
43秒前
东方立轩完成签到,获得积分10
45秒前
隐形曼青应助激动的严青采纳,获得10
45秒前
烧烤店在逃花肉完成签到 ,获得积分10
48秒前
48秒前
49秒前
开放的平凡完成签到 ,获得积分10
50秒前
Unpredictable发布了新的文献求助10
53秒前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Sport in der Antike 800
Aspect and Predication: The Semantics of Argument Structure 666
De arte gymnastica. The art of gymnastics 600
少脉山油柑叶的化学成分研究 530
Electronic Structure Calculations and Structure-Property Relationships on Aromatic Nitro Compounds 500
Berns Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2411850
求助须知:如何正确求助?哪些是违规求助? 2106667
关于积分的说明 5323767
捐赠科研通 1834056
什么是DOI,文献DOI怎么找? 913863
版权声明 560898
科研通“疑难数据库(出版商)”最低求助积分说明 488704