Analytics of machine learning-based algorithms for text classification

计算机科学 机器学习 人工智能 朴素贝叶斯分类器 支持向量机 随机森林 统计分类 k-最近邻算法 数据挖掘 多项式logistic回归 算法
作者
Sayar Ul Hassan,Jameel Ahamed,Khaleel Ahmad
出处
期刊:Sustainable operations and computers [Elsevier BV]
卷期号:3: 238-248 被引量:28
标识
DOI:10.1016/j.susoc.2022.03.001
摘要

Text classification is the most vital area in natural language processing in which text data is automatically sorted into a predefined set of classes. The application of text classification is wide in commercial works like spam filtering, decision making, extracting information from raw data, and many other applications. Text classification is more significant for many enterprises since it eliminates the need for manual data classification, a more expensive and time-consuming mechanism. In this paper, a comparative analysis of text classification is done in which the efficiency of different machine learning algorithms on different datasets is analyzed and compared. Support Vector Machine (SVM), k-Nearest Neighbor (k-NN), Logistic Regression (LR), Multinomial Naïve Bayes (MNB), and Random Forest (RF) are Machine Learning based algorithms used in this work. Two different datasets are used to make a comparative analysis of these algorithms. This paper further analyzes the machine learning techniques employed for text classification on the basis of performance metrics viz accuracy, precision, recall and f1- score. The resullltsss reveals that Logistic Regression and Support Vector Machine outperforms the other models in the IMDB dataset, and kNN outperforms the other models for the SPAM dataset as per the results obtained from the proposed system.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
快乐的窝瓜完成签到 ,获得积分10
2秒前
小蘑菇应助科研通管家采纳,获得10
2秒前
酷波er应助科研通管家采纳,获得10
2秒前
2秒前
海蓝云天应助科研通管家采纳,获得10
2秒前
2秒前
2秒前
Yi应助科研通管家采纳,获得10
2秒前
思源应助科研通管家采纳,获得10
2秒前
Ava应助科研通管家采纳,获得10
3秒前
烟花应助科研通管家采纳,获得10
3秒前
3秒前
NexusExplorer应助科研通管家采纳,获得10
3秒前
Yi应助科研通管家采纳,获得10
3秒前
Yi应助科研通管家采纳,获得10
3秒前
Jasper应助科研通管家采纳,获得10
3秒前
酷波er应助科研通管家采纳,获得10
3秒前
4秒前
药学生发布了新的文献求助10
5秒前
臭小子完成签到,获得积分20
5秒前
eryu25发布了新的文献求助20
5秒前
文静人达完成签到 ,获得积分10
8秒前
典雅的丹寒完成签到,获得积分10
8秒前
柚子完成签到 ,获得积分10
8秒前
8秒前
8秒前
英姑应助Simms采纳,获得10
10秒前
Owen应助rilin采纳,获得50
10秒前
qiqiqiqiqi完成签到 ,获得积分10
10秒前
kkk完成签到,获得积分10
12秒前
shafulin发布了新的文献求助10
12秒前
标致梦玉发布了新的文献求助10
14秒前
14秒前
15秒前
科研通AI6.1应助药学生采纳,获得10
16秒前
16秒前
聪明梦松完成签到,获得积分10
16秒前
科研通AI6.2应助Zhou采纳,获得10
17秒前
sduwl完成签到,获得积分10
19秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
晶种分解过程与铝酸钠溶液混合强度关系的探讨 8888
Les Mantodea de Guyane Insecta, Polyneoptera 2000
The Organometallic Chemistry of the Transition Metals 800
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
Signals, Systems, and Signal Processing 610
The formation of Australian attitudes towards China, 1918-1941 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6420722
求助须知:如何正确求助?哪些是违规求助? 8239990
关于积分的说明 17510700
捐赠科研通 5474352
什么是DOI,文献DOI怎么找? 2891977
邀请新用户注册赠送积分活动 1868531
关于科研通互助平台的介绍 1705762