社会化媒体
计算机科学
机器学习
可读性
钥匙(锁)
自然语言处理
误传
随机森林
特征工程
特征(语言学)
人工智能
tf–国际设计公司
特征提取
散列函数
语言学
索引(排版)
深度学习
期限(时间)
万维网
哲学
物理
计算机安全
量子力学
程序设计语言
作者
Sonal Garg,Dilip Kumar Sharma
标识
DOI:10.1016/j.cie.2022.108432
摘要
• Fake news detection using 26 linguistic features. • Three feature extraction techniques named Tf-idf, count-vectorizer and hash vectorizer is applied. • Artificial intelligence solution is used in social media and fact-checking industry. • Machine-learning models are implemented on four popular datasets. Social media platforms now a day are mainly used for news consumption among users. Political groups use social media platforms to attract users by enclosing users' votes in their favor. Due to the large volume of data on social media, it is essential to verify the authenticity of the content. The use of artificial intelligence techniques including the development of embedding and deployment of the machine-learning algorithm is required to combat misinformation. This paper focused on various categories of linguistic features covering complexity features, readability index, psycholinguistic features, and stylometric features for competent fake news identification. The linguistic model helps in computing language-driven features by learning the properties of news content. In this work, we have selected twenty-six significant features and applied various machine learning models for implementation. For feature extraction, three different techniques named term frequency-inverse document frequency (tf-idf), count vectorizer (CV), and hash-vectorizer (HV) are applied. Then, we tested those models in different training dataset sizes to obtain accuracy for each model and compared them. We used four existing datasets for the experiment. The proposed framework achieved 90.8 % accuracy using Reuter dataset. Buzzfeed dataset obtained highest of 90% accuracy. Random Political and Mc_Intire dataset achieved an accuracy of 93.8 and 86.9% respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI