Class imbalance-sensitive approach based on PLMs for the detection of cyberbullying in English and Arabic datasets

阿拉伯语 班级(哲学) 计算机科学 人工智能 自然语言处理 机器学习 语言学 哲学
作者
Azzeddine Rachid Benaissa,Azza Harbaoui,Hajjami Henda Ben Ghezala
出处
期刊:Behaviour & Information Technology [Taylor & Francis]
卷期号:: 1-18 被引量:1
标识
DOI:10.1080/0144929x.2024.2313142
摘要

Social Networking increases allowed the spreading of cyberbullying worldwide. The latter invaded cyberspace, kids and adolescents are no more safe in their virtual playgrounds. Indeed, online bullying is attracting considerable concern due to the societal and health issues it causes, ranging from depression, anxiety, and low self-esteem to sui cide attempts. Automatic cyberbullying detection is becoming a vital factor in protecting individuals' lives. It has received much attention in the last decade. Researchers use machine learning and deep learning models to detect online bullying content. An automatic cyberbullying detection model would flag any bullying text as efficiently as possible. Yet, several challenges lie ahead for the development of such a robust model. Our study discerned class imbalance and bullying text representation as being the major issues concerning cyberbullying classification. In this context, we tried to handle the class imbalance problem through data augmentation, cost-sensitive learning, and lever- aging a Computer Vision loss function for the task. Moreover, we consider a prominent solution for bullying content representation, which consists of fine-tuning Pre-trained Language Models for cyberbullying detection and using these latter as feature extractors for Multichannel ConvNets and Bidirectional LSTMs. The results show the effectiveness of the proposed models, which outperform several past works and provide high Recall values (78%–96%) on English and Arabic datasets.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
隐形曼青应助汐夕采纳,获得10
1秒前
1秒前
lewu完成签到,获得积分10
1秒前
恃6发布了新的文献求助10
2秒前
hanhou发布了新的文献求助10
6秒前
6秒前
2423发布了新的文献求助10
6秒前
科研通AI6.4应助科研小白采纳,获得10
6秒前
楚留香完成签到,获得积分10
6秒前
Wuyiqin发布了新的文献求助20
8秒前
思源应助执笔诉余生1采纳,获得10
9秒前
nnnnnjk完成签到,获得积分10
9秒前
9秒前
恃6完成签到,获得积分20
10秒前
10秒前
ljp完成签到,获得积分10
10秒前
黄子诚关注了科研通微信公众号
10秒前
chuzijia发布了新的文献求助30
10秒前
10秒前
12秒前
靎藥完成签到,获得积分10
12秒前
Ember发布了新的文献求助20
12秒前
姜姜姜姜完成签到 ,获得积分10
13秒前
sunny完成签到,获得积分10
13秒前
蒯秀燕发布了新的文献求助20
14秒前
巫颤发布了新的文献求助10
14秒前
tfldog完成签到,获得积分10
14秒前
魔幻的摩托完成签到 ,获得积分10
15秒前
早睡早起发布了新的文献求助10
15秒前
16秒前
16秒前
救驾来迟完成签到,获得积分10
16秒前
刘言发布了新的文献求助20
16秒前
17秒前
章鱼发布了新的文献求助10
17秒前
AllRightReserved应助ZHY采纳,获得10
17秒前
无花果应助ZHY采纳,获得10
17秒前
东东完成签到,获得积分10
18秒前
哈哈完成签到 ,获得积分10
18秒前
18秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
The Resilient Mindset 400
Impact of Storage Orientation and Duration on Prefilled Syringe Performance: Break-Loose and Glide Forces, and Injection Time Across Multiple Time Points 360
Programming for Chemical Engineers Using C, C++, and MATLAB 300
Upland Kenya wild flowers and ferns: a flora of the flowers, ferns, grasses, and sedges of highland Kenya 300
Disturbing the Quiet Life? Competition and CEO Incentives 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6652611
求助须知:如何正确求助?哪些是违规求助? 8406460
关于积分的说明 17974950
捐赠科研通 5848033
什么是DOI,文献DOI怎么找? 2971759
邀请新用户注册赠送积分活动 1947257
关于科研通互助平台的介绍 1867762