K1K2NN: A Novel Multi-Label Classification Approach Based on Neighbors for Predicting COVID-19 Drug Side Effects

雅卡索引药品分类器（UML） 2019年冠状病毒病（COVID-19）药物重新定位多标签分类计算机科学相似性（几何）人工智能样品（材料）机器学习 k-最近邻算法数据挖掘模式识别（心理学）医学药理学化学疾病病理传染病（医学专业）图像（数学）色谱法

作者

Pranab Jyoti Das,Dilwar Hussain Mazumder

出处

期刊：Computational Biology and Chemistry [Elsevier]
日期：2024-06-01 卷期号：110: 108066-108066

链接

nih.govdoi.org

标识

DOI：10.1016/j.compbiolchem.2024.108066

摘要

COVID-19, a novel ailment, has received comparatively fewer drugs for its treatment. Side Effects (SE) of a COVID-19 drug could cause long-term health issues. Hence, SE prediction is essential in COVID-19 drug development. Efficient models are also needed to predict COVID-19 drug SE since most existing research has proposed many classifiers to predict SE for diseases other than COVID-19. This work proposes a novel classifier based on neighbors named K1 K2 Nearest Neighbors (K1K2NN) to predict the SE of the COVID-19 drug from 17 molecules' descriptors and the chemical 1D structure of the drugs. The model is implemented based on the proposition that chemically similar drugs may be assigned similar drug SE, and co-occurring SE may be assigned to chemically similar drugs. The K1K2NN model chooses the first K1 neighbors to the test drug sample by calculating its similarity with the train drug samples. It then assigns the test sample with the SE label having the majority count on the SE labels of these K1 neighbor drugs obtained through a voting mechanism. The model then calculates the SE-SE similarity using the Jaccard similarity measure from the SE co-occurrence values. Finally, the model chooses the most similar K2 SE neighbors for those SE determined by the K1 neighbor drugs and assigns these SE to that test drug sample. The proposed K1K2NN model has showcased promising performance with the highest accuracy of 97.53% on chemical 1D drug structure and outperforms the state-of-the-art multi-label classifiers. In addition, we demonstrate the successful application of the proposed model on gene expression signature datasets, which aided in evaluating its performance and confirming its accuracy and robustness.

求助该文献

最长约 10秒，即可获得该文献文件

K1K2NN: A Novel Multi-Label Classification Approach Based on Neighbors for Predicting COVID-19 Drug Side Effects

今日热心研友