化学
公共化学
人工智能
计算机科学
深度学习
集成学习
机器学习
人工神经网络
集合预报
数据挖掘
癌症
虚拟筛选
药物发现
卷积神经网络
管道(软件)
集合(抽象数据类型)
数据集
训练集
支持向量机
堆积
质量(理念)
计算生物学
作者
Ahmad, Shaban,Raza, Khalid
标识
DOI:10.6084/m9.figshare.30763082.v1
摘要
Supplementary Dataset for:DrLungker: A Deep Ensemble Learning Framework for Predicting Anti-Lung Cancer Compound Activity and Validating Multitarget Potency through WaterMap, DFT, MD Simulations, and MM-GBSA AnalysisPublished in: Advanced Theory and Simulations
Manuscript DOI: https://doi.org/10.1002/adts.202501550
More information: https://github.com/ShabanAhmad/DrLungkerDescriptionThis repository provides the full training datasets used to develop the DrLungker deep ensemble learning framework for predicting anti-lung cancer compound activity. The data were curated from PubChem and ChEMBL lung cancer bioassays, followed by preprocessing, structure standardization, descriptor generation, and quality filtering.The final dataset includes 26,396 unique compounds, each encoded with 5,883 molecular descriptors (AlvaDesc + QikProp), and was used to train the hybrid ensemble model integrating ResNet, Feedforward Neural Network (FNN), and RNN-LSTM architectures with Averaging, Voting, and Stacking techniques.These files are archived here due to their large size and ensure full reproducibility of all machine-learning results described in the manuscript.
科研通智能强力驱动
Strongly Powered by AbleSci AI