Transfer Learning with a Graph Attention Network and Weighted Loss Function for Screening of Persistent, Bioaccumulative, Mobile, and Toxic Chemicals

生物累积 构造(python库) 计算机科学 环境科学 生化工程 图形 化学 环境化学 工程类 理论计算机科学 程序设计语言
作者
Haobo Wang,Wenjia Liu,Jingwen Chen,Shixin Ji
出处
期刊:Environmental Science & Technology [American Chemical Society]
卷期号:59 (1): 578-590 被引量:18
标识
DOI:10.1021/acs.est.4c11085
摘要

In silico methods for screening hazardous chemicals are necessary for sound management. Persistent, bioaccumulative, mobile, and toxic (PBMT) chemicals persist in the environment and have high mobility in aquatic environments, posing risks to human and ecological health. However, lack of experimental data for the vast number of chemicals hinders identification of PBMT chemicals. Through an extensive search of measured chemical mobility data, as well as persistent, bioaccumulative, and toxic (PBT) chemical inventories, this study constructed comprehensive data sets on PBMT chemicals. To address the limited volume of the PBMT chemical data set, a transfer learning (TL) framework based on graph attention network (GAT) architecture was developed to construct models for screening PBMT chemicals, designating the PBT chemical inventories as source domains and the PBMT chemical data set as target domains. A weighted loss (LW) function was proposed and proved to mitigate the negative impact of imbalanced data on the model performance. Results indicate the TL-GAT models outperformed GAT models, along with large coverage of applicability domains and interpretability. The constructed models were employed to identify PBMT chemicals from inventories consisting of about 1 × 106 chemicals. The developed TL-GAT framework with the LW function holds broad applicability across diverse tasks, especially those involving small and imbalanced data sets.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
1秒前
LabRat完成签到 ,获得积分10
1秒前
2秒前
2秒前
2秒前
3秒前
3秒前
jennifer_zhuang完成签到,获得积分10
3秒前
Yoci完成签到,获得积分10
4秒前
saber完成签到,获得积分10
4秒前
彭于晏应助尔东采纳,获得10
4秒前
5秒前
无极微光应助因子采纳,获得30
5秒前
helloworld发布了新的文献求助10
6秒前
淡漠完成签到,获得积分10
6秒前
炙热尔烟完成签到,获得积分20
6秒前
夏雪儿发布了新的文献求助20
7秒前
小马甲应助657采纳,获得10
8秒前
lxt发布了新的文献求助10
8秒前
liliy发布了新的文献求助10
8秒前
Lin_sandwich发布了新的文献求助10
8秒前
可爱的函函应助魔幻秋蝶采纳,获得10
8秒前
满意的鱼完成签到 ,获得积分10
8秒前
高大晓曼发布了新的文献求助10
8秒前
9秒前
9秒前
SepChopin发布了新的文献求助10
9秒前
不安从灵发布了新的文献求助10
9秒前
10秒前
NexusExplorer应助淡漠采纳,获得10
10秒前
11秒前
ZhuXX发布了新的文献求助10
12秒前
12秒前
深情安青应助fj采纳,获得10
13秒前
mos2003发布了新的文献求助30
14秒前
lyx发布了新的文献求助10
14秒前
嘿嘿发布了新的文献求助10
15秒前
Jasper应助liliy采纳,获得10
16秒前
O倻发布了新的文献求助10
16秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
The Composition and Relative Chronology of Dynasties 16 and 17 in Egypt 1500
Picture this! Including first nations fiction picture books in school library collections 1500
Signals, Systems, and Signal Processing 610
Unlocking Chemical Thinking: Reimagining Chemistry Teaching and Learning 555
Scientific Writing and Communication: Papers, Proposals, and Presentations 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6370594
求助须知:如何正确求助?哪些是违规求助? 8184442
关于积分的说明 17267632
捐赠科研通 5425164
什么是DOI,文献DOI怎么找? 2870157
邀请新用户注册赠送积分活动 1847204
关于科研通互助平台的介绍 1693896