自编码
计算机科学
财产(哲学)
人工智能
机器学习
水准点(测量)
任务(项目管理)
主动学习(机器学习)
对抗制
过程(计算)
集合(抽象数据类型)
深度学习
数据挖掘
工程类
操作系统
哲学
认识论
程序设计语言
系统工程
地理
大地测量学
作者
Linjie Li,Yi Xiao,Dewei Ma,Kai Zheng
标识
DOI:10.1109/ccis57298.2022.10016422
摘要
Molecular property prediction is a fundamental task in drug discovery. The majority of the high-performing molecular property prediction methods currently were developed using deep learning techniques, which rely on massive labeled data. However, accurate molecular property annotation is time-consuming and expensive. Due to the fact that different samples usually have unequal importance in model training, we propose a pre-trained variational adversarial active learning, PREVAIL for short, to query the most informative samples to be annotated to reduce the annotation cost. Specifically, different from previous active learning whose initial set is sampled randomly, PREVAIL selects the most informative initial dataset by an autoencoder and K-Center greedy algorithm, which can avoid biases that affect the accuracy of the early decision-making process. Furthermore, PREVAIL simultaneously adapts the distribution of molecules and the information of the prediction task by incorporating the loss information of the molecular property prediction task into the latent space using task-aware variational adversarial active learning. Our benchmark experiments demonstrate that PREVAIL outperforms state-of-the-art active learning methods on molecular property prediction tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI