缺少数据
插补(统计学)
计算机科学
人工智能
机器学习
数据挖掘
一般化
数据建模
人工神经网络
数学
数据库
数学分析
作者
Yongshun Gong,Zhibin Li,Wei Liu,Xiankai Lu,Xinwang Liu,Ivor W. Tsang,Yilong Yin
标识
DOI:10.1109/tpami.2023.3262784
摘要
Many real-world problems deal with collections of data with missing values, e.g., RNA sequential analytics, image completion, video processing, etc. Usually, such missing data is a serious impediment to a good learning achievement. Existing methods tend to use a universal model for all incomplete data, resulting in a suboptimal model for each missingness pattern. In this paper, we present a general model for learning with incomplete data. The proposed model can be appropriately adjusted with different missingness patterns, alleviating competitions between data. Our model is based on observable features only, so it does not incur errors from data imputation. We further introduce a low-rank constraint to promote the generalization ability of our model. Analysis of the generalization error justifies our idea theoretically. In additional, a subgradient method is proposed to optimize our model with a proven convergence rate. Experiments on different types of data show that our method compares favorably with typical imputation strategies and other state-of-the-art models for incomplete data. More importantly, our method can be seamlessly incorporated into the neural networks with the best results achieved. The source code is released at https://github.com/YS-GONG/missingness-patterns.
科研通智能强力驱动
Strongly Powered by AbleSci AI