估计员
一致性(知识库)
推论
渐近分布
投影(关系代数)
班级(哲学)
标记数据
正态性
数学
统计
计算机科学
人工智能
数据挖掘
数学优化
机器学习
算法
作者
Shanshan Song,Yuanyuan Lin,Yong Zhou
标识
DOI:10.1080/01621459.2023.2169699
摘要
We study a class of general M-estimators in the semi-supervised setting, wherein the data are typically a combination of a relatively small labeled dataset and large amounts of unlabeled data. A new estimator, which efficiently uses the useful information contained in the unlabeled data, is proposed via a projection technique. We prove consistency and asymptotic normality, and provide an inference procedure based on K-fold cross-validation. The optimal weights are derived to balance the contributions of the labeled and unlabeled data. It is shown that the proposed method, by taking advantage of the unlabeled data, produces asymptotically more efficient estimation of the target parameters than the supervised counterpart. Supportive numerical evidence is shown in simulation studies. Applications are illustrated in analysis of the homeless data in Los Angeles. Supplementary materials for this article are available online.
科研通智能强力驱动
Strongly Powered by AbleSci AI