梯度升压
参数化复杂度
数学
稳健性(进化)
Boosting(机器学习)
回归
数学优化
算法
数据点
计算机科学
统计
人工智能
随机森林
生物化学
化学
基因
标识
DOI:10.1016/s0167-9473(01)00065-2
摘要
Gradient boosting constructs additive regression models by sequentially fitting a simple parameterized function (base learner) to current “pseudo”-residuals by least squares at each iteration. The pseudo-residuals are the gradient of the loss functional being minimized, with respect to the model values at each training data point evaluated at the current step. It is shown that both the approximation accuracy and execution speed of gradient boosting can be substantially improved by incorporating randomization into the procedure. Specifically, at each iteration a subsample of the training data is drawn at random (without replacement) from the full training data set. This randomly selected subsample is then used in place of the full sample to fit the base learner and compute the model update for the current iteration. This randomized approach also increases robustness against overcapacity of the base learner.
科研通智能强力驱动
Strongly Powered by AbleSci AI