RidgeSketch: A Fast Sketching Based Solver for Large Scale Ridge Regression

素描 计算机科学 解算器 收敛速度 算法 次线性函数 理论计算机科学 数学 程序设计语言 离散数学 计算机网络 频道(广播)
作者
Nidham Gazagnadou,Mark Ibrahim,Robert M. Gower
出处
期刊:SIAM Journal on Matrix Analysis and Applications [Society for Industrial and Applied Mathematics]
卷期号:43 (3): 1440-1468 被引量:5
标识
DOI:10.1137/21m1422963
摘要

We propose new variants of the sketch-and-project method for solving large scale ridge regression problems. First, we propose a new momentum alternative and provide a theorem showing it can speed up the convergence of sketch-and-project, through a fast sublinear convergence rate. We carefully delimit under what settings this new sublinear rate is faster than the previously known linear rate of convergence of sketch-and-project without momentum. Second, we consider combining the sketch-and-project method with new modern sketching methods such as Count sketch, SubCount sketch (a new method we propose), and subsampled Hadamard transforms. We show experimentally that when combined with the sketch-and-project method, the (Sub)Count sketch is very effective on sparse data and the standard Subsample sketch is effective on dense data. Indeed, we show that these sketching methods, combined with our new momentum scheme, result in methods that are competitive even when compared to the conjugate gradient method on real large scale data. On the contrary, we show the subsampled Hadamard transform does not perform well in this setting, despite the use of fast Hadamard transforms, and nor do recently proposed acceleration schemes work well in practice. To support all of our experimental findings, and invite the community to validate and extend our results, with this paper we are also releasing an open source software package: RidgeSketch. We designed this object-oriented package in Python for testing sketch-and-project methods and benchmarking ridge regression solvers. RidgeSketch is highly modular, and new sketching methods can easily be added as subclasses. We provide code snippets of our package in the appendix.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
hooooow发布了新的文献求助10
1秒前
星辰大海应助周伯通采纳,获得10
1秒前
2秒前
2秒前
匆聪完成签到,获得积分10
3秒前
jclin发布了新的文献求助10
6秒前
6秒前
yyy完成签到,获得积分10
6秒前
YYY发布了新的文献求助10
7秒前
思源应助小狮子采纳,获得10
7秒前
LXYang发布了新的文献求助10
7秒前
9秒前
susu完成签到,获得积分10
9秒前
洁净雨发布了新的文献求助10
11秒前
研友_LkD09n完成签到,获得积分20
11秒前
随霖完成签到 ,获得积分10
11秒前
12秒前
12秒前
笨笨凡松发布了新的文献求助10
14秒前
Leanne应助zhou采纳,获得10
15秒前
爱壹帆完成签到,获得积分10
15秒前
15秒前
周伯通发布了新的文献求助10
17秒前
科研通AI6.4应助kk采纳,获得10
18秒前
NexusExplorer应助xiaolizi采纳,获得10
18秒前
非而者厚发布了新的文献求助10
18秒前
Hawnyoung完成签到,获得积分10
21秒前
笨笨凡松完成签到,获得积分10
21秒前
21秒前
22秒前
24秒前
睡个懒觉完成签到,获得积分10
25秒前
25秒前
26秒前
26秒前
27秒前
辛尘发布了新的文献求助10
27秒前
heyunxiang完成签到 ,获得积分10
28秒前
hooooow完成签到,获得积分10
29秒前
小狮子发布了新的文献求助10
30秒前
高分求助中
The Graphene Handbook (2019 Edition) 800
Signals, Systems, and Signal Processing 610
IEST-RP-CC018: Cleanroom Cleaning and Sanitization: Operating and Monitoring Procedures 600
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
久松真一著作集〈第5巻〉禅と芸術 500
Fundamentals of Modern Mathematics: A Practical Review (Dover Books on Mathematics) 500
Cold War Transcended: Australia's China Policy, 1949-1990 470
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6599058
求助须知:如何正确求助?哪些是违规求助? 8368470
关于积分的说明 17911948
捐赠科研通 5753588
什么是DOI,文献DOI怎么找? 2954007
邀请新用户注册赠送积分活动 1929216
关于科研通互助平台的介绍 1824259