RidgeSketch: A Fast Sketching Based Solver for Large Scale Ridge Regression

素描 计算机科学 解算器 收敛速度 算法 次线性函数 理论计算机科学 数学 程序设计语言 离散数学 计算机网络 频道(广播)
作者
Nidham Gazagnadou,Mark Ibrahim,Robert M. Gower
出处
期刊:SIAM Journal on Matrix Analysis and Applications [Society for Industrial and Applied Mathematics]
卷期号:43 (3): 1440-1468 被引量:5
标识
DOI:10.1137/21m1422963
摘要

We propose new variants of the sketch-and-project method for solving large scale ridge regression problems. First, we propose a new momentum alternative and provide a theorem showing it can speed up the convergence of sketch-and-project, through a fast sublinear convergence rate. We carefully delimit under what settings this new sublinear rate is faster than the previously known linear rate of convergence of sketch-and-project without momentum. Second, we consider combining the sketch-and-project method with new modern sketching methods such as Count sketch, SubCount sketch (a new method we propose), and subsampled Hadamard transforms. We show experimentally that when combined with the sketch-and-project method, the (Sub)Count sketch is very effective on sparse data and the standard Subsample sketch is effective on dense data. Indeed, we show that these sketching methods, combined with our new momentum scheme, result in methods that are competitive even when compared to the conjugate gradient method on real large scale data. On the contrary, we show the subsampled Hadamard transform does not perform well in this setting, despite the use of fast Hadamard transforms, and nor do recently proposed acceleration schemes work well in practice. To support all of our experimental findings, and invite the community to validate and extend our results, with this paper we are also releasing an open source software package: RidgeSketch. We designed this object-oriented package in Python for testing sketch-and-project methods and benchmarking ridge regression solvers. RidgeSketch is highly modular, and new sketching methods can easily be added as subclasses. We provide code snippets of our package in the appendix.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
KX2024完成签到,获得积分10
刚刚
1秒前
1秒前
落后的嚣完成签到,获得积分10
2秒前
留胡子的小甜瓜完成签到,获得积分20
2秒前
cdercder应助爱学习的小钟采纳,获得10
2秒前
HY发布了新的文献求助10
4秒前
呆头鹅完成签到 ,获得积分10
6秒前
华仔应助maclogos采纳,获得10
7秒前
8秒前
科目三应助顺心的巨人采纳,获得10
10秒前
10秒前
NexusExplorer应助You采纳,获得10
11秒前
四喜丸子完成签到,获得积分10
12秒前
13秒前
yu发布了新的文献求助10
14秒前
14秒前
14秒前
14秒前
GCB完成签到,获得积分20
14秒前
天天快乐应助虚拟的秋寒采纳,获得10
15秒前
Xx发布了新的文献求助10
15秒前
CICI完成签到,获得积分10
17秒前
赘婿应助清瓷采纳,获得10
17秒前
Hello应助顺心的巨人采纳,获得10
17秒前
may驳回了田様应助
18秒前
搜集达人应助GCB采纳,获得10
18秒前
19秒前
天天快乐应助Xx采纳,获得10
19秒前
20秒前
shirely发布了新的文献求助10
20秒前
啊啊完成签到 ,获得积分10
22秒前
Owen应助风带走黎明采纳,获得10
22秒前
夭夭完成签到,获得积分10
22秒前
22秒前
24秒前
Hello应助邵钰博采纳,获得10
24秒前
25秒前
26秒前
wztao完成签到 ,获得积分10
27秒前
高分求助中
The Graphene Handbook (2019 Edition) 800
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
久松真一著作集〈第5巻〉禅と芸術 500
Fundamentals of Modern Mathematics: A Practical Review (Dover Books on Mathematics) 500
Cold War Transcended: Australia's China Policy, 1949-1990 470
Non-Sequential Optical Design using Zemax OpticStudio®: Design Process and Practical Examples 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6604540
求助须知:如何正确求助?哪些是违规求助? 8372501
关于积分的说明 17917838
捐赠科研通 5762922
什么是DOI,文献DOI怎么找? 2955852
邀请新用户注册赠送积分活动 1930905
关于科研通互助平台的介绍 1828405