计算机科学
歧管(流体力学)
欧几里得空间
黎曼流形
信息几何学
统计流形
黎曼几何
空格(标点符号)
伪黎曼流形
人工智能
非线性降维
欧几里德几何
欧几里德距离
数学
几何学
里希曲率
纯数学
降维
标量曲率
机械工程
曲率
工程类
操作系统
作者
Yanhong Fei,Yingjie Liu,Chentao Jia,LI Zheng-yu,Xian Wei,Mingsong Chen
摘要
Deep Learning (DL) has achieved remarkable success in tackling complex Artificial Intelligence tasks. The standard training of neural networks employs backpropagation to compute gradients and utilizes various optimization algorithms in the Euclidean space \(\mathbb {R}^n \) . However, this optimization process faces challenges, such as the local optimal issues and the problem of gradient vanishing and exploding. To address these problems, Riemannian optimization offers a powerful extension to solve optimization problems in deep learning. By incorporating the prior constraint structure and the metric information of the underlying geometric information, Riemannian optimization-based DL offers a more stable and reliable optimization process, as well as enhanced adaptability to complex data structures. This article presents a comprehensive survey of applying geometric optimization in DL, including the basic procedure of geometric optimization, various geometric optimizers, and some concepts of the Riemannian manifold. In addition, it investigates various applications of geometric optimization in different DL networks for diverse tasks and discusses typical public toolboxes that implement optimization on the manifold. This article also includes a performance comparison among different deep geometric optimization methods in image recognition scenarios. Finally, this article elaborates on future opportunities and challenges in this field.
科研通智能强力驱动
Strongly Powered by AbleSci AI