规范化(社会学)
人工神经网络
数学
计算机科学
人工智能
社会学
社会科学
作者
Daniel Brooks,Olivier Schwander,Frédéric Barbaresco,Jean-Yves Schneider,Matthieu Cord
出处
期刊:Cornell University - arXiv
日期:2019-01-01
被引量:53
标识
DOI:10.48550/arxiv.1909.02414
摘要
Covariance matrices have attracted attention for machine learning applications due to their capacity to capture interesting structure in the data. The main challenge is that one needs to take into account the particular geometry of the Riemannian manifold of symmetric positive definite (SPD) matrices they belong to. In the context of deep networks, several architectures for these matrices have recently been proposed. In our article, we introduce a Riemannian batch normalization (batchnorm) algorithm, which generalizes the one used in Euclidean nets. This novel layer makes use of geometric operations on the manifold, notably the Riemannian barycenter, parallel transport and non-linear structured matrix transformations. We derive a new manifold-constrained gradient descent algorithm working in the space of SPD matrices, allowing to learn the batchnorm layer. We validate our proposed approach with experiments in three different contexts on diverse data types: a drone recognition dataset from radar observations, and on emotion and action recognition datasets from video and motion capture data. Experiments show that the Riemannian batchnorm systematically gives better classification performance compared with leading methods and a remarkable robustness to lack of data.
科研通智能强力驱动
Strongly Powered by AbleSci AI