计算机科学
特征(语言学)
粒度
解码方法
特征学习
代表(政治)
约束(计算机辅助设计)
人工智能
特征提取
失真(音乐)
特征模型
模式识别(心理学)
算法
数学
操作系统
哲学
政治
语言学
放大器
程序设计语言
法学
计算机网络
带宽(计算)
政治学
几何学
软件
作者
Shurun Wang,Shiqi Wang,Wenhan Yang,Xinfeng Zhang,Shanshe Wang,Siwei Ma
标识
DOI:10.1109/icassp39728.2021.9413506
摘要
In this paper, we propose a novel end-to-end feature compression scheme by leveraging the representation and learning capability of deep neural networks, towards intelligent front-end equipped analysis with promising accuracy and efficiency. In particular, the extracted features are compactly coded in an end-to-end manner by optimizing the rate- distortion cost to achieve feature-in-feature representation. The multi-granularity constraint is further imposed, serving as the optimization objective to make the feature compression more "healthier" from the perspective of ultimate utility. More specifically, the analysis accuracy is considered in the coarse granularity level constraint, ensuring the capability of facial analysis with the reconstructed feature. Furthermore, at the fine granularity level the feature fidelity is involved to preserve the original feature quality. Moreover, a latent code level teacher-student enhancement model is proposed to efficiently transfer the low bit-rate representation into a high bit- rate one. Such a strategy further allows us to adaptively shift the representation cost to decoding computations, leading to more flexible feature compression with enhanced decoding capability. We verify the effectiveness of the proposed model with the facial feature, and experimental results reveal better compression performance in terms of rate-accuracy compared with existing models.
科研通智能强力驱动
Strongly Powered by AbleSci AI