判别式
计算机科学
图像(数学)
频域
人工智能
人工神经网络
模式识别(心理学)
领域(数学分析)
傅里叶变换
滤波器(信号处理)
计算机视觉
机器学习
数学
数学分析
作者
José Augusto Stuchi,Natália Gil Canto,Romis Ribeiro de Faissol Attux,Levy Boccato
标识
DOI:10.1016/j.asoc.2024.111443
摘要
Machine learning applied to computer vision and signal processing is achieving results comparable to the human brain due to the great improvements brought by deep neural networks (DNN). The majority of state-of-the-art architectures are DNN-related, but only a few explicitly explore the frequency domain to extract useful information and improve the results. This paper presents a new approach for exploring the Fourier transform of the input images, which is composed of trainable frequency filters that boost discriminative components in the spectrum. Additionally, we propose a cropping procedure to allow the network to learn both global and local spectral features of the image blocks. The proposed method proved to be competitive concerning well-known DNN architectures in the selected experiments, which involved texture classification, cataract detection, and retina image analysis, where there is a noticeable appeal for the frequency domain, with the advantage of being a lightweight model.
科研通智能强力驱动
Strongly Powered by AbleSci AI