计算机科学
机器视觉
人工智能
多路复用
能源消耗
卷积(计算机科学)
人工神经网络
计算机视觉
电气工程
电信
工程类
作者
Hanyu Zheng,Quan Liu,Ivan I. Kravchenko,Xiaomeng Zhang,Yuankai Huo,Jason Valentine
标识
DOI:10.1038/s41565-023-01557-2
摘要
Rapid developments in machine vision technology have impacted a variety of applications, such as medical devices and autonomous driving systems. These achievements, however, typically necessitate digital neural networks with the downside of heavy computational requirements and consequent high energy consumption. As a result, real-time decision-making is hindered when computational resources are not readily accessible. Here we report a meta-imager designed to work together with a digital back end to offload computationally expensive convolution operations into high-speed, low-power optics. In this architecture, metasurfaces enable both angle and polarization multiplexing to create multiple information channels that perform positively and negatively valued convolution operations in a single shot. We use our meta-imager for object classification, achieving 98.6% accuracy in handwritten digits and 88.8% accuracy in fashion images. Owing to its compactness, high speed and low power consumption, our approach could find a wide range of applications in artificial intelligence and machine vision applications. A metasurface-based approach is used to implement computationally expensive digital convolution operations in high-speed, low-power optics for improving the latency and power consumption of machine vision systems.
科研通智能强力驱动
Strongly Powered by AbleSci AI