自编码
计算机科学
稳健性(进化)
人工智能
机器学习
卷积神经网络
特征提取
模式识别(心理学)
数据挖掘
药丸
特征学习
深度学习
特征(语言学)
预处理器
人工神经网络
软件部署
标识符
神经编码
特征向量
图像处理
计算机视觉
一般化
作者
Weihua Wang,Xiaoqian Jing,Songyao Jiang,Yi Han
标识
DOI:10.1109/jiot.2026.3664411
摘要
Pill recognition is critical for preventing medication misuse and ensuring patient safety, yet it remains challenging due to the growing number of pharmaceutical products and their highly similar visual appearances. Existing pill recognition methods often rely on single-scale feature representations and deterministic models, which limits their robustness and generalization under complex imaging conditions such as blur, illumination variation, and pill wear. To address these limitations, we propose an IoT-based Multi-Scale Variational Autoencoder (Pill-MSVAE) for automatic pill image recognition. The proposed framework integrates convolutional neural networks and Transformers to jointly capture local texture details and global structural information, while variational encoding with reconstruction constraints enhances feature robustness and generalization. Extensive experiments on a public pill image dataset show that Pill-MSVAE achieves an F1-score of 0.9718, Accuracy of 0.9716, Precision of 0.9727, and AUC of 0.9989, consistently outperforming state-of-the-art methods across all evaluation metrics. These results validate the effectiveness of the proposed approach for reliable pill recognition and highlight its potential for practical deployment in IoT-enabled healthcare and medication management systems.
科研通智能强力驱动
Strongly Powered by AbleSci AI