人工智能
计算机科学
卷积神经网络
混淆矩阵
模式识别(心理学)
分割
稳健性(进化)
深度学习
特征提取
机器学习
计算机视觉
生物化学
基因
化学
摘要
Biomechanical image recognition has important applications in clinical diagnosis and biomedical engineering, but traditional convolutional neural network (CNN) has limitations in capturing global features. In this paper, a biomechanical image recognition method based on Vision Transformer (ViT) is proposed to improve the classification performance of complex images. Biomechanical image dataset containing five types of data is constructed, and ViT input features are represented by standardization, data enhancement and Patch segmentation. Accuracy, precision, recall, F1 score and confusion matrix are used to evaluate the performance, and compared with ResNet-50 and DenseNet-121. The experimental results show that the accuracy of ViT model is 92.3%, and it performs best in the categories of “normal bones” and “soft tissue lesions”, and other indicators are better than the traditional CNN model. ViT realizes global feature modeling through self-attention mechanism, which significantly improves the recognition accuracy and robustness, provides efficient and accurate technical support for clinical diagnosis, disease screening and surgical planning, and shows its application potential in the field of biomechanical image recognition.
科研通智能强力驱动
Strongly Powered by AbleSci AI