学习迁移
计算机科学
生物声学
提取器
人工智能
录音和复制
剪裁(形态学)
传输(计算)
特征(语言学)
机器学习
模式识别(心理学)
电信
语言学
哲学
物理
工艺工程
并行计算
声学
工程类
作者
S. Bhuvaneswari,M. Jagadeesh,V. Subramaniyaswamy
标识
DOI:10.1016/j.ecoinf.2024.102471
摘要
As part of ornithology, bird species classification is vital to understanding species distribution, habitat requirements and environmental changes that affect bird populations. It is possible for ornithologists to assess the health of a certain habitat by tracking changes in bird species distributions. This work has extended an efficient transfer learning technique for labelling and classifying multiple bird species from real-time audio recordings. For this purpose, Wav2vec is fine-tuned using the back propagation technique, which makes the feature extractor more effective in learning each bird's pitch and other sound characteristics. To perform the task, each audio recording has been clipped as chunks from the overlapping audio to determine multi-labels from it. Through the application of transfer learning, the features of audio recordings have been automatically extracted for classification and fed to a feed-forward network. Subsequently, probabilities associated with each audio segment is aggregated through the clipping approach to represent multiple species of bird call. These probability scores are then used to determine the presence of predominant bird species in the audio recording for multi-labelling. The proposed Wav2vec demonstrates remarkable performance, achieving an F1-score of 0.89 using the Xeno-Canto dataset in which outperforming other multi-label classifiers.
科研通智能强力驱动
Strongly Powered by AbleSci AI