Robust sound event detection in bioacoustic sensor networks

生物声学 计算机科学 光谱图 过度拟合 背景(考古学) 卷积神经网络 噪音(视频) 语音识别 水准点(测量) 人工智能 模式识别(心理学) 人工神经网络 电信 地图学 生物 古生物学 图像(数学) 地理
作者
Vincent Lostanlen,Justin Salamon,Andrew Farnsworth,Steve Kelling,Juan Pablo Bello
出处
期刊:PLOS ONE [Public Library of Science]
卷期号:14 (10): e0214168-e0214168 被引量:61
标识
DOI:10.1371/journal.pone.0214168
摘要

Bioacoustic sensors, sometimes known as autonomous recording units (ARUs), can record sounds of wildlife over long periods of time in scalable and minimally invasive ways. Deriving per-species abundance estimates from these sensors requires detection, classification, and quantification of animal vocalizations as individual acoustic events. Yet, variability in ambient noise, both over time and across sensors, hinders the reliability of current automated systems for sound event detection (SED), such as convolutional neural networks (CNN) in the time-frequency domain. In this article, we develop, benchmark, and combine several machine listening techniques to improve the generalizability of SED models across heterogeneous acoustic environments. As a case study, we consider the problem of detecting avian flight calls from a ten-hour recording of nocturnal bird migration, recorded by a network of six ARUs in the presence of heterogeneous background noise. Starting from a CNN yielding state-of-the-art accuracy on this task, we introduce two noise adaptation techniques, respectively integrating short-term (60 ms) and long-term (30 min) context. First, we apply per-channel energy normalization (PCEN) in the time-frequency domain, which applies short-term automatic gain control to every subband in the mel-frequency spectrogram. Secondly, we replace the last dense layer in the network by a context-adaptive neural network (CA-NN) layer, i.e. an affine layer whose weights are dynamically adapted at prediction time by an auxiliary network taking long-term summary statistics of spectrotemporal features as input. We show that PCEN reduces temporal overfitting across dawn vs. dusk audio clips whereas context adaptation on PCEN-based summary statistics reduces spatial overfitting across sensor locations. Moreover, combining them yields state-of-the-art results that are unmatched by artificial data augmentation alone. We release a pre-trained version of our best performing system under the name of BirdVoxDetect, a ready-to-use detector of avian flight calls in field recordings.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
liu发布了新的文献求助10
刚刚
1秒前
112233发布了新的文献求助10
1秒前
旁白发布了新的文献求助10
1秒前
1秒前
慕青应助帅气的小鸭子采纳,获得10
1秒前
丘比特应助帅气的小鸭子采纳,获得10
1秒前
科目三应助帅气的小鸭子采纳,获得10
1秒前
Owen应助帅气的小鸭子采纳,获得10
1秒前
蜡笔小心眼子完成签到,获得积分10
2秒前
逆风发布了新的文献求助10
2秒前
2秒前
小鱼发布了新的文献求助10
2秒前
zhengzheng发布了新的文献求助10
2秒前
Xue完成签到,获得积分10
3秒前
晨晨发布了新的文献求助10
3秒前
JJ发布了新的文献求助10
3秒前
3秒前
3秒前
小马甲应助结尾曲采纳,获得30
3秒前
尔蓝红颜完成签到,获得积分10
3秒前
4秒前
读书人给读书人的求助进行了留言
4秒前
做的出来完成签到,获得积分10
4秒前
科研通AI6.4应助lmx采纳,获得10
5秒前
量子星尘发布了新的文献求助10
5秒前
萧离完成签到,获得积分10
6秒前
6秒前
dara997完成签到,获得积分10
6秒前
6秒前
andorado完成签到,获得积分10
6秒前
7秒前
7秒前
7秒前
自由寄柔完成签到,获得积分10
7秒前
8秒前
祎薇应助为你博弈采纳,获得10
8秒前
April完成签到,获得积分10
8秒前
8秒前
Brocade发布了新的文献求助10
8秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Kinesiophobia : a new view of chronic pain behavior 2000
Burger's Medicinal Chemistry, Drug Discovery and Development, Volumes 1 - 8, 8 Volume Set, 8th Edition 1800
Cronologia da história de Macau 1600
BRITTLE FRACTURE IN WELDED SHIPS 1000
Lloyd's Register of Shipping's Approach to the Control of Incidents of Brittle Fracture in Ship Structures 1000
Developmental Peace: Theorizing China’s Approach to International Peacebuilding 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 纳米技术 计算机科学 化学工程 生物化学 物理 复合材料 内科学 催化作用 物理化学 光电子学 细胞生物学 基因 电极 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6138833
求助须知:如何正确求助?哪些是违规求助? 7966828
关于积分的说明 16539196
捐赠科研通 5253533
什么是DOI,文献DOI怎么找? 2805233
邀请新用户注册赠送积分活动 1785870
关于科研通互助平台的介绍 1655946