生物多样性
数据质量
数据收集
检查表
计算机科学
全球生物多样性
采样(信号处理)
工作流程
鉴定(生物学)
空间分析
数据挖掘
数据库
地理
生物
生态学
统计
遥感
公制(单位)
古生物学
运营管理
数学
滤波器(信号处理)
经济
计算机视觉
作者
Pablo Hendrigo Alves de Melo,Nadia Bystriakova,Eve Lucas,Alexandre K. Monro
标识
DOI:10.1038/s41598-024-56158-3
摘要
Abstract Biodiversity data aggregators, such as Global Biodiversity Information Facility (GBIF) suffer from inflation of the number of occurrence records when data from different databases are merged but not fully reconciled. The ParseGBIF workflow is designed to parse duplicate GBIF species occurrence records into unique collection events (gatherings) and to optimise the quality of the spatial data associated with them. ParseGBIF provides tools to verify and standardize species scientific names according to the World Checklist of Vascular Plants taxonomic backbone, and to parse duplicate records into unique ‘collection events’, in the process compiling the most informative spatial data, where more than one duplicate is available, and providing crude estimates of taxonomic and spatial data quality. When GBIF occurrence records for a medium-sized vascular plant family, the Myrtaceae, were processed by ParseGBIF, the average number of records useful for spatial analysis increased by 180%. ParseGBIF could therefore be valuable in the evaluation of species’ occurrences at the national scale in support for national biodiversity plans, identification of plant areas important for biodiversity, sample bias estimation to inform future sampling efforts, and to forecast species range shifts in response to global climate change.
科研通智能强力驱动
Strongly Powered by AbleSci AI