地理编码
鉴定(生物学)
垂钓
离群值
算法
计算机科学
环境科学
数据挖掘
遥感
人工智能
地质学
渔业
植物
生物
摘要
In modern fishery management, fishing vessel trajectory data are used to monitor and analyze fishing vessel activities. However, trajectory data are often of low quality, probably due to environmental factors, equipment failures, signal loss and operation errors, leading to numerous outliers in these data. These outliers not only undermine the credibility of the data but also negatively affect the subsequent data mining and decision-making. In this study, a data cleaning method for the identification of outlier points in fishing vessel trajectories based on the Geohash geocoding algorithm is given, which involves several key steps: obtaining and preprocessing the raw trajectory data; generating the corresponding Geohash codes for each ship position based on its latitude and longitude; calculating the reachable distance considering the time interval between the current point and the following points and their speeds; querying the neighborhood of the current point based on the reachable distance; and obtaining all Geohash codes of the reachable areas of the fishing vessels within the time interval as the reachable range grid set of the current position. The reachable range grid set of the current position is compared with the reachable range grid sets of the previous point identified as normal and the next point in the fishing vessel trajectory. If there is no intersection, it is determined that the current fishing vessel position is an outlier, and this point will be excluded. The method proposed in this study is able to effectively identify outliers in trajectory data, achieving efficient and effective trajectory data cleaning and improving the accuracy and reliability of the data.
科研通智能强力驱动
Strongly Powered by AbleSci AI