计算机科学
搜索引擎索引
可扩展性
分拆(数论)
空间数据库
数据挖掘
人工神经网络
秩(图论)
集合(抽象数据类型)
空间分析
空间分割
数据点
人工智能
数据库索引
树(集合论)
钥匙(锁)
机器学习
算法
数据库
数学
数学分析
程序设计语言
组合数学
统计
计算机安全
作者
Jianzhong Qi,Guanli Liu,Christian S. Jensen,Lars Kulik
标识
DOI:10.14778/3407790.3407829
摘要
Machine learning, especially deep learning, is used increasingly to enable better solutions for data management tasks previously solved by other means, including database indexing. A recent study shows that a neural network can not only learn to predict the disk address of the data value associated with a one-dimensional search key but also outperform B-tree-based indexing, thus promises to speed up a broad range of database queries that rely on B-trees for efficient data access. We consider the problem of learning an index for two-dimensional spatial data. A direct application of a neural network is unattractive because there is no obvious ordering of spatial point data. Instead, we introduce a rank space based ordering technique to establish an ordering of point data and group the points into blocks for index learning. To enable scalability, we propose a recursive strategy that partitions a large point set and learns indices for each partition. Experiments on real and synthetic data sets with more than 100 million points show that our learned indices are highly effective and efficient. Query processing using our indices is more than an order of magnitude faster than the use of R-trees or a recently proposed learned index.
科研通智能强力驱动
Strongly Powered by AbleSci AI