计算机科学
机器学习
人工智能
数据建模
数据管理
学习管理
基数(数据建模)
管理制度
数据挖掘
软件工程
工程类
多媒体
运营管理
作者
Guoliang Li,Xuanhe Zhou
标识
DOI:10.1109/icde53745.2022.00297
摘要
Machine learning techniques have been proposed to optimize data management in recent years. Compared with traditional empirical data management, learning-based methods extract knowledge from historical tasks, generalize the extracted knowledge to similar new tasks, and can achieve better performance in many scenarios (e.g., knob tuning, cardinality estimation). However, data management systems require to handle various and dynamic workloads in different scenarios, and there are some challenges in applying machine learning techniques for data management systems. First, with various workloads and hundreds of system metrics, how to select and characterize effective features for data management problems? Second, with diversified machine learning models, how to design the proper models? Third, with various data management requirements, how to validate whether the machine learning models can meet the requirements? In this tutorial, we discuss existing learning-based data management studies and how they solve the above challenges, and provide some future research directions.
科研通智能强力驱动
Strongly Powered by AbleSci AI