A proposed model for data warehouse ETL processes

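Data warehouse, Computer science, Dimensional modeling, Database, Online analytical processing, Star schema, Conceptual model, Conceptual schema, Unified Modeling Language, Personalization, Process (computing), Data mining, Software, Database design, Database schema, World Wide Web, Psychology, Developmental psychology, Gender schema theory, Programming language, Operating system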
Authors
Shaker El–Sappagh, Abdeltawab Hendawi, Ali Hamed El Bastawissy
Source
Journal: Journal of King Saud University - Computer and Information Sciences [Elsevier BV]
Volume (Issue): 23 (2): 91-104 Citations: 147
Identifier
DOI: 10.1016/j.jksuci.2011.05.005
Abstract

Extraction–transformation–loading (ETL) tools are pieces of software responsible for extracting data from several sources, then cleansing, customizing, reformatting, integrating, and inserting it into a data warehouse. Building the ETL process is potentially one of the biggest tasks in building a warehouse: it is complex and time-consuming, and it consumes most of a data warehouse project's implementation effort, cost, and resources. Building a data warehouse requires a close understanding of three main areas: the source area, the destination area, and the mapping area (ETL processes). The source area has standard models such as the entity-relationship diagram, and the destination area has standard models such as the star schema, but the mapping area still has no standard model. Despite the importance of ETL processes, little research has been done in this area because of its complexity, and there is a clear lack of a standard model for representing ETL scenarios. This paper surveys the efforts made to conceptualize ETL processes. Research on modeling ETL processes falls into three main approaches: modeling based on mapping expressions and guidelines, modeling based on conceptual constructs, and modeling based on the UML environment. These projects try to represent the main mapping activities at the conceptual level. Because the proposed solutions for the conceptual design of ETL processes differ from one another and each has limitations, this paper also proposes a model for the conceptual design of ETL processes. The proposed model builds on and enhances the earlier models to support some mapping features they lack.
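
To make the three areas named in the abstract concrete, the following is a minimal sketch of an ETL run, not the model proposed in the paper. Everything in it is an assumption made for illustration: the two in-memory sources, their field names, the day/month/year date format of the second source, and the table names `dim_customer` and `fact_sales`. It uses only Python's standard `sqlite3` module, with a tiny star schema as the destination area.

```python
import sqlite3

# Source area: two hypothetical sources with inconsistent formats
# (illustrative data only, not taken from the paper).
SOURCE_A = [
    {"customer": " alice ", "amount": "10.50", "date": "2011-01-05"},
    {"customer": "BOB", "amount": "", "date": "2011-01-06"},  # missing amount
]
SOURCE_B = [
    {"cust_name": "Alice", "total_cents": 250, "day": "06/01/2011"},
]


def extract():
    """Extraction: pull raw rows from each source, tagged with their origin."""
    return [("A", r) for r in SOURCE_A] + [("B", r) for r in SOURCE_B]


def transform(tagged_rows):
    """Mapping area: cleanse, reformat and integrate into one row shape."""
    clean = []
    for origin, row in tagged_rows:
        if origin == "A":
            if not row["amount"]:
                continue  # cleansing: drop rows that have no amount
            name = row["customer"]
            amount = float(row["amount"])
            date = row["date"]
        else:
            # Assumed DD/MM/YYYY in source B; reformat to ISO YYYY-MM-DD.
            day, month, year = row["day"].split("/")
            name = row["cust_name"]
            amount = row["total_cents"] / 100
            date = f"{year}-{month}-{day}"
        clean.append((name.strip().title(), amount, date))
    return clean


def load(conn, rows):
    """Destination area: one dimension table and one fact table."""
    conn.execute(
        "CREATE TABLE dim_customer ("
        "customer_id INTEGER PRIMARY KEY, name TEXT UNIQUE)"
    )
    conn.execute(
        "CREATE TABLE fact_sales ("
        "customer_id INTEGER REFERENCES dim_customer(customer_id), "
        "sale_date TEXT, amount REAL)"
    )
    for name, amount, date in rows:
        # Integration: the same customer from both sources maps to one key.
        conn.execute(
            "INSERT OR IGNORE INTO dim_customer (name) VALUES (?)", (name,)
        )
        (customer_id,) = conn.execute(
            "SELECT customer_id FROM dim_customer WHERE name = ?", (name,)
        ).fetchone()
        conn.execute(
            "INSERT INTO fact_sales VALUES (?, ?, ?)",
            (customer_id, date, amount),
        )


def run_etl():
    """Run extract -> transform -> load against an in-memory warehouse."""
    conn = sqlite3.connect(":memory:")
    load(conn, transform(extract()))
    return conn
```

In this sketch the mapping logic lives only in `transform` and `load` as ad hoc code, which is the situation the abstract describes: the source and destination have standard models, while the mapping between them has none.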
