数据治理
数据质量
可比性
透明度(行为)
1998年数据保护法
计算机科学
数据科学
信息治理
通用数据保护条例
背景(考古学)
文档
数据完整性
标识符
数据验证
数据匿名化
质量(理念)
欧洲联盟
标准化
元数据
数据挖掘
数据整理
信息隐私
数据安全
数据集成
最佳实践
唯一标识符
数据管理
原始数据
标杆管理
公司治理
打开数据
信息存储库
数据收集
医疗保健
数据共享
链接数据
知识管理
数据建模
资料保护方针
数据虚拟化
作者
Puja Myles,Eleanor Axson,Colin Mitchell
摘要
There have been numerous papers discussing data quality and data protection independently, but there has been little discussion on how data quality relates to data protection and other data governance regulatory frameworks. This paper is a step towards addressing that gap and makes the case for why data quality is relevant for data protection and legal compliance professionals. Real-world data in the context of healthcare refers to data that is routinely collected in the course of delivering healthcare. From a data protection regulatory perspective, Article 5 of the General Data Protection Regulation (GDPR) lists data accuracy as one of the principles for data processing. The recently adopted European Union Artificial Intelligence Act (EU AI Act) Article 10 outlines requirements for data and data governance, specifically quality criteria for datasets used to train, test and validate high-risk AI models to address concerns around algorithmic bias due to biases in the training data. The Standards for Data Diversity, Inclusivity and Generalisability (STANDING) Together consensus recommendations for dataset curators on transparency in dataset documentation enable an informed assessment of the suitability of data and examination of biases, for development of AI health technologies. This includes information on data provenance, modifications, sociodemographic composition and bias assessment findings. The Clinical Practice Research Datalink (CPRD) database is used to illustrate how these recommendations can be implemented in a practical way using unique identifiers such as digital object identifiers (DOIs), metadata, published data resource profiles with sociodemographic information and data quality assessments using validation and comparability studies. There is considerable alignment between established scientific standards, medical product regulatory and data governance legal requirements on data quality, as well as emerging international consensus which will reduce the compliance burden on curators and users of real-world data. This article is also included in The Business & Management Collection which can be accessed at https://hstalks.com/business/.
科研通智能强力驱动
Strongly Powered by AbleSci AI