数据科学
杠杆(统计)
计算机科学
科学发现
数据发现
领域(数学分析)
人工智能
万维网
认知科学
元数据
心理学
数学
数学分析
作者
Anuj Karpatne,Gowtham Atluri,James H. Faghmous,Michael Steinbach,Arindam Banerjee,Auroop R. Ganguly,Shashi Shekhar,Nagiza Samatova,Vipin Kumar
标识
DOI:10.1109/tkde.2017.2720168
摘要
Data science models, although successful in a number of commercial domains,\nhave had limited applicability in scientific problems involving complex\nphysical phenomena. Theory-guided data science (TGDS) is an emerging paradigm\nthat aims to leverage the wealth of scientific knowledge for improving the\neffectiveness of data science models in enabling scientific discovery. The\noverarching vision of TGDS is to introduce scientific consistency as an\nessential component for learning generalizable models. Further, by producing\nscientifically interpretable models, TGDS aims to advance our scientific\nunderstanding by discovering novel domain insights. Indeed, the paradigm of\nTGDS has started to gain prominence in a number of scientific disciplines such\nas turbulence modeling, material discovery, quantum chemistry, bio-medical\nscience, bio-marker discovery, climate science, and hydrology. In this paper,\nwe formally conceptualize the paradigm of TGDS and present a taxonomy of\nresearch themes in TGDS. We describe several approaches for integrating domain\nknowledge in different research themes using illustrative examples from\ndifferent disciplines. We also highlight some of the promising avenues of novel\nresearch for realizing the full potential of theory-guided data science.\n
科研通智能强力驱动
Strongly Powered by AbleSci AI