计算机科学
桥接(联网)
多元统计
数据科学
维数(图论)
数据挖掘
数据集
集合(抽象数据类型)
数据分析
样品(材料)
生物学数据
高维数据聚类
多元分析
理论计算机科学
机器学习
人工智能
聚类分析
数学
程序设计语言
生物信息学
色谱法
化学
生物
纯数学
计算机网络
标识
DOI:10.1017/cbo9781139025805
摘要
'Big data' poses challenges that require both classical multivariate methods and contemporary techniques from machine learning and engineering. This modern text equips you for the new world - integrating the old and the new, fusing theory and practice and bridging the gap to statistical learning. The theoretical framework includes formal statements that set out clearly the guaranteed 'safe operating zone' for the methods and allow you to assess whether data is in the zone, or near enough. Extensive examples showcase the strengths and limitations of different methods with small classical data, data from medicine, biology, marketing and finance, high-dimensional data from bioinformatics, functional data from proteomics, and simulated data. High-dimension low-sample-size data gets special attention. Several data sets are revisited repeatedly to allow comparison of methods. Generous use of colour, algorithms, Matlab code, and problem sets complete the package. Suitable for master's/graduate students in statistics and researchers in data-rich disciplines.
科研通智能强力驱动
Strongly Powered by AbleSci AI