Big Data Analysis (Full English) (16).pdf
What is data reduction?
Data reduction (subtraction) techniques derive a condensed data set from the original, very large data set while preserving the integrity of the original data. Analysis on the condensed data set is therefore noticeably more efficient, and its results are essentially the same as those obtained from the original data set.

Data reduction criteria
The time spent on data reduction should not exceed or offset the time saved by running the analysis on the reduced data.
The reduced data should be much smaller than the original data, yet produce the same or almost the same analysis results.

Data reduction technology

Attribute subset selection
Decision tree induction
Apply decision tree induction to the initial data to obtain an initial decision tree. Attributes that do not appear in the tree are considered irrelevant, so removing them from the initial attribute set yields a better attribute subset.

Reduction based on statistical analysis
Data reduction-Da
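The decision tree induction approach to attribute subset selection described above can be sketched as follows. This is a minimal illustration, not the slides' own implementation: the slides do not name a specific algorithm, so the sketch assumes an ID3-style tree that splits on information gain. The toy data and all names (`outlook`, `windy`, `color`, `build_tree`, `used_attributes`) are hypothetical.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, labels, attr):
    """Entropy reduction obtained by splitting the rows on one attribute."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attr], []).append(label)
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder

def build_tree(rows, labels, attrs):
    """Induce an ID3-style tree (an assumed choice); leaves are class labels."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not attrs:
        return majority
    best = max(attrs, key=lambda a: information_gain(rows, labels, a))
    if information_gain(rows, labels, best) <= 1e-12:
        return majority  # no attribute is informative, so stop splitting
    node = {"attr": best, "children": {}}
    remaining = [a for a in attrs if a != best]
    for value in sorted({row[best] for row in rows}):
        idx = [i for i, row in enumerate(rows) if row[best] == value]
        node["children"][value] = build_tree(
            [rows[i] for i in idx], [labels[i] for i in idx], remaining)
    return node

def used_attributes(tree):
    """Collect every attribute that appears at an internal node of the tree."""
    if not isinstance(tree, dict):
        return set()
    used = {tree["attr"]}
    for child in tree["children"].values():
        used |= used_attributes(child)
    return used

# Hypothetical toy data: the class depends on outlook and windy, never on color.
attrs = ["outlook", "windy", "color"]
rows = [{"outlook": o, "windy": w, "color": c}
        for o in ("sunny", "rainy") for w in ("no", "yes") for c in ("red", "blue")]
labels = ["yes" if r["outlook"] == "sunny" and r["windy"] == "no" else "no"
          for r in rows]

tree = build_tree(rows, labels, attrs)
used = used_attributes(tree)
reduced_attrs = [a for a in attrs if a in used]    # attributes kept
dropped = [a for a in attrs if a not in used]      # attributes treated as irrelevant
reduced_rows = [{a: r[a] for a in reduced_attrs} for r in rows]

print("kept:", reduced_attrs)
print("dropped:", dropped)
```

On this toy data the tree never splits on `color`, so that attribute is dropped and the reduced data set keeps only `outlook` and `windy`. In practice a library tree learner could replace the hand-written one; the selection step (keep only the attributes that appear in the tree) stays the same.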