(5.15.1)--Chapter5-6Inmemorycomputing-Spar.pdf
In-Memory Computing: Spark

Data Processing System Architecture

Layers: data application system, data processing system, data storing system.
The data processing system comprises the computing algorithm, the computing model, and the computing platform & engine.

Computing platforms provide various development kits and operating environments.

Computing models for different types of data, such as:
1. Batch processing model for massive data (MapReduce)
2. Stream computing model for dynamic data streams
3. Massively parallel processing (MPP) model for structured data
4. In-memory computing model for large-scale physical memory
5. Data flow graph model

Computing engines: Hadoop, Spark, Storm, etc.

L3-Spark

Spark was started by Matei Zaharia at UC Berkeley's AMP Lab in 2009 and open sourced in 2010. In 2013 it was donated to the Apache Software Foundation. It is a Top-Level Apache Project and one of the most active open source big data projects.

Spark is a parallel processing framework based on the in-memory computing model. It can be built on the Hadoop platform and use the HDFS file system to store data, but a Resilient Distributed Dataset (RDD) architecture is built on top of the file system to support efficient distributed in-memory computing.

What is Spark

RDD (Resilient Distributed Dataset)

Spark Driver (ru
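
The RDD idea mentioned above can be illustrated with a toy, single-process sketch. It is not Spark's API: the class ToyRDD and all of its methods (parallelize, map, filter, cache, collect, drop_cache) are hypothetical names chosen for illustration. It only assumes the commonly described RDD properties: data is split into partitions, transformations are recorded lazily as lineage, and computed partitions can be kept in memory and rebuilt from lineage if lost.

```python
class ToyRDD:
    """Hypothetical single-process stand-in for an RDD (not the PySpark API)."""

    def __init__(self, partitions=None, parent=None, fn=None):
        self._partitions = partitions  # source data, already split into partitions
        self._parent = parent          # lineage: the dataset this one derives from
        self._fn = fn                  # function applied to each parent partition
        self._cached = False           # whether results should be kept in memory
        self._cache = None             # computed partitions held in memory
        self.computations = 0          # how often this dataset was recomputed

    @staticmethod
    def parallelize(data, num_partitions):
        # Round-robin split of the input into num_partitions lists.
        data = list(data)
        return ToyRDD(partitions=[data[i::num_partitions]
                                  for i in range(num_partitions)])

    def map(self, f):
        # Lazy: only records the transformation, computes nothing yet.
        return ToyRDD(parent=self, fn=lambda part: [f(x) for x in part])

    def filter(self, pred):
        return ToyRDD(parent=self, fn=lambda part: [x for x in part if pred(x)])

    def cache(self):
        self._cached = True
        return self

    def drop_cache(self):
        # Simulates losing the in-memory copy; lineage still allows a rebuild.
        self._cache = None

    def _compute(self):
        if self._cache is not None:
            return self._cache
        if self._parent is None:
            result = self._partitions
        else:
            self.computations += 1
            result = [self._fn(p) for p in self._parent._compute()]
        if self._cached:
            self._cache = result
        return result

    def collect(self):
        # Action: triggers the computation and flattens the partitions.
        return [x for part in self._compute() for x in part]


numbers = ToyRDD.parallelize(range(10), num_partitions=3)
squares = numbers.map(lambda x: x * x).cache()
even_squares = squares.filter(lambda x: x % 2 == 0)

first = sorted(even_squares.collect())   # computes squares once, then caches them
second = sorted(even_squares.collect())  # reuses the in-memory squares
```

In this sketch, nothing is computed until collect is called, and the second collect reuses the cached squares rather than recomputing them. Calling squares.drop_cache() and collecting again rebuilds the squares from the recorded lineage, which is the sense of "resilient" this toy model tries to convey.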