Building a Spark Big Data Service on OpenStack and Docker


SuperVessel: Enabling Spark as a Service with OpenStack and Docker
Guan Cheng (G.C.) Chen
IBM Research - China
4/18/15

SuperVessel Cloud
- Public cloud built on POWER7/POWER8 servers with OpenStack
- It provides free access for students, researchers, and developers across the world, and helps grow the OpenPOWER ecosystem (used in 30+ universities now)
- It provides advanced technology services such as Spark as a Service, Docker Services, Cognitive Computing Service, IoT Service, and Accelerator as a Service (FPGA and GPU)

Spark as a Service
- Step 1: Login
- Step 2: Create
- Step 3: Ready!
- 3 steps to launch a Spark cluster, easy!
- Try it at:

Why OpenStack?
- Most popular IaaS software
- Supports Docker*
- Heat can orchestrate Docker containers easily, which is good for provisioning a Spark cluster
Picture source: http:/

Why Docker?
- Less resource consumption than KVM: we can provision more Spark clusters!
- Boots faster than KVM: users like fast provisioning!
- Incrementally build, revert and reuse your container: we love Git and AUFS!
- However, Docker is not officially supported in OpenStack yet
  - Nova Docker is an external component of OpenStack
- Port Docker to the POWER architecture (ppc64 and ppc64le)
  - Ubuntu 15.04 includes Docker for POWER8 ppc64le
Picture source: http:/ darn-popular/

Why Spark?
- Fast
- Unified
- Ecosystem
- Porting to POWER
  - Bugfix submitted to the community
  - Spark 1.3 works smoothly! :)

Why not Sahara?
- Sahara is a component for Hadoop/Spark as a Service in OpenStack
- We started from OpenStack Icehouse
- Dockerization
  - Better service deployment and isolation for the Big Data dashboard server
- Customization
  - Waiting for Sahara's improvements is somehow *SLOW*
  - Docker, user analytics, Spark 1.4, Spark IDE, scheduling, data visualization etc.

Architecture Design
[Diagram. Labels: Big Data Dashboard, Billing & Auth, Keystone, Glance, Neutron, Heat, Nova, Nova Docker, Cinder/Manila, Docker Image. Spark Cluster: Container 1, Container 2, Container 3, running Spark Master, Spark Worker, Spark Worker, NameNode, Spark Driver, DataNode, DataNode.]

Dockerize everything!
- We use containers to
  - run applications
  - run other containers
  - run OpenStack python daemons
  - run OpenStack services inside OpenStack (with a special trunk link in neutron/OVS)
  - run OpenStack inside OpenStack
- Good for multi-site expansion

Heat template design
- Heat is a component for orchestration in OpenStack
- Parameters
  - Cinder/Manila/Neutron uuid
  - Size of Cinder/Manila resources
- Resources
  - Master/slave node
  - Neutron
  - Cinder/Manila
- Need to modify nova-docker to mount the Cinder/Manila resources when booting the Docker container

Spark Docker Image
- Built from Ubuntu 14.04.1
- All Spark nodes use the same image, with different initialization scripts supplied through cloud-init
- Initialization scripts will
  - Sync /etc/hosts across all nodes
  - Set HDFS and Spark configurations accordingly
  - Format HDFS and launch HDFS
  - Launch Spark

Big Data Dashboard Development
- 2 developers (front end + back end)
- Online in 2 months
- Separates Heat-related stuff from the Dashboard RESTful API (for billing, authentication etc.)
- Dockerize the big data dashboard server
  - Separates development and production environments

Where should I put the data?
Shared File System for Cloud and Spark as a Service
[Diagram. Cloud Infrastructure Service: Horizon, OpenStack controller, HEAT, Neutron, Glance, Manila, Nova, KeyStone, Cinder. Big Data Service: select big data computing framework (MapReduce, Spark), select cluster size, select data folder size, HEAT template for big data cluster. Instances: Docker (Symphony), Docker (SPARK), KVM/Docker (Web app) on POWER7/POWER8, with Folder A and Folder B for User A and User B. Storage: GPFS FPO on POWER7/POWER8 servers.]
- HEAT will orchestrate Docker instances, subnet and data folder based on the user's request
- Manila provides the NFS service using GPFS as backend, and the folder will be mounted via nova-docker (with -v support)
- A folder created by Manila can be accessed by the KVM/Docker instances created for big data and other purposes

SuperVessel Services Roadmap
- SuperVessel Cloud Service
  1. VM and container service
  2. Storage service
  3. Network service
  4. Accelerator as service
  5. Image service
- SuperVessel Big Data and HPC Service
  1. Big Data: MapReduce (Symphony), SPARK
  2. Performance tuning service
- OpenPOWER Enablement Service
  1. X-to-P migration: AutoPort tool
  2. OpenPOWER new system test service
- Super Class Service
  1. On-line video courses
  2. Teacher course management
  3. User contribution management
- Super Project Team Service
  1. Project management service
  2. DevOps automation
- Super Marketplace
- SuperVessel Cloud Infrastructure: Storage, IBM POWER servers, OpenPOWER server, FPGA/GPU, Docker
- Status labels shown on the slide: (Online) (Online) (Preparing) (Online)

Summary
- Spark + OpenStack + Docker works very well on OpenPOWER servers
- Dockerized services made DevOps easier
- Docker issues
  - Zombie processes
  - Can't dynamically attach volumes
  - Commit and attach operations are not supported in Nova-docker
- Monitoring everything (API, resources etc.) is key for operating a cloud
- TODO: Spark 1.4, Spark IDE, SWIFT, Data Visualization

Join Us!
- QQ group: SuperVessel
- SuperVessel WeChat group
- 冠诚
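
The "Heat template design" slide lists the parameters (Cinder/Manila/Neutron uuid, resource size) and resources (master/slave nodes) but shows no template. The following is a minimal sketch of how such a template could be generated, not the SuperVessel template itself. It assumes that containers are booted through nova-docker as ordinary `OS::Nova::Server` resources; the parameter names, the default image and flavor names, and the `spark_role` / `share_id` metadata keys are all hypothetical. The slides note that nova-docker had to be modified to mount Cinder/Manila resources, so the mount step is deliberately left out.

```python
import json


def build_spark_heat_template(num_workers,
                              image="spark-docker-image",  # hypothetical Glance image name
                              flavor="m1.small"):          # hypothetical flavor name
    """Build a HOT-style template dict with one master and num_workers workers."""
    if num_workers < 1:
        raise ValueError("a Spark cluster needs at least one worker")

    def server(role):
        # Every node uses the same image; the role is passed as metadata so
        # that an initialization script could branch on it (assumption).
        return {
            "type": "OS::Nova::Server",
            "properties": {
                "image": {"get_param": "image"},
                "flavor": {"get_param": "flavor"},
                "networks": [{"network": {"get_param": "network_id"}}],
                "metadata": {
                    "spark_role": role,                     # hypothetical key
                    "share_id": {"get_param": "share_id"},  # hypothetical key
                },
            },
        }

    resources = {"spark_master": server("master")}
    for i in range(num_workers):
        resources["spark_worker_%d" % i] = server("worker")

    return {
        "heat_template_version": "2013-05-23",
        "description": "Sketch: Spark cluster with %d workers" % num_workers,
        "parameters": {
            "image": {"type": "string", "default": image},
            "flavor": {"type": "string", "default": flavor},
            "network_id": {"type": "string",
                           "description": "Neutron network uuid"},
            "share_id": {"type": "string",
                         "description": "Cinder/Manila resource uuid"},
            "share_size_gb": {"type": "number", "default": 10},
        },
        "resources": resources,
    }


if __name__ == "__main__":
    # JSON is valid YAML, so the dump can be saved as a template file.
    print(json.dumps(build_spark_heat_template(2), indent=2, sort_keys=True))
```

Generating the resource section in a loop keeps the cluster size a single input, which matches the dashboard's "select cluster size" step.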

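
The "Spark Docker Image" slide says the initialization scripts sync /etc/hosts, write the HDFS and Spark configuration, format and launch HDFS, and then launch Spark. The sketch below illustrates those four steps as pure functions that render file contents and list the start-up commands. It is an illustration under assumptions, not the scripts used by SuperVessel: it assumes Hadoop 2.x (property `fs.defaultFS`, `hdfs namenode -format`), Spark standalone mode with a `conf/slaves` file, and hypothetical host names and port.

```python
def render_hosts(nodes):
    """Render /etc/hosts content from a list of (ip, hostname) pairs."""
    lines = ["127.0.0.1 localhost"]
    lines += ["%s %s" % (ip, name) for ip, name in nodes]
    return "\n".join(lines) + "\n"


def render_slaves(nodes, master):
    """Render the worker list (one hostname per line), excluding the master."""
    return "".join(name + "\n" for _, name in nodes if name != master)


def render_core_site(master, port=9000):
    """Render a minimal core-site.xml pointing HDFS clients at the NameNode."""
    return (
        '<?xml version="1.0"?>\n'
        "<configuration>\n"
        "  <property>\n"
        "    <name>fs.defaultFS</name>\n"
        "    <value>hdfs://%s:%d</value>\n"
        "  </property>\n"
        "</configuration>\n" % (master, port)
    )


def startup_commands(role):
    """Return the shell commands a node would run after configuration.

    Only the master starts services; in this sketch the Hadoop and Spark
    start scripts reach the workers over SSH using the slaves file.
    """
    if role != "master":
        return []
    return [
        "$HADOOP_HOME/bin/hdfs namenode -format",  # format HDFS
        "$HADOOP_HOME/sbin/start-dfs.sh",          # launch HDFS
        "$SPARK_HOME/sbin/start-all.sh",           # launch Spark master and workers
    ]


if __name__ == "__main__":
    # Hypothetical three-node cluster.
    cluster = [("10.0.0.2", "spark-master"),
               ("10.0.0.3", "spark-worker-0"),
               ("10.0.0.4", "spark-worker-1")]
    print(render_hosts(cluster))
    print(render_slaves(cluster, "spark-master"))
    print(render_core_site("spark-master"))
    print("\n".join(startup_commands("master")))
```

Keeping the rendering separate from execution makes the same image usable for every node: only the role passed in at boot changes what the script does.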