云计算技术及应用ppt课件.ppt

上传人:本田雅阁 文档编号:3248158 上传时间:2019-08-06 格式:PPT 页数:39 大小:1.67MB
返回 下载 相关 举报
云计算技术及应用ppt课件.ppt_第1页
第1页 / 共39页
云计算技术及应用ppt课件.ppt_第2页
第2页 / 共39页
云计算技术及应用ppt课件.ppt_第3页
第3页 / 共39页
云计算技术及应用ppt课件.ppt_第4页
第4页 / 共39页
云计算技术及应用ppt课件.ppt_第5页
第5页 / 共39页
点击查看更多>>
资源描述

《云计算技术及应用ppt课件.ppt》由会员分享,可在线阅读,更多相关《云计算技术及应用ppt课件.ppt(39页珍藏版)》请在三一文库上搜索。

1、云计算技术及应用,大连理工大学计算机科学与技术学院 2010年春季,基本情况,申彦明 B810 助教:齐恒 B812 Office hour: Fri 3:30-4:30 PM Course website: http:/ 教材内容 Project 论文,教材内容,分布式系统的概况 分布式与集群基本概念 分布式数据库 分布式文件系统 GFS 分布式编程 MapReduce算法介绍 搜索引擎与PageRank 其它相关技术 Data Center BigTable AppEngine,Grading,HW:40% Final Project: 60% Final project propos

2、al Project reports 12 teams, 4-5 students,Syllabus (Subject to change),Week 2 Mar 8: Lecture 1: Introduction Mar 10: Lecture 2: Map/Reduce Theory and Implementation, Hadoop Week 3 Mar 15: Lecture 3 & 4: Guest Speaker (8:00 AM-11:35AM研教楼102) Mar 17: Lecture 5: Distributed File System and the Google F

3、ile System Week 4 Mar 22: Lecture 6 & 7: Guest Speaker(8:00 AM-11:35AM研教楼102) Mar 24: Lecture 8: Distributed Graph Algorithms and PageRank Week 5 Mar 29: Lecture 9: Introduction to Some Projects Mar 31: Lecture 10: Data Centers,Syllabus (Subject to change),Week 6 Apr 5: Lecture 11: Some Google Techn

4、ologies Apr 7: Lecture 12: Virtualization Week 7 Lecture 13 & 14: Project Presentation Week 8: No class Week 9: Lecture 15 &16: Project Presentation,Gartner Report,Top 10 Strategic Technology Areas for 2009 Virtualization Cloud Computing Servers: Beyond Blades Web-Oriented Architectures Enterprise M

5、ashups Specialized Systems Social Software and Social Networking Unified Communications Business Intelligence Green Information Technology,Top 10 Strategic Technology Areas for 2010 Cloud Computing Advanced Analytics Client Computing IT for Green Reshaping the Data Center Social Computing Security A

6、ctivity Monitoring Flash Memory Virtualization for Availability Mobile Applications,From Desktop/HPC/Grids to Internet Clouds in 30 Years,HPC moving from centralized supercomputers to geographically distributed desktops, clusters, and grids to clouds over last 30 years R/D efforts on HPC, clusters,

7、Grids, P2P, and virtual machines has laid the foundation of cloud computing that has been greatly advocated since 2007 Location of computing infrastructure in areas with lower costs in hardware, software, datasets, space, and power requirements moving from desktop computing to datacenter-based cloud

8、s,What is Cloud Computing?,1. Web-scale problems 2. Large data centers 3. Different models of computing 4. Highly-interactive Web applications,1. “Web-Scale” Problems,Characteristics: Definitely data-intensive May also be processing intensive Examples: Crawling, indexing, searching, mining the Web D

9、ata warehouses Sensor networks “Post-genomics” life sciences research Other scientific data (physics, astronomy, etc.) Web 2.0 applications ,How much data?,Google processes 20 PB a day (2008) “all words ever spoken by human beings” 5 EB CERNs LHC will generate 10-15 PB a year,640K ought to be enough

10、 for anybody.,What to do with more data?,Answering factoid questions Pattern matching on the Web Works amazingly well Learning relations Start with seed instances Search for patterns on the Web Using patterns to find more instances,How do I make money?,Petabytes of valuable customer data Sitting idl

11、e in existing data warehouses Overflowing out of existing data warehouses Simply being thrown away Source of data: OLTP User behavior logs Call-center logs Web crawls, public datasets Structured data (today) vs. unstructured data (tomorrow) How can an organization derive value from all this data?,2.

12、 Large Data Centers,Web-scale problems? Throw more machines at it! Centralization of resources in large data centers Necessary ingredients: fiber, juice, and land What do Oregon, Iceland, and abandoned mines have in common? Important Issues: Efficiency Redundancy Utilization Security Management over

13、head,3. Different Computing Models,Utility computing Why buy machines when you can rent cycles? Examples: Amazons EC2 Platform as a Service (PaaS) Give me nice API and take care of the implementation Example: Google App Engine Software as a Service (SaaS) Just run it for me! Example: Gmail,“Why do i

14、t yourself if you can pay someone to do it for you?”,4. Web Applications,What is the nature of future software applications? From the desktop to the browser SaaS = Web-based applications Examples: Google Maps, Facebook How do we deliver highly-interactive Web-based applications? AJAX (asynchronous J

15、avaScript and XML) A hack on top of a mistake built on sand, all held together by duct tape and chewing gum?,Some Cloud Definitions,Ian Foster et al defined cloud computing as a large-scale distributed computing paradigm, that is driven by economics of scale, in which a pool of abstracted virtualize

16、d, dynamically-scalable, managed computing power, storage, platforms, and services are delivered on demand to external customers over the internet(云计算是一种商业计算模型。它将计算任务分布在大量计算机构成的资源池上,使各种应用系统能够根据需要获取计算力、存储空间和各种软件服务。) IBM experts consider clouds that can: Host a variety of different workloads, includin

17、g batch-style backend interactive, user-facing applications Allow workloads to be deployed and scaled-out quickly through the rapid provisioning of virtual machines or physical machines Support redundant, self-recovering, highly scalable programming models that allow workloads to recover from HW/SW

18、failures Monitor resource use in real time to rebalance allocations on demand,Internet Cloud Goals,Sharing of peak-load capacity among a large pool of users, improving overall resource utilization Separation of infrastructure maintenance duties from domain-specific application development Major clou

19、d applications include upgraded web services, distributed data storage, raw supercomputing, and access to specialized Grid, P2P, data-mining, and content networking services,Three Aspects in Hardware that are New in Cloud Computing,The illusion of infinite computing resources available on demand, th

20、ereby eliminating the need for cloud users to plan far ahead for provisioning The elimination of an up-front commitment by cloud users, thereby allowing companies to start small and increase hardware resources when needed The ability to pay computing resources on a short-term basis as needed (e.g.,

21、processors by the hour and storage by the day) and release them after done and thereby rewarding resource conservation,Some Innovative Cloud Services and Application Opportunities,Smart and pervasive cloud applications for individuals, homes, communities, companies, and governments, etc. Coordinated

22、 Calendar, Itinerary, job management, events, and consumer record management (CRM) services Coordinated word processing, on-line presentations, web-based desktops, sharing on-line documents, datasets, photos, video, and databases, etc Deploy conventional cluster, grid, P2P, social networking applica

23、tions in cloud environments, more cost-effectively Earthbound Applications that Demand Elasticity and Parallelism rather data movement Costs,Operations in Cloud Computing,Users interact with the cloud to request service Provisioning tool carves out the systems from the cloud configuration or reconfi

24、guration, or deprovision The servers can be either real or virtual machines Supporting resources include distributed storage system, datacenters, security devices, etc.,Cloud Computing Instances,Google Amazon Microsoft Azure IBM Blue Cloud,Google Cloud Infrastructure,Scheduler,Chubby,GFS master,Node

25、,Node,Node,User,Application,Scheduler slave,GFS chunkserver,Linux,Node,MapReduce Job,BigTable Server,Google Cloud Infrastructure,Amazon Elastic Computing Cloud,SQS: Simple Queue Service EC2: Running Instance of Virtual Machines EBS: Elastic Block Service, Providing the Block Interface, Storing Virtu

26、al Machine Images S3: Simple Storage Service, SOAP, Object Interface SimpleDB: Simplified Database,Microsoft Azure Platform,Developer,Monitoring,Application Server,Provisioning Manager,User,Open Source Linux with Xen,Tivoli Monitoring Agent,IBM Blue Cloud,Cost Considerations : Power, Cooling, Physic

27、al Plant, and Operational Costs,Cost technology costs cost of security etc.,Benefits availability opportunity consolidation etc.,Cost Breakdown,+ Storage ($/MByte/year) + Computing ($/CPU Cycles) + Networking ($/bit),Research Challenges,Service availability S3 outage: authentication service overload

28、 leading to unavailability AppEngine partial outage programming error Gmail: site unavailable Solutions: The management of a Cloud Computing service by a single company results in a single point of failure (SPF). In the Internet, a large ISP uses multiple network providers so that failure by a singl

29、e company will not take them off the air. Similarly, we need multiple Cloud Computing providers to support each other to eliminate SPF.,Research Challenges,Data Security Current cloud offerings are essentially public rather than private networks, exposing the system to more attacks such as DDoS atta

30、cks. Solutions: There are many well understood technologies such as encrypted storage, virtual local area networks, and network middle boxes.,Research Challenges,Data Transfer Bottlenecks Applications continue to become more data-intensive. If we assume applications may be “pulled apart” across the

31、boundaries of clouds, this may complicate data placement and transport. Both WAN bandwidth and intra-cloud networking technology are performance bottleneck. Industrial solutions: It is estimated that 2/3 of the cost of WAN bandwidth is consumed by high-end routers, whereas only 1/3 charged by fiber

32、industry. We can lower the cost by using simpler routers built from commodity components with centralized control, but research is heading towards using high-end distributed routers .,Research Challenges,Software Licensing Current software licenses commonly restrict the computers on which the softwa

33、re can run. Users pay for the software and then pay an annual maintenance fee. Many cloud computing providers originally relied on open source software in part because the licensing model for commercial software is not a good match to Utility Computing. Some ideas: We can encourage sales forces of s

34、oftware companies to sell products into Cloud Computing. Or they can implement pay-per-use model to the software to adapt to a cloud environment.,Research Challenges,Scalable storage Differences between common storage and cloud storage The system is built from many inexpensive commodity components t

35、hat often fail The system stores a modest number of large files The workloads primarily consist both large streaming reads and small random reads. The workloads many large, sequential writes that append data to files and once written, files are seldom modified again. The cloud storage (file) system

36、needs to share many of the same goals as previous distributed file systems such as performance, scalability, reliability, and availability. In addition, its design needs to be driven by key observations of the specific workloads and technological environment, both current and anticipated, that refle

37、ct a marked departure from some earlier file system design assumptions. GFS Files are divided into fixed-size chunks, Chunk size is one of the key design parameters. GFS chooses 64 MB, which is much larger than typical file system block sizes. The master stores three major types of metadata: the fil

38、e and chunk namespaces, the mapping from files to chunks, and the locations of each chunks replicas. GFS supports the usual operations to create, delete, open, close, read, and write files.,Research Challenges,Transparent Programming Model Programs written for cloud implementation need to be automat

39、ically parallelized and executed on a large cluster of commodity machines. The run-time system should take care of the details of partitioning the input data, scheduling the programs execution across a set of machines, handling machine failures, and managing the required inter-machine communication.

40、 The programming model should allow programmers without many experiences with parallel and distributed systems to easily utilize the resources of a large distributed system. MapReduce Scalable Data Processing on Large Clusters A web programming model implemented for fast processing and generating la

41、rge datasets Applied mainly in web-scale search and cloud computing applications Users specify a map function to generate a set of intermediate key/value pairs Users use a reduce function to merge all intermediate values with the same intermediate key.,Research Challenges,Steve Ballmers View on the

42、Future of Cloud,Cloud creates opportunities and responsibilities Cloud learns and helps you learn, decide and take action Cloud enhances social and professional interactions The cloud wants smarter devices Cloud drives server advances that, in turn, drive the cloud,Cloud Computing Skepticism,CLOUD C

43、OMPUTING,Cloud computing is simply a buzzword used to repackage grid computing and utility computing, both of which have existed for decades .“Cloud computing is simply a buzzword used to repackage grid computing and utility computing, both of which have existed for decades.”, Definition of Cloud Co

44、mputing,Larry Ellison,“The interesting thing about cloud computing is that weve redefined cloud computing to include everything that we already do. The computer industry is the only industry that is more fashion-driven than womens fashion. Maybe Im an idiot, but I have no idea what anyone is talking about. What is it? Its complete gibberish. Its insane. When is this idiocy going to stop?”,

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 其他


经营许可证编号:宁ICP备18001539号-1