Home Database Mysql Tutorial hadoop资源调度器

hadoop资源调度器

Jun 07, 2016 pm 04:30 PM
hadoop Multi-user support environment Scheduling resource

? ? ? ? ?hadoop支持多用户环境,在生产环境中,往往一个大的hadoop集群,供多个应用如Hive,Mahout等之类的使用。在多用户环境下,有的用户提交的工作量很大、很频繁,而有的很少,还有的优先级很高,那么如何保证“按需”来为各个用户分配资源(内存、CPU

? ? ? ? ?hadoop支持多用户环境,在生产环境中,往往一个大的hadoop集群,供多个应用如Hive,Mahout等之类的使用。在多用户环境下,有的用户提交的工作量很大、很频繁,而有的很少,还有的优先级很高,那么如何保证“按需”来为各个用户分配资源(内存、CPU、带宽、IO、磁盘)呢?这就需要用到资源调度器。

? ?? ? ?这里,我对hadoop的资源调度做个简单的归纳总结:

? ? ? ?? 一、基本术语

? ? ? ? 用户分组管理

? ? ? ? 用于按组为单位组织管理,某个用户只能向固定分组中提交作业,只能使用固定分组中配置的资源;同时可以限制每个用户提交的作业数,使用的资源量等

? ? ? ? 作业管理

? ? ? ? 包括作业提交权限控制,作业运行状态查看权限控制等。如:可限定可提交作业的用户;可限定可查看作业运行状态的用户;可限定普通用户只能修改自己作业的优先级,kill自己的作业;高级用户可以控制所有作业等。

? ? ? ? 用户(User)

? ? ? ? Hadoop使用Linux用户管理,Hadoop中的用户就是Linux中的用户

? ? ? ? 分组(group)

? ? ? ? Hadoop使用Linux分组管理,Hadoop中的分组就是Linux中的分组

? ? ? ? 池(pool)

? ? ? ? Hadoop Fair Scheduler中的概念,一个pool可以是一个user,一个group,或者一个queue。

? ? ? ? 队列(Queue)

? ? ? ? 队列是Hadoop提出的概念,一个Queue可以由任意几个Group和任意几个User组成。

?

?

? ? ? ? 二、资源调度管理简介

? ? ? ? Hadoop的资源管理核心

? ? ? ? ? ? ? ?存储容量管理

? ? ? ? ? ? ? ? ? ? ?HDFS存储管理

? ? ? ? ? ? ? ?计算容量管理

? ? ? ? ? ? ? ? ? ? ?MR任务槽管理

? ? ? ? ? ? ? ? ? ? ?任务调度器

?

?

? ? ? ? ? 三、存储容量管理

?

? ? ? ? ? ? ? ?主要是对HDFS的管理:

? ? ? ? ? ? ? ? ? ? HDFS本身也是一种文件系统,与Linux文件系统有类似之处

? ? ? ? ? ? ? ? ? ? 对用户组、用户、路径设置Quotas(包括空间大小和文件数量)

? ? ? ? ? ? ? ? ? ? Hadoop dfsadmin -setSpaceQuota size path

? ? ? ? ? ? ? ? ? ? hadoop dfsadmin -setQuota number path

? ? ? ? ? ? ? ? ? ? Hadoop共享Linux的用户组和用户

? ? ? ? ? ? ? ? ? ? HDFS的权限控制:用户和用户组

?

?

?

? ? ? ? ? 四、计算容量管理

? ? ? ? ? ? ? ? ? 主要是对MapReduce任务槽的管理,任务槽也可以理解为服务器的资源综合统称:

? ? ? ? ? ? ? ? ? ? ? ? ?CPU(如何设置MR,见下面备注)

? ? ? ? ? ? ? ? ? ? ? ? ?内存(设置子任务的jvm参数和GC参数。在1 . 0以后的版本中,可以全局M/R内存和Job ? ? ? ? ? ? ? M/R内存)

? ? ? ? ? ? ? ? ? ? ? ? ?网络带宽

? ? ? ? ? ? ? ? ? ? ? ? ?磁盘IO

? ? ? ? ? ? ? ? ? ? ? ? ?单台主机或整个集群资源是有限的。在多任务、多用户环境下,大量并发提交和执行任务是非常常见的,这就导致资源不足而竞争,如何保证在有限资源或业务约束下有序调度和执行任务,需要有一个协调者,这就是Hadoop的资源调度器。

?

?

? ? ? ? ? ? ? ? ? Hadoop的调度器

? ? ? ? ? ? ? ? ? ? ? ? FIFO(default)。先进先出调度器?

? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? Hadoop默认提供。

? ? ? ? ? ? ? ? ? ? ? ? Fair Scheduler(plugin)。公平调度器

? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? Facebook提供。

? ? ? ? ? ? ? ? ? ? ? ? Capacity Scheduler (plugin)。容量调度器

? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? Yahoo提供。

?

?

?

? ? ? ? ? ? ? ? 1、 FIFO Scheduler

? ? ? ? ? ? ? ? ? ?先进先出调度器

? ? ? ? ? ? ? ? ? ?优点:

? ? ? ? ? ? ? ? ? ? ? ? ? ?Hadoop默认提供。非常简单、JobTracker无负担

? ? ? ? ? ? ? ? ? ? ? ? ? ?支持五级优先级,每级一个队列,从队列优先级高低提取任务执行。五级分别是:Very Low 、Low 、Normal 、High 、Very Hign;

? ? ? ? ? ? ? ? ? ? ? ? ? ?在同级队列中,按提交Job的时间先后顺序执行

? ? ? ? ? ? ? ? ? ? ? ? ? ?无需额外配置

? ? ? ? ? ? ? ? ? ? 缺点:

? ? ? ? ? ? ? ? ? ? ? ? ? ?完全忽略不同作业的需求差异

? ? ? ? ? ? ? ? ? ??

?

?

? ? ? ? ? ? ? ? ? 2、Fair Scheduler?

? ? ? ? ? ? ? ? ? 公平分享调度器:在多个用户之间共享集群资源。它按资源池来组织作业,并把资源按时间公平地分到这些资源池里。在每一个资源池内,会使用公平共享地方法在运行作业之间共享容量。

?

? ? ? ? ? ? ? ? ? 特征:

? ? ? ? ? ? ? ? ? ? ? ? ? 支持多用户(默认一个用户对一个池,依赖于Linux多用户环境)

? ? ? ? ? ? ? ? ? ? ? ? ? 资源公平共享(公平共享量由优先级决定,同优先级按FIFO调度)

? ? ? ? ? ? ? ? ? ? ? ? ? 保证最小共享量

? ? ? ? ? ? ? ? ? ? ? ? ? 支持时间片抢占

? ? ? ? ? ? ? ? ? ? ? ? ? 限制作业并发量,以防止中间数据塞满磁盘

? ? ? ? ? ? ? ? ? ? ? ? ??



?

?

? ? ? ? ? ? ? ? ? ?3、Capacity Scheduler?

? ? ? ? ? ? ? ? ? ? ?容量调度器:使多个用户可以安全地共享一个大的集群,应用程序可以在容量限度下获取到需要的资源,最大化集群的吞吐率和利用率。它按队列来组织作业,每个队列会配置一定比例的计算资源,且所有提交到队列中的作业共享该队列中的资源。

? ? ? ? ? ? ? ? ? ??

? ? ? ? ? ? ? ? ? ? ?特征:

? ? ? ? ? ? ? ? ? ? ? ? ? ? 层次化的队列

? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? 层次化的队列支持在一个组织内子队列的优先共享资源,从而提供了更多的控制和预

测的能力,不支持抢占

? ? ? ? ? ? ? ? ? ? ? ? ? ? 资源容量保证

? ? ? ? ? ? ? ? ? ? ? ? ? ?队列实现了一种资源的划分,所有的应用都会被指定到特定的队列,这些应用所能使用到的资源受到队列所拥有资源的限制

? ? ? ? ? ? ? ? ? ? ? ? ? ? 安全性?

? ? ? ? ? ? ? ? ? ? ? ? ? ?每一个队列都有一个严格的ACL来控制那些用户可以访问队列,并且有一个safe-guard来保证用户不能够看或者修改其他用户的应用,而且每个队列或系统都可以设置管理员角色。?

? ? ? ? ? ? ? ? ? ? ? ? ? ? 弹性

? ? ? ? ? ? ? ? ? ? ? ? ??空闲的资源可以分配给任何队列,这样可能超出队列的资源限制。也就是说,如果集群有空闲的资源,而有些队列需要的资源超出了分给他的限制,这些空闲的资源将被分配给这些队列,这样就保证了资源的可预测性和弹性,从而防止了人工孤岛,帮助实现资源的优化利用。

? ? ? ? ? ? ? ? ? ? ? ? ? ? 多用户

? ? ? ? ? ? ? ? ? ? ? ? ? ?系列的综合设置可以防止单一的应用或用户占用队列或集群的全部资源,防止集群被单用户过度使用,从而保证了多用户可以共同使用集群

? ? ? ? ? ? ? ? ? ? ? ? ? ? 可操作性

? ? ? ? ? ? ? ? ? ? ? ? ? ?RuntimeConfiguration:一些设置可以在运行时进行配置,例如资源分配的容量, ? ? ? ? ? ? ? ? ? ? ACL

? ? ? ? ? ? ? ? ? ? ? ? ? ? 基于资源的调度?

? ? ? ? ? ? ? ? ? ? ? ? ? ? 支持资源密集型的应用,应用可以被指定分配超出缺省设置的更多的资源

?



?

? ? ? ? ? ? ?Fair VS Capacity

?

? ? ? ? ? ? ? 相同点

? ? ? ? ? ? ? ? ? ? 均支持多用户多队列,即:适用于多用户共享集群的应用环境

? ? ? ? ? ? ? ? ? ? 单个队列均支持优先级和FIFO调度方式

? ? ? ? ? ? ? ? ? ? 均支持资源共享,即某个queue中的资源有剩余时,可共享给其他缺资源的queue

? ? ? ? ? ? ? 异同点

? ? ? ? ? ? ? ? ? ? ?核心调度策略不同。 计算能力调度器的调度策略是,先选择资源利用率低的queue,然后在queue中同时考虑FIFO和memory constraint因素;而公平调度器仅考虑公平,而公平是通过作业缺额体现的,调度器每次选择缺额最大的job(queue的资源量,job优先级等仅用于计算作业缺额)。

? ? ? ? ? ? ? ? ? ? ?内存约束。计算能力调度器调度job时会考虑作业的内存限制,为了满足某些特殊job的特殊内存需求,可能会为该job分配多个slot;而公平调度器对这种特殊的job无能为力,只能杀掉这种task。

单一用户环境下,Fair只能共享一个池,Capacity可以多队列

?



已有 0 人发表留言,猛击->> 这里

ITeye推荐
  • —软件人才免语言低担保 赴美带薪读研!—



hadoop资源调度器

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months ago By 尊渡假赌尊渡假赌尊渡假赌
Two Point Museum: All Exhibits And Where To Find Them
1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Unable to boot into Windows recovery environment Unable to boot into Windows recovery environment Feb 19, 2024 pm 11:12 PM

Windows Recovery Environment (WinRE) is an environment used to repair Windows operating system errors. After entering WinRE, you can perform system restore, factory reset, uninstall updates, etc. If you are unable to boot into WinRE, this article will guide you through fixes to resolve the issue. Unable to boot into the Windows Recovery Environment If you cannot boot into the Windows Recovery Environment, use the fixes provided below: Check the status of the Windows Recovery Environment Use other methods to enter the Windows Recovery Environment Did you accidentally delete the Windows Recovery Partition? Perform an in-place upgrade or clean installation of Windows below, we have explained all these fixes in detail. 1] Check Wi

How to fix Windows Hello unsupported camera issue How to fix Windows Hello unsupported camera issue Jan 05, 2024 pm 05:38 PM

When using Windows Shello, a supported camera cannot be found. The common reasons are that the camera used does not support face recognition and the camera driver is not installed correctly. So let's take a look at how to set it up. Windowshello cannot find a supported camera tutorial: Reason 1: The camera driver is not installed correctly 1. Generally speaking, the Win10 system can automatically install drivers for most cameras, as follows, there will be a notification after plugging in the camera; 2. At this time, we open the device Check the manager to see if the camera driver is installed. If not, you need to do it manually. WIN+X, then select Device Manager; 3. In the Device Manager window, expand the camera option, and the camera driver model will be displayed.

Does PyCharm Community Edition support enough plugins? Does PyCharm Community Edition support enough plugins? Feb 20, 2024 pm 04:42 PM

Does PyCharm Community Edition support enough plugins? Need specific code examples As the Python language becomes more and more widely used in the field of software development, PyCharm, as a professional Python integrated development environment (IDE), is favored by developers. PyCharm is divided into two versions: professional version and community version. The community version is provided for free, but its plug-in support is limited compared to the professional version. So the question is, does PyCharm Community Edition support enough plug-ins? This article will use specific code examples to

How to find resources on 115 network disk How to find resources on 115 network disk Feb 23, 2024 pm 05:10 PM

There will be a lot of resources in the 115 network disk, so how to find resources? Users can search for the resources they need in the software, then enter the download interface, and then choose to save to the network disk. This introduction to the method of finding resources on 115 network disk can tell you the specific content. The following is a detailed introduction, come and take a look. How to find resources on 115 network disk? Answer: Search the content in the software, and then click to save to the network disk. Detailed introduction: 1. First enter the resources you want in the app. 2. Then click the keyword link that appears. 3. Then enter the download interface. 4. Click Save to network disk inside.

Pros and Cons Analysis: A closer look at the pros and cons of open source software Pros and Cons Analysis: A closer look at the pros and cons of open source software Feb 23, 2024 pm 11:00 PM

Pros and cons of open source software: Understanding the pros and cons of open source projects requires specific code examples In today’s digital age, open source software is getting more and more attention and respect. As a software development model based on the spirit of cooperation and sharing, open source software is widely used in different fields. However, despite the many advantages of open source software, there are also some challenges and limitations. This article will delve into the pros and cons of open source software and demonstrate the pros and cons of open source projects through specific code examples. 1. Advantages of open source software 1.1 Openness and transparency Open source software

ASUS TUF Z790 Plus is compatible with ASUS MCP79 memory frequency ASUS TUF Z790 Plus is compatible with ASUS MCP79 memory frequency Jan 03, 2024 pm 04:18 PM

ASUS tufz790plus supports memory frequency. ASUS TUFZ790-PLUS motherboard is a high-performance motherboard that supports dual-channel DDR4 memory and supports up to 64GB of memory. Its memory frequency is very powerful, up to 4800MHz. Specific supported memory frequencies include 2133MHz, 2400MHz, 2666MHz, 2800MHz, 3000MHz, 3200MHz, 3600MHz, 3733MHz, 3866MHz, 4000MHz, 4133MHz, 4266MHz, 4400MHz, 4533MHz, 4600MHz, 4733MHz and 4800MHz. Whether it is daily use or high performance needs

How to disable the 'Click to Show Desktop' feature in macOS How to disable the 'Click to Show Desktop' feature in macOS Nov 23, 2023 pm 02:31 PM

By default, macOSSonoma hides all active windows when you click on your desktop wallpaper. This is convenient if you tend to have a bunch of files on your desktop that you need to access. However, if you find this behavior maddening, there is a way to turn it off. Apple's latest macOS Sonoma Mac operating system has a new option called "Click the wallpaper to show the desktop." Enabled by default, this option can be particularly useful if you tend to have multiple windows open and want to access files or folders on your desktop without having to minimize or move the windows. When you enable the feature and click on the desktop wallpaper, all open windows are temporarily swept aside, allowing direct access to the desktop. Once done, you can again

Compatibility and related instructions between GTX960 and XP system Compatibility and related instructions between GTX960 and XP system Dec 28, 2023 pm 10:22 PM

Some users use the XP system and want to upgrade their graphics cards to gtx960, but are not sure whether gtx960 supports the xp system. In fact, gtx960 supports xp system. We only need to download the driver suitable for xp system from the official website, and then we can use gtx960. Let’s take a look at the specific steps below. Does gtx960 support XP system: GTX960 is compatible with XP system. Just download and install the driver and you're good to go. First, we need to open the NVIDIA official website and navigate to the home page. We then need to find a label or button above the page, it will probably be labeled "Drivers". Once we find this option we need to click on

See all articles