課程目錄:Administrator Training for Apache Hadoop培訓(xùn)
4401 人關(guān)注
(78637/99817)
課程大綱:

   Administrator Training for Apache Hadoop培訓(xùn)

 

 

 

1: HDFS (17%)
Describe the function of HDFS Daemons
Describe the normal operation of an Apache Hadoop cluster, both in data storage and in data processing.
Identify current features of computing systems that motivate a system like Apache Hadoop.
Classify major goals of HDFS Design
Given a scenario, identify appropriate use case for HDFS Federation
Identify components and daemon of an HDFS HA-Quorum cluster
Analyze the role of HDFS security (Kerberos)
Determine the best data serialization choice for a given scenario
Describe file read and write paths
Identify the commands to manipulate files in the Hadoop File System Shell
2: YARN and MapReduce version 2 (MRv2) (17%)
Understand how upgrading a cluster from Hadoop 1 to Hadoop 2 affects cluster settings
Understand how to deploy MapReduce v2 (MRv2 / YARN), including all YARN daemons
Understand basic design strategy for MapReduce v2 (MRv2)
Determine how YARN handles resource allocations
Identify the workflow of MapReduce job running on YARN
Determine which files you must change and how in order to migrate a cluster from MapReduce version 1 (MRv1) to MapReduce version 2 (MRv2) running on YARN.
3: Hadoop Cluster Planning (16%)
Principal points to consider in choosing the hardware and operating systems to host an Apache Hadoop cluster.
Analyze the choices in selecting an OS
Understand kernel tuning and disk swapping
Given a scenario and workload pattern, identify a hardware configuration appropriate to the scenario
Given a scenario, determine the ecosystem components your cluster needs to run in order to fulfill the SLA
Cluster sizing: given a scenario and frequency of execution, identify the specifics for the workload, including CPU, memory, storage, disk I/O
Disk Sizing and Configuration, including JBOD versus RAID, SANs, virtualization, and disk sizing requirements in a cluster
Network Topologies: understand network usage in Hadoop (for both HDFS and MapReduce) and propose or identify key network design components for a given scenario
4: Hadoop Cluster Installation and Administration (25%)
Given a scenario, identify how the cluster will handle disk and machine failures
Analyze a logging configuration and logging configuration file format
Understand the basics of Hadoop metrics and cluster health monitoring
Identify the function and purpose of available tools for cluster monitoring
Be able to install all the ecosystem components in CDH 5, including (but not limited to): Impala, Flume, Oozie, Hue, Manager, Sqoop, Hive, and Pig
Identify the function and purpose of available tools for managing the Apache Hadoop file system
5: Resource Management (10%)
Understand the overall design goals of each of Hadoop schedulers
Given a scenario, determine how the FIFO Scheduler allocates cluster resources
Given a scenario, determine how the Fair Scheduler allocates cluster resources under YARN
Given a scenario, determine how the Capacity Scheduler allocates cluster resources
6: Monitoring and Logging (15%)
Understand the functions and features of Hadoop’s metric collection abilities
Analyze the NameNode and JobTracker Web UIs
Understand how to monitor cluster Daemons
Identify and monitor CPU usage on master nodes
Describe how to monitor swap and memory allocation on all nodes
Identify how to view and manage Hadoop’s log files
Interpret a log file

主站蜘蛛池模板: 亚洲精品综合一二三区在线| 国产成+人+综合+欧美亚洲| 一本色道久久88综合日韩精品 | 激情五月综合综合久久69| 伊人久久综合成人网| 狠狠色丁香婷婷综合久久来| 亚洲色偷偷狠狠综合网| 亚洲国产综合无码一区| 久久九色综合九色99伊人| 婷婷综合久久中文字幕蜜桃三电影| 国产欧美日韩综合AⅤ天堂| 亚洲五月综合缴情在线观看| 天天综合天天看夜夜添狠狠玩| 色综合网天天综合色中文男男| 久久久亚洲裙底偷窥综合| 伊人久久综合成人网| 国产综合久久久久久鬼色| 色噜噜狠狠色综合网| 炫硕日本一区二区三区综合区在线中文字幕 | 欧美激情综合五月色丁香| 国产精品国产欧美综合一区| 色老头综合免费视频| 97久久久精品综合88久久| 狠狠综合久久综合中文88| 亚洲国产欧美国产综合久久| 色综合色综合色综合| 亚洲伊人tv综合网色| 亚洲婷婷五月综合狠狠爱| 久久香综合精品久久伊人| 丁香色欲久久久久久综合网| 亚洲欧美国产∧v精品综合网 | 久久综合色区| 色综合久久综合中文综合网| 欧美日韩亚洲综合一区二区三区| 色欲综合一区二区三区| 色综合久久无码中文字幕| 国产亚洲综合一区柠檬导航| 亚洲综合熟女久久久30p| 久久综合亚洲鲁鲁五月天| 一本久道久久综合狠狠爱| 色视频综合无码一区二区三区|