課程目錄:Hadoop For Administrators培訓
4401 人關注
(78637/99817)
課程大綱:

   Hadoop For Administrators培訓

 

 

 

Introduction
Hadoop history, concepts
Ecosystem
Distributions
High level architecture
Hadoop myths
Hadoop challenges (hardware / software)
Labs: discuss your Big Data projects and problems
Planning and installation
Selecting software, Hadoop distributions
Sizing the cluster, planning for growth
Selecting hardware and network
Rack topology
Installation
Multi-tenancy
Directory structure, logs
Benchmarking
Labs: cluster install, run performance benchmarks
HDFS operations
Concepts (horizontal scaling, replication, data locality, rack awareness)
Nodes and daemons (NameNode, Secondary NameNode, HA Standby NameNode, DataNode)
Health monitoring
Command-line and browser-based administration
Adding storage, replacing defective drives
Labs: getting familiar with HDFS command lines
Data ingestion
Flume for logs and other data ingestion into HDFS
Sqoop for importing from SQL databases to HDFS, as well as exporting back to SQL
Hadoop data warehousing with Hive
Copying data between clusters (distcp)
Using S3 as complementary to HDFS
Data ingestion best practices and architectures
Labs: setting up and using Flume, the same for Sqoop
MapReduce operations and administration
Parallel computing before mapreduce: compare HPC vs Hadoop administration
MapReduce cluster loads
Nodes and Daemons (JobTracker, TaskTracker)
MapReduce UI walk through
Mapreduce configuration
Job config
Optimizing MapReduce
Fool-proofing MR: what to tell your programmers
Labs: running MapReduce examples
YARN: new architecture and new capabilities
YARN design goals and implementation architecture
New actors: ResourceManager, NodeManager, Application Master
Installing YARN
Job scheduling under YARN
Labs: investigate job scheduling
Advanced topics
Hardware monitoring
Cluster monitoring
Adding and removing servers, upgrading Hadoop
Backup, recovery and business continuity planning
Oozie job workflows
Hadoop high availability (HA)
Hadoop Federation
Securing your cluster with Kerberos
Labs: set up monitoring
Optional tracks
Cloudera Manager for cluster administration, monitoring, and routine tasks; installation, use. In this track, all exercises and labs are performed within the Cloudera distribution environment (CDH5)
Ambari for cluster administration, monitoring, and routine tasks; installation, use. In this track, all exercises and labs are performed within the Ambari cluster manager and Hortonworks Data Platform (HDP 2.0)

主站蜘蛛池模板: 亚洲精品国产第一综合99久久| 欧美激情综合亚洲一二区| 亚洲AV人无码综合在线观看| 一个色综合久久| 亚洲国产一成久久精品国产成人综合| 99久久综合狠狠综合久久| 狠狠色综合日日| 亚洲综合日韩久久成人AV| 色欲香天天综合网无码| 亚洲综合色在线| 久久影视综合亚洲| 国产人成精品综合欧美成人| 国产成人精品综合久久久久| 久久久久综合网久久| 色狠狠色狠狠综合天天| 一本色道久久88加勒比—综合 | 亚洲国产综合无码一区| 欧美综合区综合久青草视频| 亚洲色欲久久久综合网| 精品久久人人做人人爽综合| 五月六月综合欧美网站| 欧美成电影综合网站色www| 综合无码一区二区三区| 亚洲综合色成在线播放| 一本大道加勒比久久综合| 狠狠色综合网站久久久久久久高清| 亚洲综合国产一区二区三区| 一本色道久久88综合日韩精品 | 伊人色综合九久久天天蜜桃| 国产成+人+综合+亚洲欧美| 亚洲综合欧美精品一区二区 | 久久亚洲高清综合| 色欲香天天综合网站| av色综合久久天堂av色综合在| 亚洲综合伊人久久大杳蕉| 激情综合色五月丁香六月亚洲 | 天天色综合天天色| 日韩欧美综合在线| 综合久久一区二区三区 | 亚洲色偷偷狠狠综合网| 狠狠色狠狠色综合伊人|