Course catalog: Hadoop For Administrators Training
Course outline:

Introduction
Hadoop history, concepts
Ecosystem
Distributions
High level architecture
Hadoop myths
Hadoop challenges (hardware / software)
Labs: discuss your Big Data projects and problems
Planning and installation
Selecting software, Hadoop distributions
Sizing the cluster, planning for growth
Selecting hardware and network
Rack topology
Installation
Multi-tenancy
Directory structure, logs
Benchmarking
Labs: cluster install, run performance benchmarks
HDFS operations
Concepts (horizontal scaling, replication, data locality, rack awareness)
Nodes and daemons (NameNode, Secondary NameNode, HA Standby NameNode, DataNode)
Health monitoring
Command-line and browser-based administration
Adding storage, replacing defective drives
Labs: getting familiar with HDFS command lines
Data ingestion
Flume for logs and other data ingestion into HDFS
Sqoop for importing from SQL databases to HDFS, as well as exporting back to SQL
Hadoop data warehousing with Hive
Copying data between clusters (distcp)
Using S3 as a complement to HDFS
Data ingestion best practices and architectures
Labs: setting up and using Flume and Sqoop
MapReduce operations and administration
Parallel computing before MapReduce: comparing HPC and Hadoop administration
MapReduce cluster loads
Nodes and daemons (JobTracker, TaskTracker)
MapReduce UI walkthrough
MapReduce configuration
Job config
Optimizing MapReduce
Fool-proofing MapReduce: what to tell your programmers
Labs: running MapReduce examples
YARN: new architecture and new capabilities
YARN design goals and implementation architecture
New actors: ResourceManager, NodeManager, Application Master
Installing YARN
Job scheduling under YARN
Labs: investigate job scheduling
Advanced topics
Hardware monitoring
Cluster monitoring
Adding and removing servers, upgrading Hadoop
Backup, recovery and business continuity planning
Oozie job workflows
Hadoop high availability (HA)
Hadoop Federation
Securing your cluster with Kerberos
Labs: set up monitoring
Optional tracks
Cloudera Manager for cluster administration, monitoring, and routine tasks; installation, use. In this track, all exercises and labs are performed within the Cloudera distribution environment (CDH5)
Ambari for cluster administration, monitoring, and routine tasks; installation, use. In this track, all exercises and labs are performed within the Ambari cluster manager and Hortonworks Data Platform (HDP 2.0)
