Course catalog: Hadoop for Developers and Administrators Training
Course outline:

Module 1. Introduction to Hadoop
The Hadoop Distributed File System (HDFS)
The Read Path and The Write Path
Managing Filesystem Metadata
The Namenode and the Datanode
Namenode High Availability
Namenode Federation
The Command-Line Tools
Understanding REST Support
Module 2. Introduction to MapReduce
Analyzing the Data with Hadoop
Map and Reduce Pattern
Java MapReduce
Scaling Out
Data Flow
Developing Combiner Functions
Running a Distributed MapReduce Job
Module 3. Planning a Hadoop Cluster
Picking a Distribution and Version of Hadoop
Versions and Features
Hardware Selection
Master and Worker Hardware Selection
Cluster Sizing
Operating System Selection and Preparation
Deployment Layout
Setting up Users, Groups, and Privileges
Disk Configuration
Network Design
Module 4. Installation and Configuration
Installing Hadoop
Configuration: An Overview
The Hadoop XML Configuration Files
Environment Variables and Shell Scripts
Logging Configuration
Managing HDFS
Optimization and Tuning
Formatting the Namenode
Creating a /tmp Directory
Planning Namenode High Availability
The Fencing Options
Automatic Failover Configuration
Format and Bootstrap the Namenodes
Namenode Federation
Module 5. Understanding Hadoop I/O
Data Integrity in HDFS
Understanding Codecs
Compression and Input Splits
Using Compression in MapReduce
The Serialization Mechanism
File-Based Data Structures
The SequenceFile Format
Other File Formats and Column-Oriented Formats
Module 6. Developing a MapReduce Application
The Configuration API
Setting Up the Development Environment
Managing Configuration
GenericOptionsParser, Tool, and ToolRunner
Writing a Unit Test with MRUnit
The Mapper and Reducer
Running Locally on Test Data
Testing the Driver
Running on a Cluster
Packaging and Launching a Job
The MapReduce Web UI
Tuning a Job
Module 7. Identity, Authentication, and Authorization
Managing Identity
Kerberos and Hadoop
Understanding Authorization
Module 8. Resource Management
What Is Resource Management?
HDFS Quotas
MapReduce Schedulers
Anatomy of a YARN Application Run
Resource Requests
Application Lifespan
YARN Compared to MapReduce 1
Scheduling in YARN
Scheduler Options
Capacity Scheduler Configuration
Fair Scheduler Configuration
Delay Scheduling
Dominant Resource Fairness
Module 9. MapReduce Types and Formats
MapReduce Types
The Default MapReduce Job
Defining the Input Formats
Managing Input Splits and Records
Text Input and Binary Input
Managing Multiple Inputs
Database Input (and Output)
Output Formats
Text Output and Binary Output
Managing Multiple Outputs
The Database Output
Module 10. Using MapReduce Features
Using Counters
Reading Built-in Counters
User-Defined Java Counters
Understanding Sorting
Using the Distributed Cache
Module 11. Cluster Maintenance and Troubleshooting
Managing Hadoop Processes
Starting and Stopping Processes with Init Scripts
Starting and Stopping Processes Manually
HDFS Maintenance Tasks
Adding a Datanode
Decommissioning a Datanode
Checking Filesystem Integrity with fsck
Balancing HDFS Block Data
Dealing with a Failed Disk
MapReduce Maintenance Tasks
Killing a MapReduce Job
Killing a MapReduce Task
Managing Resource Exhaustion
Module 12. Monitoring
The Available Hadoop Metrics
The Role of SNMP
Health Monitoring
Host-Level Checks
HDFS Checks
MapReduce Checks
Module 13. Backup and Recovery
Data Backup
Distributed Copy (distcp)
Parallel Data Ingestion
Namenode Metadata
