最具影响力的数字化技术在线社区

168大数据

 找回密码
 立即注册

QQ登录

只需一步,快速开始

1 2 3 4 5
打印 上一主题 下一主题
开启左侧

[基础] Cloudera Hadoop管理员(CCAH)认证大纲

[复制链接]
跳转到指定楼层
楼主
发表于 2016-3-23 13:14:37 | 只看该作者 回帖奖励 |倒序浏览 |阅读模式

马上注册,结交更多数据大咖,获取更多知识干货,轻松玩转大数据

您需要 登录 才可以下载或查看,没有帐号?立即注册

x
Cloudera Certified Administrator for Apache hadoop (CCA-500)
Number of Questions: 60 questions
Time Limit: 90 minutes
Passing Score: 70%
Language: English, Japanese
Exam Sections and Blueprint
1. HDFS (17%)
•        Describe the function of HDFS daemons
•        Describe the normal operation of an Apache Hadoop cluster, both in data storage and in data processing
•        Identify current features of computing systems that motivate a system like Apache Hadoop
•        Classify major goals of HDFS Design
•        Given a scenario, identify appropriate use case for HDFS Federation
•        Identify components and daemon of an HDFS HA-Quorum cluster
•        Analyze the role of HDFS security (Kerberos)
•        Determine the best data serialization choice for a given scenario
•        Describe file read and write paths
•        Identify the commands to manipulate files in the Hadoop File System Shell
2. YARN and MapReduce version 2 (MRv2) (17%)
•        Understand how upgrading a cluster from Hadoop 1 to Hadoop 2 affects cluster settings
•        Understand how to deploy MapReduce v2 (MRv2 / YARN), including all YARN daemons
•        Understand basic design strategy for MapReduce v2 (MRv2)
•        Determine how YARN handles resource allocations
•        Identify the workflow of MapReduce job running on YARN
•        Determine which files you must change and how in order to migrate a cluster from MapReduce version 1 (MRv1) to MapReduce version 2 (MRv2) running on YARN
3. Hadoop Cluster Planning (16%)
•        Principal points to consider in choosing the hardware and operating systems to host an Apache Hadoop cluster
•        Analyze the choices in selecting an OS
•        Understand kernel tuning and disk swapping
•        Given a scenario and workload pattern, identify a hardware configuration appropriate to the scenario
•        Given a scenario, determine the ecosystem components your cluster needs to run in order to fulfill the SLA
•        Cluster sizing: given a scenario and frequency of execution, identify the specifics for the workload, including CPU, memory, storage, disk I/O
•        Disk Sizing and Configuration, including JBOD versus RAID, SANs, virtualization, and disk sizing requirements in a cluster
•        Network Topologies: understand network usage in Hadoop (for both HDFS and MapReduce) and propose or identify key network design components for a given scenario
4. Hadoop Cluster Installation and Administration (25%)
•        Given a scenario, identify how the cluster will handle disk and machine failures
•        Analyze a logging configuration and logging configuration file format
•        Understand the basics of Hadoop metrics and cluster health monitoring
•        Identify the function and purpose of available tools for cluster monitoring
•        Be able to install all the ecoystme components in CDH 5, including (but not limited to): Impala, Flume, Oozie, Hue, Cloudera Manager, Sqoop, Hive, and Pig
•        Identify the function and purpose of available tools for managing the Apache Hadoop file system
5. Resource Management (10%)
•        Understand the overall design goals of each of Hadoop schedulers
•        Given a scenario, determine how the FIFO Scheduler allocates cluster resources
•        Given a scenario, determine how the Fair Scheduler allocates cluster resources under YARN
•        Given a scenario, determine how the Capacity Scheduler allocates cluster resources
6. Monitoring and Logging (15%)
•        Understand the functions and features of Hadoop’s metric collection abilities
•        Analyze the NameNode and JobTracker Web UIs
•        Understand how to monitor cluster daemons
•        Identify and monitor CPU usage on master nodes
•        Describe how to monitor swap and memory allocation on all nodes
•        Identify how to view and manage Hadoop’s log files
•        Interpret a log file

楼主热帖
分享到:  QQ好友和群QQ好友和群 QQ空间QQ空间 腾讯微博腾讯微博 腾讯朋友腾讯朋友
收藏收藏 转播转播 分享分享 分享淘帖 赞 踩

168大数据 - 论坛版权1.本主题所有言论和图片纯属网友个人见解,与本站立场无关
2.本站所有主题由网友自行投稿发布。若为首发或独家,该帖子作者与168大数据享有帖子相关版权。
3.其他单位或个人使用、转载或引用本文时必须同时征得该帖子作者和168大数据的同意,并添加本文出处。
4.本站所收集的部分公开资料来源于网络,转载目的在于传递价值及用于交流学习,并不代表本站赞同其观点和对其真实性负责,也不构成任何其他建议。
5.任何通过此网页连接而得到的资讯、产品及服务,本站概不负责,亦不负任何法律责任。
6.本站遵循行业规范,任何转载的稿件都会明确标注作者和来源,若标注有误或遗漏而侵犯到任何版权问题,请尽快告知,本站将及时删除。
7.168大数据管理员和版主有权不事先通知发贴者而删除本文。

您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

关闭

站长推荐上一条 /1 下一条

关于我们|小黑屋|Archiver|168大数据 ( 京ICP备14035423号|申请友情链接

GMT+8, 2024-6-1 18:17

Powered by BI168大数据社区

© 2012-2014 168大数据

快速回复 返回顶部 返回列表