ADMINISTRATION OF HADOOP DISTRIBUTED FILE SYSTEM (HDFS)
· Administration Concepts : Blocks, Replication, Version File, Safe mode, Namespace IDs, Reading and Writing in HDFS, Understanding Name Node, Understanding Data Node, Understanding Secondary Name Node, Understanding Job Tracker, Understanding Task Tracker, HDFS Shell Commands, Hadoop Admin Commands, Accessing HDFS using API, Understanding HDFS Java classes and methods, HDFS Nextgeneration Concepts, Hands On Exercise,
· Setting up Hadoop Cluster for Apache Hadoop: Installation in detail, creating Ubuntu image in VMware, Downloading Hadoop, Installing SSH, configuring Hadoop, Download, Installation & Configuration of Pig, Hive and Sqoop, Installing MySql in Hadoop cluster, Download and work with Cloudera Image.
· Configuring Hadoop in Different Modes; Local Mode,Running without HDFS, Pseudo-distributed Mode, Running all daemons in a single node, Fully distributed mode,Running daemons on dedicated nodes Managing Hadoop Processes, Starting and Stopping Processes with Init Scripts, Starting and Stopping Processes Manually, HDFS Maintenance Tasks, Adding a Datanode, Decommissioning a Datanode, Checking Filesystem Integrity with fsck, Balancing HDFS Block Data, Dealing with a Failed Disk,MapReduce Maintenance Tasks, Adding a Tasktracker,Decommissioning a Tasktracker, Killing a MapReduce Job, Killing a MapReduceTask,Dealing with a Blacklisted Tasktracker
MAP REDUCE
(DEVELOPMENT)
· Map Reduce Programming :Understanding block and input splits,,Common Input and Output Formats, MapReduce Data types, Understanding Writable and WritableComparable (Introduction), Data Flow in MapReduce Application, Understanding MapReduce problem on real datasets(stocks), MapReduce Skeleton in Details, Writing MapReduce Application, Understanding Mapper function, Understanding Reducer Function, Understanding Driver, Understanding Tool Runner, Hands on Exercise, MapReduce Continued, Using Combiner Using Distributed Cache, Passing the parameters to mapper and reducer, Hands On exercise,Writing Custom key values, Hands On Exercise,Designed Use Cases for common problems.