Big Data

Big Data is a term for collections of data that are huge in size and still growing exponentially over time.

About the Course

Examples of Big Data sources include stock exchanges, social media sites, and jet engines.

Duration: 60 Days

Course Module:

Big Data

• HADOOP 

• DATABASES

Hadoop

HDFS (Hadoop Distributed File System) 

HDFS ARCHITECTURE 

• NameNode

• Secondary NameNode

• DataNode

• Data storage in HDFS 

• HDFS block size 

• HDFS commands 

• HOW TO OVERCOME THE DRAWBACKS IN HDFS

• HOW TO CONFIGURE THE HADOOP CLUSTER 

• HADOOP 2.X.X VERSION FEATURES 

• MAPREDUCE

• MAPREDUCE ARCHITECTURE

• JobTracker

• TaskTracker

• Data Types in Hadoop 

• Mapper 

• Reducer

• Combiner

• Distributed Cache

• Counters 

• Joins

    • Map-side join

    • Reduce-side join

• MapReduce Schedulers

• MapReduce programming model

• Debugging MapReduce jobs

• YARN (Next Generation MapReduce)

• Data locality

• SPECULATIVE EXECUTION

• APACHE PIG

• HIVE

• APACHE ZOOKEEPER

• APACHE HBASE

• Apache SQOOP

• Apache FLUME

• Advanced and new technologies architecture

• Discussions

•  Mahout (Machine Learning Algorithms)

•  Storm (Real time data streaming)

•  Cassandra (NoSQL database)

•  MongoDB (NoSQL database)

•  Ganglia (Monitoring Tools)

•  Cloudera, Hortonworks, MapR, Amazon EMR (Distributions)