Mehboob

" Every cloud has a silver lining "

Hadoop Tutorial

Posted by Mehboob on 15/12/2011

Tutorial Hadoop or MapReduce can be a task if you get your hand dirty at the start. Most of IT people say it is future tech, let start from very basics .. This tutorial covers the basics of distributed systems design. The pre-requisites are significant programming experience with a language such as C++ or Java, a basic understanding of networking, and data structures & algorithms.

What Is Apache Hadoop?

Start with very basics of MR with http://code.google.com/edu/parallel/dsd-tutorial.html

Then go for the first two lectures in http://www.cs.washington.edu/education/courses/cse490h/08au/lectures.htm

A very good course introduction to MapReduce and Hadoop. Read the seminal paper http://labs.google.com/papers/mapreduce.html

and its improvements in the updated version at
http://www.cs.washington.edu/education/courses/cse490h/08au/readings/communications200801-dl.pdf

Then go for all the other videos in the U.Washington link given above to understand distributed systems in detail (optional).

Try youtubing the terms Map reduce and Hadoop to find videos by ORielly and Google RoundTable for good overview of the future of Hadoop and MapReduce

Then off to the most important videos – Cloudera Videos
http://www.cloudera.com/resources/?media=Video

And Google MiniLecture Series
http://code.google.com/edu/submissions/mapreduce-minilecture/listing.html

Along with all the Multimedia above we need good written material Documents:

Architecture diagrams at hadooper.blogspot.com are good to have on your wall Hadoop: The definitive guide goes more into the nuts and bolts of the whole system where as Hadoop in Action is a good read with lots of teaching examples to learn the concepts of hadoop. Pro Hadoop is not for beginners pdfs of the documentation from Apache Foundation

http://hadoop.apache.org/common/docs/current/

And 

http://hadoop.apache.org/common/docs/stable/

Will help you learn as to how model your problem into a MR solution in order to gain the advantages of Hadoop in total.  HDFS paper by Yahoo! Research is also a good read in order to gain in depth knowledge of hadoop distributed file system Try the http://developer.yahoo.com/hadoop/tutorial/module1.html link for beginners to expert path to Hadoop.

All this for Hadoop lover. Install Hadoop on Ubunto Server through this vedio, no aduio, just hang on with this so you can make it.

Hadoop Part 1 – Install Ubuntu Server 10.04

Part 2

Part 3

Part 4

Part 5

 

 Mehboob

SQL – SPS  DBA – MCTS

2 Responses to “Hadoop Tutorial”

  1. Safdar said

    Great site, keep go on

  2. John said

    Cool and good information, ur site help me a lot, thanks for you time.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Connecting to %s

 
Follow

Get every new post delivered to your Inbox.