Hadoop Tutorial
Posted by Mehboob on 15/12/2011
Tutorial Hadoop or MapReduce can be a task if you get your hand dirty at the start. Most of IT people say it is future tech, let start from very basics .. This tutorial covers the basics of distributed systems design. The pre-requisites are significant programming experience with a language such as C++ or Java, a basic understanding of networking, and data structures & algorithms.
What Is Apache Hadoop?
Start with very basics of MR with http://code.google.com/edu/parallel/dsd-tutorial.html
Then go for the first two lectures in http://www.cs.washington.edu/education/courses/cse490h/08au/lectures.htm
A very good course introduction to MapReduce and Hadoop. Read the seminal paper http://labs.google.com/papers/mapreduce.html
and its improvements in the updated version at
http://www.cs.washington.edu/education/courses/cse490h/08au/readings/communications200801-dl.pdf
Then go for all the other videos in the U.Washington link given above to understand distributed systems in detail (optional).
Try youtubing the terms Map reduce and Hadoop to find videos by ORielly and Google RoundTable for good overview of the future of Hadoop and MapReduce
Then off to the most important videos – Cloudera Videos
http://www.cloudera.com/resources/?media=Video
And Google MiniLecture Series
http://code.google.com/edu/submissions/mapreduce-minilecture/listing.html
Along with all the Multimedia above we need good written material Documents:
Architecture diagrams at hadooper.blogspot.com are good to have on your wall Hadoop: The definitive guide goes more into the nuts and bolts of the whole system where as Hadoop in Action is a good read with lots of teaching examples to learn the concepts of hadoop. Pro Hadoop is not for beginners pdfs of the documentation from Apache Foundation
http://hadoop.apache.org/common/docs/current/
And
http://hadoop.apache.org/common/docs/stable/
Will help you learn as to how model your problem into a MR solution in order to gain the advantages of Hadoop in total. HDFS paper by Yahoo! Research is also a good read in order to gain in depth knowledge of hadoop distributed file system Try the http://developer.yahoo.com/hadoop/tutorial/module1.html link for beginners to expert path to Hadoop.
All this for Hadoop lover. Install Hadoop on Ubunto Server through this vedio, no aduio, just hang on with this so you can make it.
Hadoop Part 1 – Install Ubuntu Server 10.04
Part 2
Part 3
Part 4
Part 5
Mehboob
SQL – SPS DBA – MCTS

Safdar said
Great site, keep go on
John said
Cool and good information, ur site help me a lot, thanks for you time.