Nt1330 Unit 3 Assignment


The roots of Hadoop date back to 2002, when Doug Cutting was working on an open source project named Nutch (a web search engine that crawled and indexed web pages so they could be searched, much as Google does). He was facing scalability issues in both storage and computing. In 2003 Google published its paper on GFS (Google File System), and in 2004 the Nutch project created NDFS (Nutch Distributed File System). After Google announced MapReduce as the computational engine behind its large-scale data processing, Doug was able to run Nutch on NDFS using MapReduce in 2005, and in 2006 Hadoop was born.
Hadoop vs. MapReduce
Hadoop is an ecosystem of open source projects whose core modules are Hadoop Common, the Hadoop Distributed File System (HDFS), Hadoop YARN, and Hadoop MapReduce. Hadoop as such is an open source framework for storing and processing huge datasets. Storage is handled by HDFS and processing is handled by MapReduce. MapReduce, on the other hand, is a programming model that allows you to process the huge datasets stored in Hadoop.
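To make the idea of MapReduce as a programming model more concrete, here is a minimal sketch of a word count written in plain Python. It does not use Hadoop or any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration. It assumes the usual three steps: a map step that emits key/value pairs, a shuffle step that groups values by key, and a reduce step that combines the values for each key.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group emitted values by key (a real framework does this between phases)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values collected for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run the three steps in order on an in-memory list of lines (illustrative only)."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["Hadoop stores data in HDFS", "MapReduce processes data in Hadoop"]
    print(word_count(sample))
```

In a real Hadoop job the input lines would come from files stored in HDFS and the map and reduce steps would run in parallel on many machines; here everything runs in one process so that the shape of the model is easy to see.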
• The below figure will help in differentiating MapReduce from