What is it about?
Size of the data used in todays enterprises has been growing at exponential rates from last few years. Simultaneously, the need to process and analyze the large volumes of data has also increased. To handle and for analysis of large scale datasets, an open-source implementation of Apache framework, Hadoop is used now-a-days. For managing and storing of all the resources across its cluster, Hadoop possesses a distributed file system called Hadoop Distributed File System (HDFS). HDFS is written completely in Java and is depicted in such a way that in can store Big data more reliably, and can stream those at high processing time to the user applications. Hadoop has been widely used in recent days by popular organizations like Yahoo, Facebook and various online shopping market venders. On the other hand, experiments on Data-Intensive computations are going on to parallelize the processing of data. None of them could actually achieve a desirable performance. Hadoop, with its Map-Reduce parallel data processing capability can achieve these goals efficiently. This chapter initially provides an overview of the HDFS in details. The next portion of the paper evaluates Hadoops performance with various factors in different environments. The chapter shows how files less than the block size affect Hadoops R/W performance and how the time of execution of a job depends on block size and number of reducers. Chapter concludes with providing the different real challenges of Hadoop in recent days and scope for future work.
Featured Image
Why is it important?
It provides rigorous experimental evaluation of Hadoop. It provides important parameters that effect Hadoop.
Perspectives
Exposes various parameters that effects HDFS and MapReduce.
Ripon Patgiri
National Institute of Technology Silchar
Read the Original
This page is a summary of: A Deep Dive into the Hadoop World to Explore Its Various Performances, January 2016, Springer Science + Business Media,
DOI: 10.1007/978-3-319-27520-8_3.
You can read the full text:
Contributors
The following have contributed to this page







