HADOOP COMPONENTS & FAULT TOLERANCE


Shyamala S Pushparani M

Abstract

In the modern era of information technology and computer science, storing and processing data efficiently is critically important. Data volumes now routinely reach terabytes and petabytes, outgrowing what traditional, less scalable techniques can handle. Hadoop is designed to store such large data sets reliably. It is open-source software that supports parallel and distributed data processing on a highly scalable compute platform, enabling users to store and process amounts of data that are not practical with less scalable techniques. Hadoop also provides a fault tolerance mechanism by which the system continues to function correctly even after some components stop working properly. Fault tolerance is achieved mainly through data duplication, that is, keeping copies of the same data sets on two or more data nodes. In this paper we describe the major components of Hadoop and explain how fault tolerance is achieved by means of data duplication, checkpointing, and recovery.
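To make the data duplication idea concrete, the following is a minimal sketch (not part of the original paper) of how a client can control the HDFS replication factor using the standard Hadoop Java API. The file path is hypothetical and used only for illustration; in practice the cluster-wide default is usually set via the dfs.replication property in hdfs-site.xml.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

// Illustrative sketch: keep three replicas of a file so the cluster
// can tolerate the loss of a DataNode holding one of the copies.
public class ReplicationExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Default replication factor for new files written by this client;
        // mirrors the dfs.replication setting in hdfs-site.xml.
        conf.set("dfs.replication", "3");

        FileSystem fs = FileSystem.get(conf);

        // Raise the replication factor of an existing file.
        // The path below is hypothetical, for illustration only.
        Path file = new Path("/data/example.txt");
        fs.setReplication(file, (short) 3);

        fs.close();
    }
}
```

With a replication factor of three, HDFS places copies of each block on different data nodes, so the failure of any single node does not make the data unavailable.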

