Monday, 19 June 2017

Characteristics​ of Hadoop

Characteristics of Hadoop:

1. Robust: 
Handles hardware failure as data is stored in multiples node. 

2. Scalable:
Cluster size can be increased by adding more node.

3. Simple:
We can write parallel code. Focuses on moving code rather than data.

4. Portable:
Analyze structured,semi-structured and unstructured data. (Structured- In table format, Semi-structure- Not in table format but in well organised format(XML,JSON), Unstructured- Has no format(Text, Image,Videos).

5. Cost Effective:
Hadoop is open source and uses commodity hardware to store data so it is really cost effective as compared to traditional RDBMS.

6. Fault Tolerance:
If any node fails, the task are automatically redirected to other node. Multiple copies of all data are automatically stored. Due to this even if one node fails, same data is available on some other node also.

No comments:

Post a Comment