| |||||
Hadoop is a software system that allows one particular quickly compose and operate apps that approach large quantities of info.
Hadoop implements MapReduce, working with the Hadoop Dispersed File Technique (HDFS). MapReduce will divide your purposes into many smaller blocks of perform.
HDFS generates a number of replicas of facts blocks for reliability, positioning them on compute nodes around the cluster. MapReduce can then procedure the knowledge the place it is positioned.
Hadoop has been demonstrated on clusters with 2000 nodes. The existing design and style goal is ten,000 node clusters.
Right here are some crucial characteristics of "Hadoop":
· Scalable: Hadoop can reliably retailer and course of action petabytes.
· Cost-effective : It distributes the facts and processing across clusters of generally readily available personal computers. These clusters can variety into the thousands of nodes.
· Productive : By distributing the information, Hadoop can method it in parallel on the nodes the place the facts is found. This makes it particularly rapid.
· Trustworthy : Hadoop instantly maintains multiple copies of information and routinely redeploys computing jobs primarily based on failures.
What is New in This Launch : [ read through complete changelog ]
· This launch delivers a variety of vital bug-fixes for hadoop- two.x uncovered through integration tests of prior launch.
Via: Hadoop 1.2.0 / 2.0.5 Alpha
No comments:
Post a Comment