Hadoop is one of the de facto standards in the big data world, but as with Linux there are multiple distributions.
MapR has announced the latest version of its Hadoop distribution, which includes Hadoop 2.2 with YARN.
Company officials described YARN as a next-generation resource manager, allowing Hadoop applications to share a cluster's computer resources for increased efficiency and utilisation.
|
The MapReduce 1.x and YARN schedulers can be run on the same nodes simultaneously, simplifying the transition between schedulers.
The MapR distribution also includes Cascading, Flume, HBase, Hive, Hue, Impala, Multitool, Oozie, Pig, Solr, Sqoop, and ZooKeeper.
"As YARN expands Hadoop use cases in the enterprise, the need for enterprise-grade dependability, interoperability and performance increases exponentially," said vice president of product management Tomer Shiran.
"The combination of YARN and the MapR Data Platform delivers the only distribution for Hadoop in which both YARN and non-YARN distributed Big Data applications share the compute and storage resources of large-scale clusters."
The MapR Distribution including Apache Hadoop YARN will be available in March.