Posted by on
Tags:
Categories: Apache Hadoop MapReduce Yarn

Find out what #YARN is. Yet Another Resource Negotiator, or YARN for short, is an #Apache #Hadoop technology. The technology is designed for cluster management and is one of the key features in the second generation of Hadoop, the Apache Software Foundation’s open source distributed processing framework. YARN is described as a large-scale, distributed operating system for big data applications. YARN is a software rewrite that is capable of decoupling #MapReduce ’s resource management and scheduling capabilities from the data processing component. The benefit of this is that it enables Hadoop to support more varied processing approaches and a broader array of applications, such as Hadoop clusters now being able to run interactive querying and streaming data applications along with MapReduce batch jobs. Combining a central resource management with node manager agents that monitor the processing operations of individual cluster nodes has helped to increase the appeal of YARN and Hadoop. The separation of HDFS from MapReduce with YARN has made Hadoop more suitable for operational applications that can’t wait for batch jobs to finish.

http://www.cbronline.com/news/big-data/software/what-is-yarn-4963516

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.