Posted by on
Categories: Apache Hadoop MapReduce Spark

#Apache #Spark 2.0 has been released with updated SQL support, structured streaming and better performance. Apache Spark is an open source data processing engine that has become very popular since its initial release. It improves on #Hadoop #MapReduce performance, running programs up to 100 times faster in memory and ten times faster on disk, according to Apache. The graph below shows logistic regression in Hadoop and Spark (according to Apache).

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.