Modern Big Data applications need to process data in real time to reveal patterns, trends, and associations. The Apache Spark Connector for Riak moves data from Riak to Spark for in-memory analysis, plus the results can be stored back in Riak for future data processing.
Why Apache Spark and Riak?
Apache Spark is an analytics framework for Big Data. Riak is built to store Big Data in a distributed NoSQL database that is designed for massive scalability, high availability, and ease of operations. Apache Spark integrated with Riak provides the real-time analytics of Spark with the availability and scalability of Riak. This makes real-time analytics of unstructured data possible. Until Spark came along, no single processing framework could handle the load, required by a distributed system.
Apache Spark Integration Resources: