This document discusses using Alluxio with Spark to improve performance when working with big data. It provides an overview of Alluxio and how it can be used to accelerate Spark jobs by consolidating memory, providing data resilience, and enabling data access from different storage systems at memory speed. Performance tests show that Alluxio provides 2-17x speedups over Spark alone for reading RDDs and DataFrames from remote storage like S3, by caching the data in memory.