This document summarizes a research paper that analyzed the co-occurrence problem using MapReduce on AWS datasets. It compared the pairs and stripes approaches for calculating a co-occurrence matrix on different sized datasets and cluster node configurations. The stripes approach had significantly better performance, taking half the time of the pairs approach. Further optimizations like using a combiner and pre-processing the data were suggested to improve efficiency even more.