This document summarizes Lightning-Fast Cluster Computing with Spark and Shark, a presentation about the Spark and Shark frameworks. Spark is an open-source cluster computing system that aims to provide fast, fault-tolerant processing of large datasets. It uses resilient distributed datasets (RDDs) and supports diverse workloads with sub-second latency. Shark is a system built on Spark that exposes the HiveQL query language and compiles queries down to Spark programs for faster, interactive analysis of large datasets.