This document provides an introduction to Apache Spark presented by Maxime Dumas of Cloudera. It discusses: 1. What Cloudera does including distributing Hadoop components with enterprise tooling and support. 2. An overview of the Apache Hadoop ecosystem including why Hadoop is used for scalability, efficiency, and flexibility with large amounts of data. 3. An introduction to Apache Spark which improves on MapReduce by being faster, easier to use, and supporting more types of applications such as machine learning and graph processing. Spark can be 100x faster than MapReduce for certain applications.