This document introduces Apache Spark and Apache Zeppelin. It describes Spark as an open-source cluster computing framework with APIs for Scala, Java, Python, and R, and outlines its key components, such as Spark Core, Spark SQL, MLlib, and GraphX. RDDs are defined as Spark's primary abstraction, and DataFrames and Datasets are presented as higher-level APIs built on RDDs. The benefits of Spark SQL for structured data are highlighted, and examples demonstrate basic Spark and SQL usage. Finally, Apache Zeppelin and the Hortonworks Sandbox are introduced as tools for interactive data analytics on Spark and Hadoop clusters.