Cloudera Spark Training
Cloudera Spark Training
Hands-On Hadoop
Through instructor-led discussion and interactive, hands-on exercises, participants will
navigate the Hadoop ecosystem, learning topics such as:
Using the Spark shell for interactive data analysis
The features of Sparks Resilient Distributed Datasets
How Spark runs on a cluster
Parallel programming with Spark
Writing Spark applications
TRAINING SHEET
Why Spark?
Problems with Traditional Large-Scale
Systems
Introducing Spark
Spark Basics
RDD Lineage
Caching Overview
Distributed Persistence
Spark Streaming
RDD Operations
Logging
HDFS Architecture
Overview
Using HDFS
Conclusion
Overview
A Spark Standalone Cluster
The Spark Standalone Web UI
cloudera.com
1-888-789-1488 or 1-650-362-0488
Cloudera, Inc., 1001 Page Mill Road, Palo Alto, CA 94304, USA
2015 Cloudera, Inc. All rights reserved. Cloudera and the Cloudera logo are trademarks or registered trademarks of Cloudera Inc. in the USA
and other countries. All other trademarks are the property of their respective companies. Information is subject to change without notice.
cloudera-training-sheet-spark-103