This document provides an overview of big data processing tools and NoSQL databases. It discusses how Hadoop uses MapReduce and HDFS to distribute processing across large clusters. Spark is presented as an alternative to Hadoop. The CAP theorem is explained as relating to consistency, availability, and network partitions. Different types of NoSQL databases are described including key-value, column, document and graph databases. Examples are provided for each type.