The document discusses big data and Hadoop. It provides an introduction to Apache Hadoop, explaining that it is open source software that combines massively parallel computing and highly scalable distributed storage. It discusses how Hadoop can help businesses become more data-driven by enabling new business models and insights. Related projects like Hive, Pig, HBase, ZooKeeper and Oozie are also introduced.