The document is an introduction to big data and Hadoop that discusses:
1) What big data is and common use cases across different industries.
2) The characteristics of big data according to IBM.
3) An overview of the Hadoop ecosystem including HDFS, MapReduce, YARN and other related frameworks.
4) How Hadoop allows for distributed processing of large datasets across clusters of machines more efficiently than traditional systems.