This document provides an introduction and overview of Hadoop. It discusses the brief history of Hadoop, including its origins from Google papers in 2005 and promotion by Yahoo since 2006. It then discusses why Hadoop is useful for big data applications that are petabyte in scale, scalable, robust, and secure. Specific use cases like analytics, reporting, filtering and machine learning on log files, user behavior data, and other structured or unstructured data sources are covered. Finally, it outlines the Hadoop ecosystem and tools like native Java APIs, Pig, Hive, and streaming options for other languages.