Facebook generates large amounts of user data daily from activities like status updates, photo uploads, and shared content. This data is stored in Hadoop using Hive for analytics. Some key facts: - Facebook adds 4TB of new compressed data daily to its Hadoop cluster. - The cluster has 4800 cores and 5.5PB of storage across 12TB nodes. - Hive is used for over 7500 jobs daily and by around 200 engineers/analysts monthly. - Performance improvements to Hive include lazy deserialization, map-side aggregation, and joins.