This document discusses the marketing funnel abstraction and workflows for processing large-scale clickstream data. It describes using a marketing funnel model to analyze customer behavior and calculate metrics like cost per acquisition. The document outlines some of the complexities in working with real-world clickstream data at large scales. It then provides a historical example of building a Hadoop application in 2008 to process billions of events for an online advertising company. This highlighted needs for improved workflow abstractions. The Cascading open source project is introduced as addressing some of these needs.