This document discusses how a media platform scaled their use of Spark across AWS to process terabytes of data daily. They moved from two on-premise clusters to running analytics and streaming workloads on AWS while keeping their core workload on-premise, initially using Spark on EMR but then self-managing Spark on EC2 for more flexibility. They implemented auto-scaling of the AWS clusters to maintain utilization targets and handle fluctuating workload demands.