The document traces the evolution of compute APIs from early GPU compute APIs such as CUDA and CTM to current standards such as OpenCL and DirectCompute. It summarizes the key aspects of the first-generation APIs, including their execution model, which is rooted in graphics processing, and the caveats developers have identified. It then proposes that second-generation APIs will be better suited to current hardware designed for compute, by adopting a task-based execution model that maps more directly onto multi-threaded CPU and GPU architectures.