The document discusses scaling service composition engines to leverage multicore architectures. It proposes a topology-aware deployment approach that replicates the engine architecture across cores instead of just increasing threads. Each replica's threads would be bound to specific affinity groups, and resources like memory and threads distributed proportionally among replicas based on hardware resources and number of replicas. An example shows binding two engine instances to separate sets of cores instead of letting all threads span all cores. This improves scalability over a single instance approach.