This document discusses the implementation of a correlated topic model (CTM) using a variational expectation-maximization algorithm in a MapReduce framework to address scalability issues in topic extraction from large document collections. The study evaluates the performance of the proposed approach against the widely used latent Dirichlet allocation (LDA) model, demonstrating comparable topic coherence. Key contributions include a scalable methodology for topic extraction and analysis of crawled documents from a public digital library.