This document discusses text document clustering techniques. It provides an overview of clustering algorithms like k-means, k-means++, and k-medoids and compares their performance on text document clustering. It also discusses common clustering validation metrics like purity, F-measure. The paper proposes an improved method for text document clustering that involves preprocessing the dataset to reduce noise, applying clustering algorithms while evaluating with different indexes, and using distance measures like Euclidean distance to improve cluster quality. The proposed method aims to improve the accuracy and efficiency of k-medoids clustering for large datasets.