This document discusses a project aimed at developing a classifier for detecting benign and malicious websites using MapReduce to improve processing speed compared to traditional methods. It reviews existing approaches for malware classification, evaluates various machine learning algorithms, and highlights challenges such as dealing with imbalanced datasets and high cardinality categorical features. Results show that Random Forest and Decision Tree algorithms achieve high accuracy, while the study emphasizes the need for further experimentation with larger datasets to better assess time efficiency.