The paper discusses a proposed system that utilizes machine learning and map reduce algorithms to predict diseases by analyzing both structured and unstructured medical data, enhancing accuracy to 94.8%. The solution addresses the limitations of current methods by effectively reconstructing incomplete data and providing specific predictions for disease subtypes. The implementation shows improved efficiency and reduced query retrieval times, making it accessible and practical for healthcare applications.