This document summarizes an approach for image-based information retrieval using deep learning and clustering techniques. It begins by discussing how current search engines rely on text-based methods that cannot fully capture image content. The proposed approach uses deep learning to extract visual features from images and hierarchical clustering to organize similar images. Images are initially retrieved based on text queries, then re-ranked based on visual relevance scores to return only images truly relevant to the user's query. The approach was found to reduce the semantic gap between low-level image features and high-level semantics compared to traditional text-based search.