The document proposes a deep learning model for recognizing micro-contents in lip reading. The model takes video of speakers pronouncing micro-contents (here, the letters of the English alphabet) as input and recognizes them with a convolutional neural network (CNN), which performs both feature extraction and recognition. The model was tested on a dataset of videos of 11 people pronouncing letters and achieved a recognition rate of 98%.
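
To make the two roles of the CNN concrete, the following is a minimal sketch of a single forward pass: convolution, ReLU, and max pooling act as the feature extractor, and a dense layer with softmax acts as the recognizer over the 26 letters. It is not the architecture from the document. The 12x12 grayscale frame, the four 3x3 kernels, the layer sizes, and the untrained random weights are all assumptions chosen for illustration, and a single frame stands in for the video input.

```python
import math
import random

LETTERS = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"  # the 26 micro-content classes


def conv2d(image, kernel):
    """Valid 2-D cross-correlation of one grayscale image with one kernel."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[i + a][j + b] * kernel[a][b]
                 for a in range(kh) for b in range(kw))
             for j in range(out_w)]
            for i in range(out_h)]


def relu(fmap):
    """Zero out negative activations."""
    return [[max(0.0, v) for v in row] for row in fmap]


def max_pool2x2(fmap):
    """Non-overlapping 2x2 max pooling; a trailing odd row/column is dropped."""
    return [[max(fmap[i][j], fmap[i][j + 1], fmap[i + 1][j], fmap[i + 1][j + 1])
             for j in range(0, len(fmap[0]) - 1, 2)]
            for i in range(0, len(fmap) - 1, 2)]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def extract_features(frame, kernels):
    """Feature extraction stage: conv -> ReLU -> pool per kernel, then flatten."""
    feats = []
    for kernel in kernels:
        pooled = max_pool2x2(relu(conv2d(frame, kernel)))
        feats.extend(v for row in pooled for v in row)
    return feats


def classify(features, weights, biases):
    """Recognition stage: dense layer + softmax, returns (letter, probabilities)."""
    scores = [sum(w * f for w, f in zip(row, features)) + b
              for row, b in zip(weights, biases)]
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    return LETTERS[best], probs


# Hypothetical inputs: a random 12x12 "mouth region" frame and random weights.
rng = random.Random(0)
frame = [[rng.random() for _ in range(12)] for _ in range(12)]
kernels = [[[rng.uniform(-1, 1) for _ in range(3)] for _ in range(3)]
           for _ in range(4)]

features = extract_features(frame, kernels)  # 4 kernels * 5 * 5 = 100 values
weights = [[rng.uniform(-0.1, 0.1) for _ in range(len(features))]
           for _ in LETTERS]
biases = [0.0] * len(LETTERS)

letter, probs = classify(features, weights, biases)
print("predicted letter (untrained, so arbitrary):", letter)
```

Because the weights are random, the predicted letter carries no meaning; in a trained model the kernels and dense weights would be learned from labeled videos, and a rate such as the reported 98% would be the fraction of test samples whose predicted letter matches the label.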