This document proposes a system that automatically generates text summaries of videos using natural language processing techniques. The pipeline extracts the audio track from a video, converts the audio to text, preprocesses the transcript, and applies an extractive summarization method such as TextRank to produce the summary. The goal is to give viewers a concise overview so they can quickly grasp the content or check its relevance without watching the full video. An extractive approach is chosen because it is less computationally intensive and easier to implement than abstractive summarization.
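
To make the extractive step concrete, the following is a minimal sketch of TextRank-style sentence ranking applied to a transcript that has already been produced by the audio-to-text stage. It is an illustration under stated assumptions, not the system's actual implementation: the function names (`split_sentences`, `tokenize`, `similarity`, `textrank_summary`), the tiny stopword list, the regex-based sentence splitter, and the parameter defaults are all hypothetical. Sentences are treated as graph nodes, edge weights come from a word-overlap similarity in the style of the original TextRank formulation, and scores are computed by a PageRank-like power iteration.

```python
import math
import re

# Tiny illustrative stopword list (an assumption, not taken from the source).
STOPWORDS = {"a", "an", "the", "is", "are", "of", "to", "in", "and",
             "it", "for", "on", "with", "this", "that"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tokenize(sentence):
    """Lowercase the sentence and return its set of non-stopword tokens."""
    words = re.findall(r"[a-z0-9']+", sentence.lower())
    return {w for w in words if w not in STOPWORDS}


def similarity(a, b):
    """Shared-word count normalised by the log of each sentence's length."""
    if len(a) < 2 or len(b) < 2:
        return 0.0  # avoids log(1) = 0 in the denominator
    return len(a & b) / (math.log(len(a)) + math.log(len(b)))


def textrank_summary(text, num_sentences=2, damping=0.85, iterations=50):
    """Return the top-ranked sentences, kept in their original order."""
    sentences = split_sentences(text)
    n = len(sentences)
    if n <= num_sentences:
        return sentences

    tokens = [tokenize(s) for s in sentences]
    # Symmetric weight matrix of the sentence graph (no self-loops).
    weights = [[similarity(tokens[i], tokens[j]) if i != j else 0.0
                for j in range(n)] for i in range(n)]
    out_sums = [sum(row) for row in weights]

    # PageRank-style power iteration over the weighted graph.
    scores = [1.0 / n] * n
    for _ in range(iterations):
        new_scores = []
        for i in range(n):
            rank = sum(weights[j][i] / out_sums[j] * scores[j]
                       for j in range(n) if out_sums[j] > 0)
            new_scores.append((1 - damping) / n + damping * rank)
        scores = new_scores

    top = sorted(range(n), key=lambda i: scores[i], reverse=True)[:num_sentences]
    return [sentences[i] for i in sorted(top)]


if __name__ == "__main__":
    transcript = (
        "The video explains how neural networks learn from data. "
        "Neural networks adjust weights to learn patterns in data. "
        "Training neural networks requires large data sets. "
        "Thanks for watching!"
    )
    for sentence in textrank_summary(transcript, num_sentences=2):
        print(sentence)
```

Because the summary is assembled from sentences copied verbatim out of the transcript, no text generation model is needed, which is the main reason an extractive method is cheaper than an abstractive one. In a fuller pipeline the naive splitter and stopword list would likely be replaced by a proper NLP toolkit, and the transcript would come from a speech-to-text component.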