Jsoup is a Java library that allows users to parse HTML and extract and manipulate data from documents. It can be used to scrape and parse HTML from URLs, files, or strings. Jsoup provides methods to navigate documents using DOM traversal or CSS selectors, modify HTML elements and attributes, clean user-submitted content to prevent XSS attacks, and output tidy HTML. Documents can be parsed from URLs, strings, or files and then data can be extracted and elements can be modified using DOM methods or CSS selectors.