The police collected four datasets of emails: two from Debbie, written before and after the marriage, one from Jamie, and one large reference collection. They analyzed the datasets with three linguistic techniques: word frequencies, keywords (words that occur more often than expected), and the patterns of words around those keywords. Comparing these linguistic features across the datasets could help determine whether the same person wrote the questioned and known emails.
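
The three techniques named above can be illustrated with a minimal Python sketch. It is not the method the police actually used: the log-likelihood keyness score is an assumed, commonly used choice for finding words that occur "more than expected", and the function names (`tokenize`, `log_likelihood`, `keywords`, `kwic`) and the two sample texts are invented placeholders rather than case data.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def log_likelihood(a, b, c, d):
    """Keyness of a word seen a times in c target tokens
    and b times in d reference tokens (assumed scoring method)."""
    e1 = c * (a + b) / (c + d)  # expected count in the target
    e2 = d * (a + b) / (c + d)  # expected count in the reference
    ll = 0.0
    if a:
        ll += a * math.log(a / e1)
    if b:
        ll += b * math.log(b / e2)
    return 2 * ll


def keywords(target_tokens, reference_tokens, top_n=5):
    """Words that are relatively more frequent in the target than in the reference."""
    t, r = Counter(target_tokens), Counter(reference_tokens)
    c, d = len(target_tokens), len(reference_tokens)
    scored = []
    for word, a in t.items():
        b = r[word]  # 0 if the word never appears in the reference
        if a / c > b / d:  # keep only over-represented words
            scored.append((word, log_likelihood(a, b, c, d)))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_n]


def kwic(tokens, keyword, window=3):
    """Keyword-in-context lines: the words on either side of each hit."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


# Invented placeholder texts, not real case data.
questioned = tokenize("hiya hun, hope you are ok. hiya again, speak soon hun")
reference = tokenize("hello, I hope you are well. please reply soon. "
                     "hello again and thank you")

print(Counter(questioned).most_common(3))   # word frequencies
print(keywords(questioned, reference, 2))   # over-represented words
print(kwic(questioned, "hun", window=2))    # patterns around a keyword
```

In a comparison like the one described, the same functions would be run on each known dataset and on the questioned emails, and the resulting keyword lists and context patterns compared side by side. Overlap is suggestive at best, and a toy sample this small says nothing about authorship.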