Corpus search results

From Zodiac Killer Ciphers Wiki
Revision as of 06:15, 23 June 2012 by Admin (talk | contribs)
Jump to: navigation, search

This experiment involved a systematic search for words and phrases shared between Zodiac's correspondences and a large corpus. The content of Zodiac's correspondences were reduced to a stream of alphabet characters with no spacing or punctuation. The corpus was similarly reduced. Then, all possible substrings of each of Zodiac's correspondences were compared to items in the corpus, and matches are organized from largest to smallest. Matches of the same length are organized from most frequently found to least frequently found.

The corpus used for this experiment was the almost 30,000 books from the Project Gutenberg April 2010 DVD.