User:TJones (WMF)/Notes/Stempel Analyzer Analysis/Error Examples

Below are the examples I found of likely stemming errors. See Stempel Analyzer Analysis for more details.

These tables show the stem (provided by the stemmer), the common substrings (as "beginning .. ending", either of which may be empty), and the words stemmed to the shown stem. The original words are "types"; the number in parentheses after each type is the number of times that word appeared in the corpus (i.e., its number of "tokens").
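To make the "beginning .. ending" column concrete, here is a minimal Python sketch of one way such a row could be computed. It is not the code used for the analysis: the function names `common_substrings` and `format_row`, the pipe-separated row layout, and the English sample words are all hypothetical and chosen only for illustration.

```python
from os.path import commonprefix  # works character-wise on any list of strings


def common_substrings(words):
    """Return the (beginning, ending) shared by every word in `words`."""
    words = list(words)
    if not words:
        return "", ""
    beginning = commonprefix(words)
    # Drop the shared beginning first so the ending can never overlap it.
    remainders = [w[len(beginning):] for w in words]
    # A common suffix is a common prefix of the reversed strings.
    ending = commonprefix([r[::-1] for r in remainders])[::-1]
    return beginning, ending


def format_row(stem, type_counts):
    """Format one row: stem, common substrings, and types with token counts.

    `type_counts` maps each type (distinct word) to its token count.
    The row layout is an assumption, not the layout of the actual tables.
    """
    beginning, ending = common_substrings(type_counts)
    words = ", ".join(f"{w} ({n})" for w, n in type_counts.items())
    return f"{stem} | {beginning} .. {ending} | {words}"


# Hypothetical data: three types standing for ten tokens in total.
sample = {"walked": 3, "walking": 2, "walks": 5}
print(format_row("walk", sample))
print("types:", len(sample), "tokens:", sum(sample.values()))
```

When the words grouped under a stem share neither a beginning nor an ending, both substrings come out empty, which can be a quick hint that the group deserves a closer look.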