User:OrenBochman/bots

Some bot Ideas

Rule Based Bots

 * 1) Phonologist. Use TTS code from Mbrola etc to to add IPA, Sampa, MBrola phonetical data in registered languages.
 * 2) IPA to Sampa etc. conversion.
 * 3) QA and confidence tests on against existing IPA.
 * 4) Compound word mode processing.
 * 5) String matching algorithm to map text n-grams to IPA ngrams (space,phon,phon,phone).
 * 6) production rule extraction from above (as per paper).

Template Labeler & Checker

 * 1) Add ID or MD5 HASH to mark template boundaries.
 * 2) Detect and Mark with categorized template mistakes.
 * 3) e.g. orphan tags/bad tidy code.