User:OrenBochman/bots

Some bot ideas

Rule-Based Bots

 * 1) Phonologist. Use TTS code from MBROLA etc. to add IPA, SAMPA, and MBROLA phonetic data for registered languages.
 * 2) IPA to SAMPA etc. conversion.
 * 3) QA and confidence tests against existing IPA.
 * 4) Compound word mode processing.
 * 5) String-matching algorithm to map text n-grams to IPA n-grams (space,phon,phon,phone).
 * 6) Production rule extraction from the above (as per paper).
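
Items 2 and 3 above could start from something like the following. This is only a minimal sketch: the table is a small, partial IPA to X-SAMPA mapping picked for illustration, and the names IPA_TO_XSAMPA, ipa_to_xsampa and unknown_symbols are hypothetical, not taken from MBROLA or any existing bot code.

```python
# Partial IPA -> X-SAMPA table (illustrative only; a real bot needs the full chart).
IPA_TO_XSAMPA = {
    "ʃ": "S", "ʒ": "Z", "θ": "T", "ð": "D", "ŋ": "N",
    "ə": "@", "ɛ": "E", "ɪ": "I", "ʊ": "U", "æ": "{",
    "ɑ": "A", "ɔ": "O", "ː": ":", "ˈ": '"', "ˌ": "%",
}


def ipa_to_xsampa(ipa, table=IPA_TO_XSAMPA):
    """Convert character by character; characters missing from the table pass through unchanged."""
    return "".join(table.get(ch, ch) for ch in ipa)


def unknown_symbols(ipa, table=IPA_TO_XSAMPA):
    """Crude QA check: list the non-ASCII characters the table cannot convert."""
    return sorted({ch for ch in ipa if ch not in table and not ch.isascii()})


if __name__ == "__main__":
    print(ipa_to_xsampa("ˈʃɪp"))
    print(unknown_symbols("ɬa"))
```

A transcription for which unknown_symbols returns a non-empty list could be flagged for manual review instead of being converted automatically.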

Mine Feedback loop
#Get all he.wiktionary entries and add them to en.wiktionary + orthography
 * 1) Mine for data in wikis.
 * 2) Edit terms and store them there.
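
The mining step could list he.wiktionary entry titles through the MediaWiki API (action=query with list=allpages). The sketch below only builds the request URL and reads an already decoded JSON response. It does no network access or editing, and the helper names allpages_url and titles_from_response are hypothetical.

```python
from urllib.parse import urlencode

API = "https://he.wiktionary.org/w/api.php"


def allpages_url(continue_from=None, limit=50):
    """Build a list=allpages query URL for main-namespace pages."""
    params = {
        "action": "query",
        "list": "allpages",
        "apnamespace": 0,
        "aplimit": limit,
        "format": "json",
    }
    if continue_from is not None:
        # Token taken from the previous response to fetch the next batch.
        params["apcontinue"] = continue_from
    return API + "?" + urlencode(params)


def titles_from_response(data):
    """Return (titles, next_token) from a decoded JSON response dict."""
    titles = [page["title"] for page in data.get("query", {}).get("allpages", [])]
    next_token = data.get("continue", {}).get("apcontinue")
    return titles, next_token
```

A bot would fetch each URL in turn, passing next_token back into allpages_url until it comes back as None.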

Template Labeler & Checker

 * 1) Add an ID or MD5 hash to mark template boundaries.
 * 2) Detect template mistakes and mark them with categories.
 * 3) e.g. orphan tags/bad Tidy code.
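
A rough sketch of the labeler and checker, assuming templates are delimited by plain {{ and }} pairs. It ignores triple-brace parameters, nowiki sections and HTML tags, so it is a starting point only. The function name label_templates and the problem descriptions are hypothetical.

```python
import hashlib


def label_templates(wikitext):
    """Return (templates, problems).

    templates: (start, end, md5_hex) for each balanced {{...}} span, innermost first.
    problems:  (offset, description) for each unbalanced brace pair.
    """
    stack, templates, problems = [], [], []
    i = 0
    while i < len(wikitext) - 1:
        pair = wikitext[i:i + 2]
        if pair == "{{":
            stack.append(i)  # remember where this template opens
            i += 2
        elif pair == "}}":
            if stack:
                start = stack.pop()
                body = wikitext[start:i + 2]
                digest = hashlib.md5(body.encode("utf-8")).hexdigest()
                templates.append((start, i + 2, digest))
            else:
                problems.append((i, "orphan closing braces"))
            i += 2
        else:
            i += 1
    # Anything still on the stack was opened but never closed.
    problems.extend((start, "unclosed template") for start in stack)
    return templates, problems
```

The hash identifies a template span by its exact content, so a later pass can tell whether a marked template has been edited since it was labeled.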