User:OrenBochman

=My Info=
 * Name: Oren Bochman
 * Main Project title: "Wikipedia Search"
 * Contact information:
 * IRC: OrenBochman
 * GTalk/GMail: OrenBochman[at]gmail.com
 * Skype Id: OrenBochman
 * my facebook
 * my page on Wiktionary

=SearchNG ToDo List=
 * 1) Get Help with a mediawiki-labs operator to set up a dev/build/test/bechmark lab to test code fixes.
 * 2) The test and code are both OS dependent and the critical (undocumented) features are based on the cluster.
 * 3) Complete the project Risks Assessment
 * 4) Complete the project Test Plan
 * 5) Track project Resouces
 * 6) Do A Project Work Plan
 * 7) Review and Fix the Lucene Search bugs, c.f. Bug Smashing
 * 8) Update Search
 * 9) docs
 * 10) Upgrade to Version 2.9.4 of Lucene.
 * 11) * of 89 compilation errors fixed.
 * 12) * of 943 (777 old and 166 new) warnings unfixed.
 * 13) * of 27 unit test broken.
 * 14) * Code base reduced by two classes.
 * 15) Upgrade to Version 3.5.0 of Lucene
 * 16) Integrate with SOLR
 * 17) Automatic Language Detection
 * 18) Task: automate generating tika profiles for all wiki languages.
 * 19) Search Q&A
 * 20) How to get all the units tests to pass?


 * 1) Links:
 * 2) Labs Page
 * 3) Search extension
 * 4) Search Integration extension.
 * 5) XML Search extension.

=Projects=

Useful Things

 * Bugzilla
 * Subversion
 * Code Review
 * Wikilabs
 * Main Page
 * Security Groups
 * Console Output]


 * Dev
 * Antr - writing grammar based analysers
 * Maven - dependency managment
 * Jenkins - CI
 * Search
 * Lucene
 * SOLR
 * Open Relevence - relevence testing
 * carrot2 clustring
 * R - statistical analysis
 * Tika - language detection, HTML/XML analyzer


 * Linguistics
 * Translate Wiki
 * Cross Language
 * Apertium

My Parser NG pages
Parsing improvements

Social Wiki Ideas
How to make a wiki more social

/WikiJournal/
An insidious master plan to make academics love Wikipedia and change the world of academic publishing in the bargain.

Mediawiki.Org Netiquete

 * to delete a page use:

IRC

 * A list of freenode network has some channels of interest at some time.
 * mediawiki for media wiki discussions
 * wikimedia-tech for server/cluster related discussions (good place to report down time
 * wiktionary for wiktionary issues
 * Openzim for the static dump format
 * Kiwix for the software that reads Openzim
 * Lucene for the search librar
 * Solr for the search engine.
 * Semantic Media Wiki for the search engine.


 * other networks - irc.oftc.net
 * for the parser generator.


 * Initially I used a web access. After some time I decided to use IRC client software.
 * I chose chatzilla. It is a fire fox extension.
 * http://chatzilla.hacksrus.com/faq/