User:OrenBochman/Installation

I have been trying to do some development on media wiki search. It turns out that setting up an environment is not so simple. I decided to document the process in case I need to do it again and for other peoples benefit.

=Notes On Importing Wiktionary=

App Stack Installation

 * 1) installed latest Xampp in D:\xampp\

Problem: Apache won't start
|the dbase (local link)
 * 1) Skype blocked port 80 (resolved)
 * 2) had to change security settings which blocked control panel from starting Apache (resolved)
 * 3) xampp now works - though Zend accelerator is not enabled.

Media Wiki installation

 * 1) decompressed MediaWiki software archive version 1.17 to D:\xampp\htdocs\mediawiki
 * 2) browsed to http://localhost/mediawiki/ and followed instructions
 * 3) however it is necessary to use binary representation and not UTF=8 in db otherwise mwdumper will fail

changeing settings

 * 1) editing LocalSettings.php and adding at the end and then $wgCapitalLinks=false;
 * 2) running cleanUpCaps.php from the command line (this took about 12 hours for 3million entries)

Extensions
require( "extensions/ParserFunctions/ParserFunctions.php"); $wgUseAjax = true; require_once("{$IP}/extensions/CategoryTree/CategoryTree.php"); require_once("$IP/extensions/CharInsert/CharInsert.php"); require_once("$IP/extensions/ImageMap/ImageMap.php"); require_once ("$IP/extensions/LabeledSectionTransclusion/lsth.php"); require_once ("$IP/extensions/SyntaxHighlight_GeSHi/SyntaxHighlight_GeSHi.php");
 * 1) Since My main purpose is Development related (bots and indexing) I wanted to introduce sufficent extetions to allow decent dumping
 * 2) consult ...
 * 3) download
 * 4) decompress to
 * 5) edit LocalSettings.php adding require( "extensions/ParserFunctions/ParserFunctions.php"); to the end
 * 1) char insert
 * 1) image map
 * 1) require_once Labeled Section Transclusion
 * 1) syntax heighlightng
 * 1) oren code end

Importing Contents

 * 1) importing via ... will take forever and has problems with the term Wiktionary in the inter-wiki namespace table
 * 2) importing via mwdumper with some db modification took about 12 hours.
 * 3) unfortunately it crashed every time requiring one to drop the db and reinstall.
 * 4) to speed things up I removed the indexes from the Text Page and Revisions table and later re introduced them

Fixing Capitalization
add to LocalSettings.php:

$wgCapitalLinks=false;

next run maintenece/cleanupCaps.php (12 hours)

Crash
run rebuild:

examining the file does not shed any more light on the problem.

it appears the database is over loaded. reduced packet size to 1 mb and restarted however after a few minutes time out started again.

Diagnostics

 * 1) Then the wiki decided to crash. It gives the error: "Fatal error: Maximum execution time of 30 seconds exceeded in D:\xampp\htdocs\mediawiki\includes\db\DatabaseMysql.php on line 23"
 * 2) It turns out that the problem is not a crash but a slow response on many pages
 * 3) random link works
 * 4) pages generated by random link work too
 * 5) Switching to a second empty db with differnet table suffix works fine too

Conclusions

 * the db is slow
 * some pages have content which causes a time out


 * 1) further diagnostics:
 * 2) dump these heavy duty pages via a dumper....

IRC

 * I initialy used a web access. After some time I decided to use IRC client software.
 * I chose chatzilla. It is a fire fox extension.
 * http://chatzilla.hacksrus.com/faq/
 * freenode network has some channels
 * mediawiki for media wiki discussions
 * wikimedia-tech for server/cluster related discussions (good place to report down time
 * wiktionary for wiktionary issues
 * Openzim for the static dump format
 * Kiwix for the static dump format