Extension:DumpHTML
From MediaWiki.org
| dumpHTML | Release status: unknown |
|---|---|
| Implementation | Data extraction |
| Description | Creates a simple HTML-dump of a MediaWiki installation. |
| Download | http://svn.wikimedia.org/viewvc/mediawiki/trunk/extensions/DumpHTML/ |
dumpHTML.php was a script that created a simple HTML dump of a MediaWiki installation.
You'll need a command-line PHP interpreter, such as /usr/bin/php on Linux. You can't run the script from your browser, because it takes its options as command-line parameters.
In MediaWiki 1.11.x, images (and therefore also equations) were broken in the dump. See bugzilla:12122 and bugzilla:13061 for details. 1.11.x was the last branch to include dumpHTML.php as a maintenance script. In later releases the functionality was moved to an extension, http://svn.wikimedia.org/viewvc/mediawiki/trunk/extensions/DumpHTML/.
Parameters
| Option/Parameter | Description |
|---|---|
| -d <dest> | destination directory |
| -s <start> | start ID |
| -e <end> | end ID |
| -k <skin> | skin to use (defaults to htmldump) |
| --no-overwrite | skip existing HTML files |
| --checkpoint <file> | use a checkpoint file to allow restarting of interrupted dumps |
| --slice <n/m> | split the job into m segments and process the nth one |
| --images | only do image description pages |
| --shared-desc | only do shared (commons) image description pages |
| --no-shared-desc | don't do shared image description pages |
| --categories | only do category pages |
| --redirects | only do redirects |
| --special | only do miscellaneous stuff |
| --force-copy | copy commons instead of symlink, needed for Wikimedia |
| --interlang | allow interlanguage links |
| --image-snapshot | copy all images used to the destination directory |
| --compress | generate compressed versions of the HTML pages |
| --udp-profile <N> | profile 1/N rendering operations using ProfilerSimpleUDP |
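The --slice and --checkpoint options can be combined to split a large dump into independent jobs that can each be restarted. The following is only a sketch: it prints one command per slice rather than running anything. It assumes that n counts from 1 to m, and that giving each slice its own checkpoint file is sensible; the destination directory, the checkpoint file names, and the function name print_slice_commands are hypothetical. The PHP and script paths are taken from the example on this page.

```shell
#!/bin/sh
# Print (do not run) one dumpHTML.php command per slice.
# Usage: print_slice_commands <dest-dir> <total-slices>
print_slice_commands() {
    dest=$1
    total=$2
    n=1
    while [ "$n" -le "$total" ]; do
        # Assumption: slices are numbered 1..m.
        # The per-slice checkpoint file name is a hypothetical convention.
        printf '%s %s -d %s --slice %s/%s --checkpoint %s\n' \
            /usr/bin/php /opt/mediawiki/maintenance/dumpHTML.php \
            "$dest" "$n" "$total" "$dest/checkpoint-$n.txt"
        n=$((n + 1))
    done
}

# Example: show the commands for a dump split into 4 slices.
print_slice_commands /var/tmp/wikidump 4
```

Each printed line can then be started in its own console, or on a separate machine that has access to the same wiki.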
For example (Linux):
/usr/bin/php /opt/mediawiki/maintenance/dumpHTML.php -d /var/tmp/wikidump -k monobook
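The -s and -e options can also be used to dump a wiki in smaller batches of page IDs. The sketch below prints one command per batch instead of executing it. It assumes that both the start and end IDs are inclusive, which this page does not state; the function name print_range_commands, the destination directory, and the batch size are hypothetical. Adding --no-overwrite means a batch that is run again skips HTML files that already exist.

```shell
#!/bin/sh
# Print (do not run) dumpHTML.php commands covering page IDs first..last
# in batches of at most <step> IDs.
# Usage: print_range_commands <dest-dir> <first-id> <last-id> <step>
print_range_commands() {
    dest=$1
    first=$2
    last=$3
    step=$4
    start=$first
    while [ "$start" -le "$last" ]; do
        end=$((start + step - 1))
        # Clamp the final batch to the last requested ID.
        if [ "$end" -gt "$last" ]; then
            end=$last
        fi
        # Assumption: -s and -e are both inclusive bounds.
        printf '/usr/bin/php /opt/mediawiki/maintenance/dumpHTML.php -d %s -s %s -e %s --no-overwrite\n' \
            "$dest" "$start" "$end"
        start=$((end + 1))
    done
}

# Example: page IDs 1 to 2500 in batches of 1000.
print_range_commands /var/tmp/wikidump 1 2500 1000
```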

