Extension:DumpHTML

From MediaWiki.org

Jump to: navigation, search
Manual on MediaWiki Extensions
List of MediaWiki Extensions
dumpHTML

Release status: unknown

Implementation Data extraction
Description Creates a simple HTML-dump of a MediaWiki installation.
Download http://svn.wikimedia.org/viewvc/mediawiki/trunk/extensions/DumpHTML/

dumpHTML.php was a script that allows to make a simple HTML-dump of a MediaWiki installation.
You'll need a console-based command like /usr/bin/php for Linux. You can't execute the file with your browser, because you aren't able to use parameters.

In MediaWiki 1.11.x images (and therefore also equations) were broken. See bugzilla:12122 and bugzilla:13061 for details. 1.11.x was the last branch to have dumpHTML.php as a maintenance script. For upcoming releases the functionality was moved to an extension, http://svn.wikimedia.org/viewvc/mediawiki/trunk/extensions/DumpHTML/.

[edit] Parameters

Option/Parameter Description
-d <dest> destination directory
-s <start> start ID
-e <end> end ID
-k <skin> skin to use (defaults to htmldump)
--no-overwrite skip existing HTML files
--checkpoint <file> use a checkpoint file to allow restarting of interrupted dumps
--slice <n/m> split the job into m segments and do the n'th one
--images only do image description pages
--shared-desc only do shared (commons) image description pages
--no-shared-desc don't do shared image description pages
--categories only do category pages
--redirects only do redirects
--special only do miscellaneous stuff
--force-copy copy commons instead of symlink, needed for Wikimedia
--interlang allow interlanguage links
--image-snapshot copy all images used to the destination directory
--compress generate compressed version of the html pages
--udp-profile <N> profile 1/N rendering operations using ProfilerSimpleUDP

For example (Linux):

/usr/bin/php /opt/mediawiki/maintenance/dumpHTML.php -d C:\wikidump -k monobook
Personal tools