Extension:DumpHTML

dumpHTML is an extension for generating a simple HTML dump, including images and media files, of a MediaWiki installation. MediaWiki versions before 1.12.0 used the maintenance script dumpHTML.php instead.

Parameters
dumpHTML does not function like a normal extension; you must run it from the command line.

Example to create a complete snapshot including image and media files and image thumbnail files in directory wikidump (LINUX) /usr/bin/php /srv/www/mediawiki/extension/DumpHTML/dumpHTML.php -d /srv/www/mediawiki/wikidump -k monobook --image-snapshot --force-copy

Problems
If you intend to use the wikidump on a CD/DVD-ROM or on a Windows filesystem, and if your wiki pages or files had non-ASCII characters, which is likely, then you probably need to change the link references, the directories and filenames from UTF-8 (on LINUX) to the character encoding on your Windows, for example to codepage 1252 for Western-European systems. A useful LINUX tool for renaming directories and filenames is convmv hosted on http://freshmeat.net/projects/convmv/. Be aware, that even after converting the filenames browsers like Internet Explorer 6.x and 7.x still have problems to access pages with non-ASCII characters. Firefox 2.x does work. As a final solution for interoperability and a maximum of compatibility on different filesystems, DumpHTML needs to be changed to create wiki snapshots with ISO-9660 filesystem compliant links or with MD5 hashed links and filenames which only use ASCII characters.

The following command changes subdirectories and filenames from UTF-8 to CP1252: convmv -f utf-8 -t cp1252 --notest -r /srv/www/mediawiki/wikidump

Modified version of DumpHTML

 * Bugzilla 8147 "Filenames in the HTML static dump" had proposed to convert article and picture/media filenames to their MD5-hashed version to enable snapshots to be written to CD/DVDs without filename character encoding problems. Such a modified version of DumpHTML will be posted soon. --Wikinaut 22:27, 28 August 2008 (UTC)

Skin hacking
If you modified your skin (e.g. monobook) then this script will likely fail. Upgrade/update your mediawiki installation and replace any "hacked" skins, then re-try.