Extension:DumpHTML

dumpHTML is an extension for generating a simple HTML dump, including images and media files, of a MediaWiki installation. MediaWiki versions before 1.12.0 used the maintenance script dumpHTML.php instead.

Parameters
dumpHTML does not function like a normal extension; you must run it from the command line.

Example to create a complete snapshot including image and media files and image thumbnail files in directory wikidump (LINUX) /usr/bin/php /srv/www/mediawiki/extension/DumpHTML/dumpHTML.php -d /srv/www/mediawiki/wikidump -k monobook --image-snapshot --force-copy

Problems
If you intend to use the wikidump on a CD/DVD-ROM or on a Windows filesystem, and if your wiki pages or files had non-ASCII characters, which is likely, then you probably need to change the link references, the directories and filenames from UTF-8 (on LINUX) to the character encoding on your Windows, for example to codepage 1252 for Western-European systems. A useful LINUX tool for renaming directories and filenames is convmv hosted on http://freshmeat.net/projects/convmv/. Be aware, that even after converting the filenames browsers like Internet Explorer 6.x and 7.x still have problems to access pages with non-ASCII characters. Firefox 2.x does work. As a final solution for interoperability and a maximum of compatibility on different filesystems, DumpHTML needs to be changed to create wiki snapshots with ISO-9660 filesystem compliant links or with MD5 hashed links and filenames which only use ASCII characters.

The following command changes subdirectories and filenames from UTF-8 to CP1252: convm -f utf-8 -t cp1252 --notest -r /srv/www/mediawiki/wikidump