Extension talk:DumpHTML

From MediaWiki.org
Jump to: navigation, search
An archive box Archives 

/Archive 1

Start a new discussion

Contents

Thread titleRepliesLast modified
How to export Monobook Skin to the HTMLdump?113:13, 10 April 2012
Broken since MediaWiki 1.17.0?115:39, 3 February 2012
Does it support ForeignFileRepos?008:47, 20 January 2012
Unicode diacritic character on dumped html010:37, 10 January 2012
Bug with german umlauts in filenames of images010:37, 10 January 2012
Error110:36, 10 January 2012
Howto provide login data110:36, 10 January 2012
Minor: pages vrs page titles010:34, 10 January 2012
PostgreSQL and Mediawiki 1.12.0210:34, 10 January 2012

How to export Monobook Skin to the HTMLdump?

When I export with the HTMLdump extension it always uses the Offline Skin. And its realy ugly. How can I make my Export to the Monobook Skin? Thank you.

—The preceding unsigned comment was added by an unknown user on a unknown date.10:35, 10 January 2012

The tool indeed seemingly ignores the -k switch. Does anyone know when it broke?

134.130.21.8513:13, 10 April 2012
 

Broken since MediaWiki 1.17.0?

It appears that starting with MediaWiki 1.17.0, dumpHTML no longer downloads relevant CSS/JavaScript bits. This appears to be a result of the ResourceLoader module, via load.php. This module and ordeal is documented in the MediaWiki 1.17.0 release notes.

Even tools like "wget -r" won't solve this problem. So given this change, what exactly are people doing for creating HTML archives of Wiki pages for offline or Intranet use?

—The preceding unsigned comment was added by an unknown user on a unknown date. -- 23:00, 9 January 2012 (UTC)10:38, 10 January 2012

That would be very interesting for me too!!

62.214.112.3515:39, 3 February 2012
 

Does it support ForeignFileRepos?

Hi. I'm storing my images on commons. I'm trying to run this extension, but it stops after a few pages and I cannot figure out why. Maybe not finding the images on the local filesystem makes it crash?

88.187.236.23208:47, 20 January 2012

Unicode diacritic character on dumped html

My chars, mostly diacritic chars in dumped htmls seem changed.

Examples:

  • Saṃyutta Nikāya -> Sa峁儁utta Nik膩ya
  • … -> 鈥�
  • Soṇadaṇḍa Sutta -> So峁嘺da峁囜笉a

Does anyone have similar issue?

Thanks.

Benzwu10:37, 10 January 2012

Bug with german umlauts in filenames of images

When there is an umlaut in the filename of an image, the image will be saved in the dump but with a wrong name - the link in the HTML is not working. Does somebody know how to fix this?

--212.114.205.190 12:51, 29 September 2011 (UTC)10:37, 10 January 2012

When i try to run the script, i always get the error message: default users are not allowed to read, please specify (--group=sysop). I also tried it with this option, but then i become the error message "the specified user group is not allowed to read". Any ideas? :)

—The preceding unsigned comment was added by an unknown user on a unknown date.10:36, 10 January 2012

The group "user" works for me

--212.114.205.190 11:49, 29 September 2011 (UTC)10:36, 10 January 2012
 

Howto provide login data

I have a MW V1.16.0 with the Lockdown-extension installed and need a username and password to look at it. It is a Windows system so I used the modified Version of dumphtml that produces the hash-filenames. How can i provide username and password with DumpHTML? First time using, DumpHTML asked me providing a -group parameter and i did. DumpHTML then produced a lot of stuff. But i cant login to the index site in the static wiki. Furthermore some (many,most) pages and their pathes are missing in the static wiki. The page "login required" (Anmeldung erforderlich in German) exist multiple, multiple times. Everytime a page is shown it is this one, each on another path and filename. I tried to give 'read' permission to * in localsettings.php, but then the extension produces a static wiki without any style, pictures...and even many pages and paths linked to do not exist.

—The preceding unsigned comment was added by an unknown user on a unknown date.10:35, 10 January 2012

get the version for MW 1.16 - there is a --group parameter. use the group "user" (sysop doesn't work for me)

--212.114.205.190 11:48, 29 September 2011 (UTC)10:36, 10 January 2012
 

Minor: pages vrs page titles

"If you intend to use the wikidump on a CD/DVD or on a Windows filesystem, and if your wiki pages or files had non-ASCII characters, which is likely, then you probably need to change the link references,"

Should this be "if your wiki page titles or filenames had non-ASCII characters"? If correct, it would be much clearer.

—The preceding unsigned comment was added by an unknown user on a unknown date.10:34, 10 January 2012

PostgreSQL and Mediawiki 1.12.0

If the error is

DB connection error: No database connection

It may be a problem logging into the database. Looking in the PostgreSQL logs revealed I had to adapt pg_hba.conf

--Albert25 11:45, 18 January 2011 (UTC)10:33, 10 January 2012

This is the error message I get when I try to execute the dumpHTML.php file on my local machine. Does anybody know a fix for that?

—The preceding unsigned comment was added by an unknown user on a unknown date.10:34, 10 January 2012
 

I have a similar problem which I cant solve:

DB connection error: Ein Verbindungsversuch ist fehlgeschlagen, da die Gegenstel
le nach einer bestimmten Zeitspanne nicht richtig reagiert hat, oder die hergest
ellte Verbindung war fehlerhaft, da der verbundene Host nicht reagiert hat.
 (localhost)
Mediawiki 13.3, Win7, Mowes webserver, just installed php5.3 without webserver support to execute the dumpHTML.php
I can trace the problem to occur in includes/db/Loadbalancer.php function reallyOpenConnection, $db = new $class( $host, $user, $password, $dbname, 1, $flags ); just times out
database class is DatabaseMysql (I cant find the source), host, database name, user and password are correct
any help?
—The preceding unsigned comment was added by an unknown user on a unknown date.10:34, 10 January 2012
 
Personal tools
Namespaces

Variants
Actions
Navigation
Support
Download
Development
Communication
Print/export
Toolbox