Topic on Project:Support desk

Page titles of HTML files imported via Html2Wiki

Bisherbas (talk | contribs)

By default, file names are used as the page titles. How can I extract some text from within the HTML file and use it as the wiki page title instead?

MediaWiki v.1.30. Thanks!

Gryllida (talk | contribs)

Currently the extension does not seem to have such an option.

It uses pandoc to convert HTML to wiki markup, so the HTML page title is possibly just discarded during the conversion.

https://github.com/wikimedia/mediawiki-extensions-Html2Wiki/blob/master/specials/SpecialHtml2Wiki.php#L237 suggests that mArticleTitle will be "the full value of mCollectionName (if any) plus mArticleSavePath plus the file name MINUS any extension."

I don't think the extension even parses the HTML contents to look for a title. It may be possible to add this to the extension.
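To illustrate the idea, here is a minimal sketch in Python (not the extension's PHP, and not how Html2Wiki currently works) that takes the text of the first `<title>` element as the page title and falls back to the file name minus its extension. The names `TitleParser` and `page_title` are made up for this example, and prefixing with the collection name and save path is left out.

```python
from html.parser import HTMLParser
from pathlib import PurePosixPath


class TitleParser(HTMLParser):
    """Collects the text inside the first <title> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self._done = False
        self._parts = []

    def handle_starttag(self, tag, attrs):
        # Only the first <title> is considered.
        if tag == "title" and not self._done:
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._done = True

    def handle_data(self, data):
        if self._in_title:
            self._parts.append(data)

    @property
    def title(self):
        # Collapse runs of whitespace and trim the ends.
        return " ".join("".join(self._parts).split())


def page_title(html_text, file_name):
    """Return the HTML <title> text, or the file name minus its extension."""
    parser = TitleParser()
    parser.feed(html_text)
    parser.close()
    if parser.title:
        return parser.title
    # Fallback: same idea as the current default (file name without extension).
    return PurePosixPath(file_name).stem


print(page_title("<html><head><title>Install Guide</title></head></html>", "install.html"))
print(page_title("<p>No title here</p>", "docs/setup.html"))
```

A real implementation would also need to deal with characters that are not allowed in wiki page titles, which this sketch ignores.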

I would suggest asking at https://www.mediawiki.org/wiki/Extension_talk:Html2Wiki instead; the extension's developers and users may have that page on their watchlists.

Gryllida (talk | contribs)

The project also has a Phabricator page at https://phabricator.wikimedia.org/project/profile/1094/. If you create a new task there titled "use html title for the article title" and tag it with the project name, it is more likely to reach the relevant people.

Bisherbas (talk | contribs)

@Gryllida Thank you!
