Topic on Talk:Parsoid

How to convert Wikitionary dump to html?

2
Summary by Arlolra

There hasn't been any effort to make Parsoid usable with those dumps.

DungLe94 (talkcontribs)

I've just found from this link that Parsoid which is a perfect tool to convert Wikitionary dump to html. I've downloaded the latest dump from here. However, I could not find any instruction to use Parsoid on this offline dump. Could you please elaborate on this issue?


Thank you so much for your help!

Dung Le.

Arlolra (talkcontribs)

There hasn't been any effort to make Parsoid usable with those dumps.

There are often questions about how to use Parsoid offline. See past discussions, https://lists.wikimedia.org/pipermail/wikitext-l/2020-February/000994.html https://lists.wikimedia.org/pipermail/wikitext-l/2020-April/000999.html https://www.mediawiki.org/wiki/Topic:Uko9gbijtxv2nh19

But so far Parsoid is mostly useful when it has access to a MediaWiki API to fetch configuration and resolve templates.

If you wanted to use the source from the dump, you could do something like cat "text from source" | php bin/parse.php --domain fr.wiktionary.org --wt2html and that will output some html. Alternatively, you can use the titles from the dump and fetch the html from the REST API, https://fr.wiktionary.org/api/rest_v1/page/html/bonjour