Extension:Collection/OpenDocument Export

Overview
The planned extension shall make it possible to export single pages or Collections in OpenDocument Text format (.odt).

See this press release Wikis Go Printable for background information on this project.

The desired output format of the article collections can be chosen on a special MediaWiki page which provides download links for the various formats.

Current Status
This is work in progress. Initial code is available as part of the mwlib python MediaWiki library. (see odwriter.py). Feel free to contribute.

There is a google group for support and discussion of mwlib and derived applications.

See this page for installation instructions.

Help needed!
While the basic export works, documents still look poorly layouted. People with some expertise in odf-styles are very welcome to contribute.

Generating ODF from the DOM
The DOM is generated by a python library (mwlib) and could be translated to ODF using the ODFPY library.

This option was chosen for the initial code development

xhtml -> docbook -> odf
One possible option to convert wiki syntax to ODF is docbook2odf (GPL), a toolkit that automatically converts DocBook to OASIS OpenDocument (ODF, the ISO standardized format used for texts, spreadsheets and presentations). Conversion is based on a XSLTransformation which makes it easy to convert DocBook to ODT documents which are XML based.

xhtml2docbook conversion tools:
 * Html2DocBook by JeffBeal
 * XSLT from xmlblaster.org (GPL)
 * html2dbk (Artistic License)

xhtml2odf
The ODFPY-README states: Alex Hudson has been contracted to produce a command-line html2odf converter. It should include support for images, tables, CSS, etc. He will provide a C# version first, and later a C version.

Related

 * Extension:PDF_Writer
 * Extension:XML_Bridge
 * Extension:Collection