Export Enhancements

From MediaWiki.org
Jump to: navigation, search

Placeholder for notes on Export Enhancements.

Contents

[edit] Extend the Extension:Collection to provide an API

Basic Use Case: Be able to use the collection extension from the cmd line.


[edit] Interface (list of function from Collection.php)

  • CollectionAddArticle
  • CollectionRemoveArticle
  • CollectionAddCategory
  • CollectionAddChapter
  • CollectionRenameChapter
  • CollectionSetTitles
  • CollectionSetSorting
  • CollectionClear
  • CollectionSuggestAction (ban, add, remove, undo)
  • CollectionGetBookCreatorBoxContent
  • CollectionGetItemList
  • GetMWServeStatus

irrelevant functions (i.e. interactive mode only):

  • CollectionGetPopupData

[edit] Add Collection extension support to dumpHTML, and dumpBackup

Use case: Allow for easier export of portions of a wiki.

[edit] Tasks

  • Add --collection= flag to each script

[edit] Add dumpHTML and dumpBackup support for Collection

Use case: Allow users to export HTML and XML using the Collection interface.

[edit] Tasks

  • Remove the article count limit for dumpHTML/dumpBackup formats
  • Add support for adding a category, without expanding the category's article list

[edit] Provide parsed & semantically annotated XML of article text (dumps & API)

Use case: Make it easier to reuse the data. This was the #1 recommendation of the Wikimedia Strategy Offline Task Force.


[edit] How to get there

  • Build a list of missing semantic information in the current XHTML output
  • Subclass the parser and add
    • semantic information where it is currently missing
    • offer control over how templates are expanded
    • add the new functionality to the API
Personal tools
Namespaces

Variants
Actions
Navigation
Support
Download
Development
Communication
Print/export
Toolbox