Extension:Proofread Page

The Proofread Page extension creates a book either:
 * 1) as a column of OCR text beside a column of scanned images, or
 * 2) broken into chapters or poems. The content of a document appears in the mediawiki page. (transclusion).

The extension is intended to allow easy comparison of text to the original digitization.

This extension shows the text in several ways without actually duplicating the original text.

Use
The extension is installed on all 1>s:mul:|Wikisource wikis. For the syntax, see 2>s:mul:Wikisource:ProofreadPage. It is also used on 3>wikilivres:|Bibliowiki.

Requirements and recommendations

 * Access to the command line is required if running the update script (maintenance/update.php) from the web browser fails (2>Upgrade|documentation here and 3>Manual:Update.php|here).
 * If you want to use DjVu files (optional but recommended), a native DjVu handler needs to be available for configuration. See also 1>Manual:How to use DjVu with MediaWiki.
 * In addition, use of ProofreadPage is highly improved by the use of the following extensions:
 * LabeledSectionTransclusion (strongly recommended)
 * Cite (default page footer contains
 * Poem
 * PdfHandler (may require additional PHP packages) — adds PDF support
 * PagedTiffHandler
 * ParserFunctions

Thumbnailing
The extension links directly to image thumbnails which often don't exist. You must catch 404 errors and generate the missing thumbnails. You can do this with any one of these solutions:


 * Set an Apache RewriteRule in .htaccess to 3>Special:MyLanguage/Manual:Thumb.php|thumb.php for missing thumbnails:
 * or set the Apache 404 handler to Wikimedia's thumb-handler. This is a general-purpose 404 handler with Wikimedia-specific code, not simply a thumbnail generator.
 * For MediaWiki >= 1.20, you can simply redirect to thumb_handler.php</>:
 * Or in <tvar|1>apache2.conf</>:

WARNING: There is an <tvar|1></> file in the images directory that may interfere with any <tvar|2>.htaccess</> rules you install.

Namespaces
ProofreadPage create by default two custom namespaces named "Page" and "Index" in English with respectively ids 250 and 252.

Their names are translated if your wiki use an other language. <tvar|1></>.

You can customize their name or their id: 1>Special:MyLanguage/Manual:Using_custom_namespaces#Creating_a_custom_namespace</>|Create namespaces by hand and set their ids in <tvar|2></> using <tvar|3>$wgProofreadPageNamespaceIds</> global. You will do something like:

Namespace id customization is not recommended and might not be supported in the future.

Configuration

 * In order to use the 1>:s:Help:Page status</>|page quality system, it is necessary to create five categories. The names of these categories must be defined in <tvar|2>Mediawiki:Proofreadpage_quality0_category</> to <tvar|3>Mediawiki:Proofreadpage_quality4_category</>.
 * Ensure that you have installed <tvar|1></>

Configuration of index namespace
The configuration is a JSON array of properties. Here is the structure of a property in the array, all the parameters are optional, the default value are set:
 * You need to create <tvar|1>MediaWiki:Proofreadpage_index_template</> in order to display index pages. This page is a template that receive as parameter entries of the edition form.
 * You need to create <tvar|1>MediaWiki:Proofreadpage_index_data_config</> that contain the configuration of the index form. This new configuration page overrides <tvar|2>MediaWiki:Proofreadpage_index_attributes</> and <tvar|3>MediaWiki:Proofreadpage_js_attributes</>.

The data parameter can have for value: "type", "language", "title", "author", "translator", "illustrator", "editor", "school", "year", "publisher", "place", "progress"

Page separator
The extension puts a separator between every transcluded page and the next, which is defined by <tvar|1> </>. The default value is <tvar|2> </> (a whitespace). Set <tvar|3> </> to suppress the separator.

Join hyphenated words across pages
When a word is hyphenated between a page and the next, the extension joins together the two halves of the word. Example: his- and tory becomes history. The "joiner" character is defined by <tvar|1> </> and defaults to '-' (the regular hyphen character).

Creating your first page (example with DjVu)

 * Before following these steps ensure you have followed the instructions in Using DjVu with MediaWiki.
 * (when and in which namespace is the djvu file itself uploaded?)
 * Create a page in the "Page" namespace (or the internationalized name if you use an not-English wiki). For example if your namespace is <tvar|1>'Page'</> create <tvar|2> </>
 * Create the corresponding file for this page <tvar|1>commons:File:Carroll - Alice's Adventures in Wonderland.djvu</> (or set <tvar|2>Manual:$wgUseInstantCommons</> to <tvar|3> </>).
 * Create the index page <tvar|1> </>
 * Insert the tag <tvar|1> </> in the Pages field to visualize the page list
 * To edit page 5 of the book navigate to <tvar|1>'Page:Carroll - Alice's Adventures in Wonderland/5'</> and click edit

Syntax
This extension introduces the following tags: <tvar|1> </> and <tvar|2> </>.