Расширение:Proofread Page

From mediawiki.org
This page is a translated version of the page Extension:Proofread Page and the translation is 14% complete.
Outdated translations are marked like this.
Справка по расширениям MediaWiki
Proofread Page
Статус релиза: стабильно
Реализация Действие страницы , ContentHandler , Тег , API , База данных
Описание The Proofread Page extension can render a book either as a column of OCR text beside a column of scanned images, or broken into its logical organization (such as chapters or poems) using transclusion.
Автор(ы) ThomasV (original author)
Tpt (current maintainer)
Последняя версия continuous updates
Политика совместимости Snapshots releases along with MediaWiki. Master is not backward compatible.
MediaWiki current master
PHP 7.0+
Изменения в БД Да
Composer mediawiki/proofread-page
Таблицы pr_index
Лицензия GNU General Public License 2.0 или позднее
Скачать
Помощь Help:Extension:ProofreadPage
Пример s:Index:Wind in the Willows (1913).djvu
Page, Index
  • $wgProofreadPagePageJoiner
  • $wgProofreadPagePageSeparator
  • $wgProofreadPageNamespaceIds
  • $wgProofreadPageEnableEditInSequence
  • $wgProofreadPageBookNamespaces
  • $wgProofreadPageUseStatusChangeTags
  • $wgProofreadPagePageSeparatorPlaceholder

  • pagequality
  • pagequality-admin
  • pagequality-validate
Переведите расширение Proofread Page, если оно доступно на translatewiki.net
Проблемы Открытые задачи · Сообщить об ошибке

Proofread Page extension

2020 Coolest Tool
Award Winner

in the category
Impact


Расширение Proofread Page создаёт книгу:

  1. либо как колонку распознанного посредством OCR текста рядом с колонкой отсканированных изображений,
  2. либо разделенную на главы или стихи. Контент появляется на странице MediaWiki (через трансклюзию).

Расширение предназначено для легкого сравнения текста с оригинальной оцифровкой.

This extension shows the text in several ways without actually duplicating the original text.[1]

Применение

Расширение установлено на всех вики Викитеки. Для синтакса, см. $1. Расширение также используется в Библиовики.

Требования и рекомендации

  • If you want to use DjVu files (optional but recommended), a native DjVu handler needs to be available for configuration.

См. также Manual:How to use DjVu with MediaWiki .

  • In addition, use of ProofreadPage is highly improved by the use of the following extensions:


Установка

Расширение

  • Скачайте и распакуйте файл(ы) в папку с названием ProofreadPage в вашей папке extensions/.
    Вместо этого разработчикам и соавторам кода следует установить расширение из Git, используя:cd extensions/
    git clone https://gerrit.wikimedia.org/r/mediawiki/extensions/ProofreadPage
  • Добавьте следующий код в конце вашего файла LocalSettings.php :
    wfLoadExtension( 'ProofreadPage' );
    
  • Выполните скрипт обновления, который автоматически создаст необходимые таблицы, используемые расширением.
  • Yes Готово – Перейдите на страницу Special:Version на своей вики, чтобы удостовериться в том, что расширение успешно установлено.

Thumbnailing

The extension links directly to image thumbnails which often don't exist. You must catch 404 errors and generate the missing thumbnails. You can do this with any one of these solutions:

  • Set an Apache RewriteRule in .htaccess to thumb.php for missing thumbnails:
    RewriteEngine On
    RewriteCond %{REQUEST_FILENAME} !-f
    RewriteCond %{REQUEST_FILENAME} !-d
    RewriteRule ^/w/images/thumb/[0-9a-f]/[0-9a-f][0-9a-f]/([^/]+)/page([0-9]+)-?([0-9]+)px-.*$ /w/thumb.php?f=$1&p=$2&w=$3 [L,QSA]

This is a general-purpose 404 handler with Wikimedia-specific code, not simply a thumbnail generator.

    ErrorDocument 404 /w/extensions/upload-scripts/404.php
  • For MediaWiki >= 1.20, you can simply redirect to thumb_handler.php:
    RewriteEngine On
    RewriteCond %{REQUEST_FILENAME} !-f
    RewriteCond %{REQUEST_FILENAME} !-d
    RewriteRule ^/w/images/thumb/[0-9a-f]/[0-9a-f][0-9a-f]/([^/]+)/page([0-9]+)-?([0-9]+)px-.*$ /w/thumb_handler.php [L,QSA]
  • or in apache2.conf:
    ErrorDocument 404 /w/thumb_handler.php
Внимание Внимание: There is an .htaccess file in the images directory that may interfere with any .htaccess rules you install.

If you encounter a problem similar to the following:

  • phab:T301291PDF and DjVu files on Commons failed to be processed (no thumbnails, zero pages) but otherwise valid
  • phab:T298417Undeleted DjVu files show incorrect metadata: 0x0 size, no page number info
  • phab:T299521PDF file has 0x0 image size in Commons after uploading a new version while the page number is correct

Try next steps:

  1. repair thumbnails for DjVu files of the core MediaWiki (for PDF use mimetype application/pdf )
    php maintenance/refreshImageMetadata.php --verbose --mime image/vnd.djvu --force
    
  2. needed for actualization info about the pages counts of the Special:IndexPages
    php maintenance/refreshLinks.php --namespace 252
    

Namespaces

ProofreadPage create by default two custom namespaces named "Page" and "Index" in English with respectively ids 250 and 252.

Their names are translated if your wiki use another language. Full list.

You can customize their name or their ID: Create namespaces by hand and set their IDs in Руководство:LocalSettings.php using $wgProofreadPageNamespaceIds global. You will do something like:

define( 'NS_PROOFREAD_PAGE', 250);
define( 'NS_PROOFREAD_PAGE_TALK', 251);
define( 'NS_PROOFREAD_INDEX', 252);
define( 'NS_PROOFREAD_INDEX_TALK', 253);
$wgExtraNamespaces[NS_PROOFREAD_PAGE] = 'Page';
$wgExtraNamespaces[NS_PROOFREAD_PAGE_TALK] = 'Page_talk';
$wgExtraNamespaces[NS_PROOFREAD_INDEX] = 'Index';
$wgExtraNamespaces[NS_PROOFREAD_INDEX_TALK] = 'Index_talk';
$wgProofreadPageNamespaceIds = array(
    'index' => NS_PROOFREAD_INDEX,
    'page' => NS_PROOFREAD_PAGE
);

Namespace id customization is not recommended and might not be supported in the future.

Настройка

The names of these categories must be defined in MediaWiki:Proofreadpage_quality0_category to MediaWiki:Proofreadpage_quality4_category.

Configuration of index namespace

For more details, see Extension:Proofread Page/Index data configuration

This page is a template that receive as parameter entries of the edition form.

This new configuration page overrides MediaWiki:Proofreadpage_index_attributes and MediaWiki:Proofreadpage_js_attributes.

The configuration is a JSON array of properties. Here is the structure of a property in the array, all the parameters are optional, the default value are set:

{
  "ID": { //id of the metadata (first parameter of proofreadpage_index_attributes)
    "type": "string", //the property type (for compatibility reasons the values have not to be of this type). Possibles values: string, number, page. If set, the newly set values should be valid according to the type (e.g. for a number a valid number, for a page an existing wiki page...)
    "size": 1, //only for the type string : number of lines of the input (third parameter of proofreadpage_index_attributes)
    "values":  {"a":"A", "b":"B","c":"C", "d":"D"}, //an array values : label that list the possible values (for compatibility reasons the stored values have not to be one of these)
    "default": "", //the default value
    "header": false, //add the property to MediaWiki:Proofreadpage_header_template template (true is equivalent to being listed in proofreadpage_js_attributes)
    "label": "ID", //the label in the form (second parameter of proofreadpage_index_attributes)
    "help": "", //a short help text
    "delimiter": [], //list of delimiters between two part of values. By example ["; ", " and "] for strings like "J. M. Dent; E. P. Dutton and A. D. Robert"
    "data": "" //proofreadpage's metadata type that the property is equivalent to
  }
}

The data parameter can have for value:

"type", "language", "title", "author", "translator", "illustrator", "editor", "school", "year", "publisher", "place", "progress"

Page separator

The extension puts a separator between every transcluded page and the next, which is defined by wgProofreadPagePageSeparator. The default value is   (a whitespace). Set wgProofreadPagePageSeparator = "" to suppress the separator.

Join hyphenated words across pages

When a word is hyphenated between a page and the next, the extension joins together the two halves of the word. Example: his- and tory becomes history. The "joiner" character is defined by wgProofreadPagePageJoiner and defaults to '-' (the ASCII hyphen character).

Configure change tagging (optional)

See Change tagging to set up change tags.

Usage

Creating your first page (example with DjVu)

  • (when and in which namespace is the DjVu file itself uploaded?)
  • Create a page in the "Page" namespace (or the internationalized name if you use an not-English wiki).

For example if your namespace is 'Page' create Page:Carroll - Alice's Adventures in Wonderland.djvu

  • Create the index page Index:Carroll - Alice's Adventures in Wonderland.djvu
    • Insert the tag ‎<pagelist /> in the Pages field to visualize the page list
  • To edit page 5 of the book navigate to 'Page:Carroll - Alice's Adventures in Wonderland/5' and click edit

Syntax

This extension introduces the following tags:

‎<pages>, ‎<pagelist>

Notes

  1. Because the pages are not in the main namespace, they are not included in the statistical count of text units.

См. также