Reading/Web/PDF Functionality

Update January 2018

We're currently preparing performance tests of the PDF to book function. We should know more in early February.

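Update September 2017

Our current PDF rendering service, the Offline Content Generator (OCG), is no longer maintained. Put plainly, it is broken. The Wikimedia Foundation Reading team has decided to replace the service within the next few months. OCG was originally created by a third party and runs on outdated code, which may lead to security vulnerabilities and other serious problems in the future. Over the past three months, we placed a banner on the PDF creation page asking for feedback on the new rendering mode. The new renderer will improve on OCG's capabilities: it will be able to print tables and infoboxes, and it will include styles designed for optimal legibility. We have received a lot of positive feedback on the new mode and are working on the updates the new PDF generator needs.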

Later addendum: Turning PDF book rendering OFF for the short term

Unfortunately, major issues with our old renderer (OCG) will require us to remove it as a rendering option prior to completing the necessary updates for the books feature. This is earlier than we wanted. By the time we remove OCG, the work for rendering of single articles will be completed. However, the rendering of books will be paused while we evaluate and complete the necessary work. Our initial choice of renderer for the replacement, the Electron rendering service, is not capable of supporting PDFs of larger sizes and fails when attempting to render a book with multiple articles. We will be working to select a new rendering system for books which can handle the size of the files and support our requirements. This is not how we planned to do this. We never aimed to temporarily remove the book PDF functionality.

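Timeline:

  • Release of full-featured renderer for single articles (print to PDF) – Oct 1, 2017
  • Pause of book PDF rendering – Oct 1, 2017
  • Complete retirement of the OCG renderer – Oct 1, 2017
  • Release of full-featured renderer for books – Nov-Dec 2017 (alternative rendering system, with the release planned according to research results)

Functionality: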

For a full list of current and upcoming functionality, see below.

In addition to updates on this page, this will be communicated in a banner on the PDF creation page, in Tech News, and on some Wikimedia mailing lists.

Introduction

Our current PDF rendering service, the Offline Content Generator (OCG), is no longer maintained. Simply put, it is out of date. It was originally architected by a third party and still runs on old code, which may lead to security vulnerabilities and other large-scale problems in the future. If we still want PDF functionality, we will unfortunately have to replace it; otherwise we could suddenly find ourselves without this functionality and without a plan.

Additionally, OCG does not support a number of rendering requests from the community, the main one being the ability to render tables. We have selected a new service, the Electron rendering service, as a suitable replacement. Our next step is to duplicate the functionality provided by OCG using the Electron rendering service. Below, we describe the main portions of the functionality we have identified as necessary. We would like to invite conversation about what is missing or superfluous in the provided list. We would also like to highlight our future plans for PDF rendering to gather initial feedback.

Userbase

The following table shows a sample of traffic to the Electron "Download as PDF" service over a 6-hour period. The traffic is broken down by operating system (OS), browser, and browser major version (e.g. Windows 7, Chrome v61.*).

Note that the majority of our traffic appears to come from Windows-based machines.

OS             Browser         Major version  % of requests
Other          Other           -              14.38
Windows 7      Chrome          61             12.42
Windows 10     Chrome          61             8.83
Windows 7      IE              11             7.33
Windows 7      Firefox         56             6.59
Windows 10     Firefox         56             3.82
Windows 10     Edge            15             3.24
Windows 8.1    Chrome          61             3.07
Windows XP     Chrome          49             2.2
Windows 10     Chrome          59             1.53
Windows 10     IE              11             1.51
Windows 8.1    Firefox         56             1.31
Windows XP     Firefox         52             1.22
Windows 8      Chrome          61             1.15
Windows 8.1    IE              11             1.15
Mac OS X       Safari          11             0.9
Windows 7      Firefox         53             0.89
Windows 7      Firefox         52             0.78
Ubuntu         Firefox         56             0.78
Windows XP     IE              6              0.7
Windows 7      Chrome          55             0.68
Windows 7      Firefox         55             0.62
Mac OS X       Chrome          61             0.62
Android        UC Browser      11             0.6
Windows 10     Edge            14             0.59
Windows 7      Opera           48             0.53
Android        Chrome Mobile   61             0.49
Windows 10     Opera           48             0.44
Windows 7      Chrome          60             0.4
Windows Vista  Chrome          49             0.39
Windows 7      Yandex Browser  17             0.37
Windows 10     Firefox         55             0.37
Mac OS X       Safari          10             0.36
Windows 10     Chrome          50             0.34
Android        Android         4              0.33
Mac OS X       Firefox         56             0.33
Windows 10     Chrome          60             0.32
Windows 8.1    Chrome          43             0.3
Android        Amazon Silk     60             0.29
Windows 7      Sogou Explorer  1              0.27
Windows 8      IE              10             0.26
Windows 7      IE              8              0.26
Windows 7      IE              9              0.25
Windows 8      Opera           12             0.25
Linux          Firefox         52             0.25
Mac OS X       Firefox         53             0.24
Windows 7      Firefox         45             0.24
Windows 10     Firefox         57             0.24
Windows 7      Firefox         38             0.22
Windows 10     Firefox         47             0.21
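
The observation that most traffic comes from Windows machines can be checked by summing the percentages per operating system family. The following is a minimal sketch rather than part of the original analysis: the rows are copied from the table above, while the function names and the grouping rule (every "Windows ..." row counts as Windows, and Ubuntu is counted under Linux) are assumptions made for illustration.

```python
import csv
from collections import defaultdict
from io import StringIO

# Rows copied from the table above: OS, browser, major version, % of requests.
SAMPLE = """\
Other,Other,-,14.38
Windows 7,Chrome,61,12.42
Windows 10,Chrome,61,8.83
Windows 7,IE,11,7.33
Windows 7,Firefox,56,6.59
Windows 10,Firefox,56,3.82
Windows 10,Edge,15,3.24
Windows 8.1,Chrome,61,3.07
Windows XP,Chrome,49,2.2
Windows 10,Chrome,59,1.53
Windows 10,IE,11,1.51
Windows 8.1,Firefox,56,1.31
Windows XP,Firefox,52,1.22
Windows 8,Chrome,61,1.15
Windows 8.1,IE,11,1.15
Mac OS X,Safari,11,0.9
Windows 7,Firefox,53,0.89
Windows 7,Firefox,52,0.78
Ubuntu,Firefox,56,0.78
Windows XP,IE,6,0.7
Windows 7,Chrome,55,0.68
Windows 7,Firefox,55,0.62
Mac OS X,Chrome,61,0.62
Android,UC Browser,11,0.6
Windows 10,Edge,14,0.59
Windows 7,Opera,48,0.53
Android,Chrome Mobile,61,0.49
Windows 10,Opera,48,0.44
Windows 7,Chrome,60,0.4
Windows Vista,Chrome,49,0.39
Windows 7,Yandex Browser,17,0.37
Windows 10,Firefox,55,0.37
Mac OS X,Safari,10,0.36
Windows 10,Chrome,50,0.34
Android,Android,4,0.33
Mac OS X,Firefox,56,0.33
Windows 10,Chrome,60,0.32
Windows 8.1,Chrome,43,0.3
Android,Amazon Silk,60,0.29
Windows 7,Sogou Explorer,1,0.27
Windows 8,IE,10,0.26
Windows 7,IE,8,0.26
Windows 7,IE,9,0.25
Windows 8,Opera,12,0.25
Linux,Firefox,52,0.25
Mac OS X,Firefox,53,0.24
Windows 7,Firefox,45,0.24
Windows 10,Firefox,57,0.24
Windows 7,Firefox,38,0.22
Windows 10,Firefox,47,0.21
"""


def os_family(os_name: str) -> str:
    """Collapse OS versions into a family (illustrative grouping, not from the source)."""
    if os_name.startswith("Windows"):
        return "Windows"
    if os_name in ("Ubuntu", "Linux"):
        return "Linux"
    return os_name


def share_by_family(csv_text: str) -> dict[str, float]:
    """Sum the '% of requests' column for each OS family."""
    totals: dict[str, float] = defaultdict(float)
    for os_name, _browser, _version, pct in csv.reader(StringIO(csv_text)):
        totals[os_family(os_name)] += float(pct)
    return dict(totals)


shares = share_by_family(SAMPLE)
# Print families from largest to smallest share of sampled requests.
for family, pct in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{family:10s} {pct:6.2f}")
```

Because the "Other" row is not broken down by OS, the Windows total from this sketch is a lower bound on the Windows share of the sample.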

Current Functionality Requirements

The following is a list of the current requirements for PDF rendering, both for single-article PDFs and for books. Requirements that differ from the current implementation are displayed in bold.

History

  • Rendering PDF articles and books from Wikipedia pages is handled by a service called OCG. When rendering "books" through the book creator, it uses OCG as embedded within the Collection extension. OCG has multiple issues, especially with tables.
  • Multiple issues with OCG are identified, including complaints from the community around OCG's inability to render tables.
  • Rendering of tables ranks as number 9 on the German-speaking Community Technical Wishlist.
  • Wikimedia Deutschland begins working on a solution for rendering tables in PDFs, and introduces Electron. They plan to run it alongside OCG, not to replace it.
  • At the same time as Wikimedia Deutschland is working on the Electron service, the responsible maintainers of the OCG service at the Wikimedia Foundation come to the conclusion that OCG has to be replaced.
  • The WMF Reading team takes over responsibility for the long-term maintenance of PDF rendering and begins planning to implement table rendering across all projects.
  • The Reading team launches a community consultation to gather feedback on Electron.
  • The Reading Infrastructure and Web teams begin scoping the work necessary to port OCG functionality over to the Electron service.

Update After Consultation

Proposed PDF and print styles based on feedback from consultation

We launched a consultation on the current implementation of the PDF renderer in early June, 2017. After reviewing the consultation responses, we have made the following observations:

  • A larger number of users preferred the single-column format over the double-column format
  • Users who preferred the double-column format said that their preference was based on the styling and the look and feel of double columns. Some users also expressed concerns about font size and wasted paper when printing PDFs in the single-column option
  • The following feature requests were made:
    • Functional hyperlinks
    • Date and URL, 'this page downloaded [date] from [URL]'
    • Customizable CSS for layout, title, and table of contents
    • Option for a 2-column format
    • Versions with or without images
    • Modifiable margins
    • Print by section, which allows removing references, unwanted paragraphs, the index, etc.
    • Configurable text size

Based on the feedback, we have incorporated the following into our new print styles:

  • Hyperlinks
  • Article information
  • Smaller font and book-like styling

The remainder of the requests above will be postponed until the second iteration of the PDF renderer, in which we plan to build a settings mode that will allow for customization of the available options.

Proposal

The following proposal defines the scope of the functionality required for PDF rendering:

  • Individual articles will be rendered to PDF using the "Download as PDF" link in the sidebar
  • Multiple articles will be rendered to PDF using the Book Creator tool
  • All articles will contain attribution for text and images
  • All PDFs rendered will be able to print tables
    • Users will be able to customize the layout of their PDF (optional)

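Differences between the current and future implementations

Feature                                   OCG                        New service  Notes
Rendering single articles
Multiple articles via the book creator
Table of contents for multiple articles
Rendering tables
Attribution                                                                       Open question: placement of attribution in the new service
Styles                                    LaTeX                      New styles
Multi-column layout                       Default two-column layout  In testing   Default one-column or two-column layout will be chosen based on feedback and quantitative and/or qualitative testing
Output formats                            PDF and plain text         PDF only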

Design

The new PDF styles will be designed for increased readability. Based on community feedback and qualitative or quantitative testing, support for a 2-column layout may be built for the book creator and/or for individual PDFs.

Development and Deployment Roadmap

The following is a rough outline of the development and deployment roadmap. It is subject to change.

  1. April-May 2017:
    1. The Reading team builds back-end support for the functionality identified above
    2. Communities are consulted on expanding or shrinking the proposed functionality
    3. Qualitative test performed for styling
  2. June-July 2017:
    1. New styles implemented
    2. First iteration is launched alongside OCG on all projects and performance is compared
    3. Iterations based on consultations and identified edge cases
  3. August-September 2017:
    1. Additional changes made if necessary
  4. October 2017:
    1. Second iteration launched without OCG on all projects

Single Articles

  • A PDF for a single article will be created by selecting the "Download as PDF" link
  • Upon selecting "Download as PDF", the PDF file will be generated. To download the file, users will select the "Download the file" link
  • Each PDF file will contain the following:
    • Article title and text
    • Infobox (if any)
    • Tables (if any)
    • Single-column layout
    • Page numbers
    • All article images and captions
    • Links to pages linked from the article (blue links and external links)
    • Text and image sources, contributors, and licenses

Phabricator Tracking

All PDF-related changes, including sunsetting OCG, replacing the Electron PDF renderer, and any updates to books or the Collection extension, are tracked under the Phabricator project Proton. The project page displays recent updates for all tasks related to PDFs.

Books

Functionality available in October, 2017

Note: no changes will be made to the current book creator workflow at this time

  • Users will launch the book creator by selecting "Create a book"
  • This will navigate to the current book creation page
  • To download a book, users will select the "download" link from the books page
  • Users may only download books in PDF format
  • Books will contain all elements from single article format as well as:
    • Book title page
    • The references for each article in the book will appear at the end of that article
    • Each article will start on a new page
    • A single section for text and image sources, contributors, and licenses, that contains the collected contributions from all articles
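
The book layout listed above can be pictured as a simple assembly step. The sketch below is purely illustrative and is not the renderer's actual code: the `Article` class, the `assemble_book` function, and the plain-string sections are hypothetical names and simplifications, and licenses and image sources are left out for brevity. Each returned section stands for content that starts on a new page.

```python
from dataclasses import dataclass, field


@dataclass
class Article:
    """Hypothetical, simplified stand-in for one article in a book."""
    title: str
    body: str
    references: list[str] = field(default_factory=list)
    contributors: list[str] = field(default_factory=list)


def assemble_book(book_title: str, articles: list[Article]) -> list[str]:
    """Return the book as a list of sections, each starting on a new page."""
    sections = [book_title]  # book title page comes first
    for article in articles:
        parts = [article.title, article.body]
        if article.references:
            # References stay with their article, at its end.
            parts.append("References")
            parts.extend(article.references)
        sections.append("\n".join(parts))
    # One attribution section collects contributors from all articles;
    # dict.fromkeys drops duplicates while keeping first-seen order.
    contributors = list(dict.fromkeys(c for a in articles for c in a.contributors))
    sections.append("\n".join(["Text and image sources, contributors, and licenses", *contributors]))
    return sections


book = assemble_book(
    "My book",
    [
        Article("Article A", "Text of A.", ["Ref A1"], ["Alice", "Bob"]),
        Article("Article B", "Text of B.", [], ["Bob", "Carol"]),
    ],
)
for section in book:
    print(section)
    print("-" * 20)  # stands in for a page break
```

Keeping references inside each article's section while merging contributors into one closing section mirrors the two different placements described in the list.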

Functionality available in November - December, 2017

  • Books will contain a table of contents with page numbers
    • Selecting a section from the table of contents will navigate the user to the corresponding section within the book

Styles for books will be updated for improved readability