Content translation/Machine Translation/Youdao/zh

从2016年10月31日起，有道机器翻译系统可以使用了，并且可以在使用内容翻译功能翻译维基百科页面时使用.

有道的服务有中国的一家互联网企业网易提供. 维基媒体基金会法务部和网易公司以签署合作协议. 这一协议将确保维基百科用户使用有道翻译服务时，维基百科的政策如署名权、用户隐私及品牌形象等方面不受影响. 协议的概要在本页列出，我们欢迎您对这一服务提出任何的建议或者意见. 这一合作得到了中文维基百科社群的支持.



主要功能
When Youdao’s service is used, they provide a translated version of Wikipedia content that maintains its free license. Users can modify it and publish it as part of Wikipedia without conflicts with existing policies. The resulting content translated by Youdao and the user modifications will be available under the same license that is used for the rest of the articles in Wikipedia. Translations obtained from Youdao and modifications made by users will be publicly available. The post-edited translations are of special interest for the translation research community who can use this resource to create new translation services to support languages for which open source machine translation is not available yet. This will help developers create and improve machine translation systems. Users have an option to disable it if they don't find it useful for some reason. The preferences allow users to decide whether they would like to use this service, or another available service for their language, or no machine translation services at all.
 * 有道不会收集任何个人信息.  内容翻译工具将通过有道的公开API使用机器翻译的服务.  Article content (freely licensed) is sent to Youdao servers from Wikimedia Foundation servers. No direct communication is happening between the user and external services and no personal information (IP, username) is sent to Youdao’s servers along with the Wikipedia content. The client contacting Youdao servers is open source and you can check it here. No part of Youdao's service or code will be part of Wikimedia infrastructure or Content Translation codebase
 * 有道翻译返回的内容依然为自由版权. 当使用有道的服务时，有道提供的机翻维基百科内容将仍然是自有版权的. 用户在翻译文本的基础上修改并发布在维基百科并不与任何现存政策冲突. 由有道翻译的内容以及用户的修改将于维基百科其他的条目一样处于相同的版权协议下.
 * 惠及更广泛的开源翻译社群. 通过有道翻译及用户修改的内容是公开的. 翻译研究社群对编辑后的翻译原文进行研究并以此为资源开发能够支援其他语言的开源翻译服务. 这将帮助开发者们开发和改进机器翻译系统.
 * 自动翻译是内容翻译工具的可选功能. 用户可以选择关闭自动翻译功能. 操作界面给用户提供了多种选择：是否使用这一功能、使用支援该语言的其他服务、或者不使用任何机器翻译功能.





已知問題
有道及其翻译服务并不能翻译富文本，例如HTML. 它接受纯文本内容，并输出纯文本翻译. 内容翻译通常在这种输出的基础上重新应用标记，但对于中文而言，由于语句中存在并不常见的标记化字词而并不可能. 因此翻译者将只收到纯文本的翻译，所有链接和参考资料等需要在翻译后手动添加. It accepts plain text content and outputs plain text translation. Content translation usually re-applies markup on top of this kind of output, but for Chinese, this is not possible because of non-trivial tokenization of sentence to words. Hence, translators will receive the translations only in plain text, and all links, references etc. will have to be added manually during translation.



与有道协议的摘要


有道的义务

 * 为维基媒体基金会提供免费的API以供志愿者在维基媒体站点中翻译条目.
 * 容许所有志愿者单次翻译不超过四千字节的内容，每日上限则为一千万字节. （远多于有道公开的机翻服务）
 * 向维基媒体基金会提供用户请求翻译的统计数据.


 * To license their API key for free to the Wikimedia Foundation to allow volunteers on Wikimedia sites to translate articles
 * To allow volunteers to translate up to four thousand characters per request and ten million characters per day (much more than their publicly available option)
 * To give Wikimedia statistical data on the quantity of characters in the requests sent



维基媒体基金会的义务

 * 向有道提供志愿者编辑后的机翻文本，有道可以此改进其服务：
 * 不会分享任何翻译者的用户信息.
 * 目前，仅向有道提供翻译的源文本、翻译语言和目标语言.
 * 无论翻译者是否用了机器翻译服务帮助翻译，其翻译都会由内容翻译API以平行语料库的形式提供. 这些API将逐步开发，结果将免费提供给所有人，不仅仅是有道.


 * To provide the volunteer-edited versions of the text translated by the translation tool so that Youdao can improve their tool
 * No personal data of translators will be shared.
 * Currently, just the original content to translate, its language, and translation target language are sent in the request to Youdao.
 * The translations published by translators, with or without the help of machine translation services, will be provided in the form of parallel corpora by content translation APIs. These APIs will be developed incrementally and results will be freely available for everyone, not just Youdao.



其他重点

 * 所有内容将保持 CC BY-SA 3.0 版权协议
 * 除了在内容翻译工具的下拉菜单中加入有道翻译作为机器翻译服务的选项之一外，有道不要求维基媒体基金会在任何情况为其推广或宣传.
 * 任何用户的信息都不会在双方间交换.
 * 协议期限为一年，我们将在到期后重新评估我们的需求.
 * 我们可以再任何时候（提前30日通知）出于任何理由终止与有道的协议.
 * 本协议适用美国法律.


 * All content will remain licensed under CC BY-SA 3.0
 * Youdao is not requiring any “branding” on Wikimedia Sites outside of listing Youdao as a translation tool option in the translation interface drop-down menu
 * There is no exchange of personal information of users
 * The agreement is limited to 1 year, at which time we can reevaluate our needs
 * We are free to terminate the agreement for any reason, at any time (with 30 days notice)
 * Agreement is governed by US law



对本服务的问答
我们在这里回答了一些关于有道翻译的常见问题，类似的问题也可以在“内容翻译”的FAQ页面中找到.

What languages are being handled by Youdao? Are there plans to add more?
Youdao is available at present only for translations Chinese, and English, French, Japanese, Korean, Portuguese, Russian and Spanish, for users who will be creating pages through Content Translation. As Youdao’s language coverage expands we will consider enabling them for Content Translation. Please note: Youdao machine translation will not be available when creating pages from Chinese to English.

How is using Youdao different than using other machine translation systems?
As a user of Content Translation you will not feel any difference on the translation interface as the machine translation system of Youdao will display the translated content similar to Apertium or Yandex. However, due to Youdao's current limitation of not supporting rich text, links, references etc. will have to be adapted manually.



如果我使用有道的服务，机器翻译的工作是如何完成的？
有道提供免费API供其他网站或服务使用其机器翻译系统. “内容翻译”工具将使用一个专门的API来使用有道的服务. 当用户开始翻译一个页面时，每个小节的HTML内容会被上传到有道服务器端，获取到的翻译版本会显示在“内容翻译”的对应翻译栏中. 链接及参考资料会像通常一样直接接受，用户可以进一步按需修改这些翻译版本. Content Translation also uses a unique API key to access this service on Youdao's server. When a user starts translating an article, the HTML content of each section of the source article is sent to the Youdao server and a translated version is obtained and displayed on the respective translation column of Content Translation. Links and references are adapted as usual and users can modify the content as required.

This process continues for all the sections of the article being translated. For better performance, the translations for consecutive sections are pre-fetched. The user can save the unpublished translation (to work on it again at a later time) or publish the article in the usual manner. The article is published on Wikipedia like any other normal article with appropriate attribution and licenses.

You can view a diagram of the process.



有道不是基于开源软件的. 我们为什么要使用它的服务？
“内容翻译”开发的目的是为了减少维基百科各语言间的差异. 正如其他用于维基媒体网站的软件一样，“内容翻译”是开源的. 在特定的情况下，我们会使用开源的软件引入外部的服务和自有版权的内容，来帮助我们的更好地扩展自由的知识.

Like all other software used on Wikimedia sites, Content Translation is also open source. In this particular case as well, we are using an open source client to interact with the external service and import freely licensed content in order to help users expand our free knowledge.

我们并未将有道或者Yandex的机器翻译系统整合在“内容翻译”的代码，或维基媒体的网站及服务中. 这项服务是免费提供给所有人使用. The service is free of charge and available for everyone.

Only the freely available Wikipedia article content (in segments) is sent to the Youdao service and the obtained translated content is also freely usable on Wikipedia pages. The translated content can be modified by users and this data also maintains its free license and is available publicly through the Content Translation API. This is a valuable resource made available for the community to develop open source translation services for those languages where they don't exist yet.

After studying the implications carefully, we found the fact that the content was stored previously in a closed source service does not limit the freedom of our knowledge or our software in the present or the future. We have taken special care to make sure that the content translated maintained its free license to make sure it complies with Wikipedia policies. This includes a long process for legal and technical evaluation and compliance. The summary of the terms of use is also available.

From user feedback we have seen that machine translation support is really helpful for users and we want to support all languages in the best way. Guided by the principles of Wikimedia Foundation's resolution to support free and open source software, we will prioritise the integration of open source services whenever they are available for a language. Apertium has been a critical part of Content Translation since its inception, and currently provides machine translations for nearly 70 of the numerous possible language combination that Wikipedia can support. Adding Yandex in November 2015 has helped a large group of users of nearly 70 more languages, who were unable to use this facility before with Content Translation.

Should I be worried about my personal information when using Youdao?
Irrespective of the service being used, you can be assured that only Wikipedia content from existing articles is sent and only freely licensed content will be added back to the translation. No personal information is collected and communication with those services happen at the server side, so they are isolated from the user device. Please refer to this diagram for more details.

What if Youdao is the only machine translation tool available and I don't want to use it?
Machine Translation is an optional feature in Content Translation that you can easily disable at will. If more machine translation systems are added for your languages, you can choose to enable MT again and select the MT service of your choice.

Will the content translated by Youdao be free for use in Wikipedia?
Yes. The Wikipedia content translated from Youdao is otherwise freely available on the Youdao web translation platform. Content Translation receives it via an API key to make it seamlessly available on the translation interface. This content can be modified by the users (if necessary) and used in other Wikipedia articles under free licenses.

Can this content be used for improving machine translation systems in general?
Yes. Translations made in Content Translation are saved in our database. This information will be made publicly available for anyone to use as translation examples to improve their translation services (from University research groups, open source projects to commercial companies, anyone!). The content can be accessed via the Content Translation API. Please note, only information related to translated text is publicly available. This includes – source and translated text, source and target language information and an identifier for the segment of text.