Content translation/Machine Translation/Google Translate/ja

コンテンツ翻訳機能で使える機械翻訳サポートが拡充されました. コンテンツ翻訳機能で利用できる機械翻訳システム（MT＝Machine Translation）にはこれまであったApertium、LingoCloud、Matxin、Yandex、Youdaoに加え、Google Translate が使えるようになりました. これにより対応言語は 100 超に増え、中にはこれまで既存のシステムでは未対応だった言語もたくさんあります.

Google Translate はGoogle – アメリカに本拠を置く多国籍企業より提供を受けます. ウィキメディア財団の複数の部門と Google との協働により、Google Translate の使用に際してウィキペディアの諸権利の継承、ユーザーの個人情報保護、ブランドの表現の方針を遵守する契約が成立しました. ''契約の詳細は をご参照いただき、このサービスに関する質問はぜひお問い合わせください. ''

主な機能

 * Google Translate には個人情報は一切わたりません.  MT システムへのアクセスは Google cloud サービス経由で成立します. 記事のコンテンツ（フリーライセンスの対象） はウィキメディア財団のサーバから Google のサーバへ送ります.  ユーザーと外部リンサービスは直接のやりとりをせず、一切の個人情報（IPアドレスと利用者名）は Google サーバに送信されません.  clientはオープンソースとして Google サーバにつながり、こちらで確認できます.  Google 側の一切のサービスもしくはコードはウィキメディアのインフラあるいはコンテンツ翻訳のコードベースに属しません.  細部については、この節の末尾に図がありますのでご参照ください.
 * Information is returned from Google Translate under a free license. When Google Translate is used, a translated version of Wikipedia content under a free license is obtained. Users can modify it and publish it as part of Wikipedia without conflicts with existing policies. The resulting content translated by Google Translate and the user modifications will be available under the same license that is used for the rest of the articles in Wikipedia.
 * Benefits the wider open source translation community. Translations obtained from Google Translate and user modifications will be publicly available. The post-edited translations are of special interest for the translation research community who can use this resource to create new translation services to support languages for which open source machine translation is not available yet. This will help developers create and improve machine translation systems.
 * 利用者は無効化できます. Automatic translation is an optional tool in Content Translation. Users have an option to disable it if they don't find it useful for some reason. Although many Content Translation users have requested for this translation service, each individual user may decide whether they would like to use it or not.

Google's obligations

 * Use of their Translation API key at no cost to the Wikimedia Foundation to allow volunteers on Wikimedia sites to translate articles and as many characters as needed.

Wikimedia Foundation's obligations

 * To provide the volunteer-edited versions of the text translated by the translation tool so that Google can improve their tool
 * No personal data of translators will be shared.
 * Just the original content to translate, its language, and translation target language will be sent in the request to Google.
 * The translations published by translators, with or without the help of machine translation services, will be provided in the form of parallel corpora by the Content Translation APIs. These APIs will be developed incrementally and results will be freely available for everyone, not just Google.

重要な注記

 * All content will remain licensed under CC BY-SA 3.0
 * Google is not requiring any "branding" on Wikimedia Sites outside of listing Google Translate as a translation tool option in the translation interface drop-down menu
 * There is no exchange of personal information of volunteers
 * The agreement is limited to 1 year, at which time we can reevaluate our needs
 * We are free to terminate the agreement for any reason, at any time
 * Agreement is governed by U.S. law

Questions about this service
We have addressed some immediate questions about Google in this section. This is also available in the Content Translation FAQ page.

What languages are being handled by Google Translate? Are there plans to add more?
Google Translate can be used to translate into all available languages except English, which is currently not enabled to use machine translation of any kind.

How is using Google Translate different than using Apertium?
As a user of Content Translation, you will not feel any difference on the translation interface as Google Translate will display the translated content in the same way Apertium currently does for the supported language pairs.

How is machine translation being done if I choose Google Translate?
Google Translate provides an API key that allows websites and other services to use their translation system. Content Translation also uses that unique API key to access Google Cloud services for Google Translate. When a user starts translating an article, the HTML content of each section of the source article is sent to Google Translate and a translated version is obtained and displayed on the respective translation column of Content Translation. Links and references are adapted as usual and users can modify the content as required.

This process continues for all the sections of the article being translated. For better performance, the translations for consecutive sections are pre-fetched. The user can save the unpublished translation (to work on it again at a later time) or publish the article in the usual manner. The article is published on Wikipedia like any other normal article with appropriate attribution and licenses.

Here’s a diagram of the process.

Google Translate is not based on open source software. Why are we using it?
Content Translation evolved from a long-standing need to bridge the gap in the amount of content between Wikipedias in different languages. Like all other software used on Wikimedia sites, Content Translation is also open source. In this particular case as well, we are using an open source client to interact with the external service and import freely licensed content in order to help users expand our free knowledge.

To use Google Translate we are not adding any proprietary software in the Content Translation code, or on the Wikimedia websites and servers. The service is free of charge as part of Google’s offering to the Wikimedia Foundation.

Only the freely available Wikipedia article content (in segments) is sent to the Google Translate and the obtained translated content is freely usable on Wikipedia pages. The translated content can be modified by users and this data is also available publicly under a free license through the Content Translation API. This is a valuable resource made available for the community to develop open source translation services for those languages where they don't exist yet.

After studying the implications carefully, we found the fact that the content was stored previously in a closed source service does not limit the freedom of our knowledge or our software in the present or the future. We have taken special care to make sure that the content provided is freely licensed to make sure it complies with Wikipedia policies. This includes a long process for legal and technical evaluation and compliance. The summary of our agreement is also available above.

From user feedback, we have seen that machine translation support is really helpful for users and we want to support all languages in the best way. Guided by the principles of Wikimedia Foundation's resolution to support free and open source software, we will prioritize the integration of open source services whenever they are available for a language. Apertium has been a critical part of Content Translation since its inception, but currently, it only provides machine translations for about 30 of the numerous possible language combination that Wikipedia can support.

Google Translate を使うときに自分の個人情報に危惧はありますか？
利用するサービスに関わらず、送信対象はウィキペディアの既存の記事の内容のみであり、訳文にはライセンスフリーの内容のみ戻されます. 一切の個人情報は収集されず、外部サービスとのやり取りはサーバ側で行われ、利用者の使用機器とは隔離されます. をご参照ください.

もしも機械翻訳ツールがGoogle Translate しか使えない状況なのに、これを使いたくない場合は？
コンテンツ翻訳における機械翻訳（MT）はツールの選択肢であって、自分の意思で簡単に無効にできます. ご使用の言語で使えるMTがさらに追加された時点で、再度この機能を有効にして、どのサービスを利用するか指定できます.

ウィキペディアでGoogleの機械翻訳を使うのは無料ですか？
無料です. Google Translate を介した翻訳するはウェブ上の翻訳プラットフォームで既に無償提供されています. APIキーを用いてサービスとつなぎ、翻訳インターフェイスでシームレスに使えるようにします. 利用者はこうして得た内容を（必要に応じて）改変し、ライセンスフリーの条件でウィキペディアの記事（複数）に使用できます.

この内容を機械翻訳システム全般の改良に利用できますか？
可能です. コンテンツ翻訳で作成した翻訳はウィキメディアのデータベースに保存します. この情報は全ての人に公開し、翻訳例を皆さんの翻訳サービス改善に利用できます（対象は大学研究グループから商用目的のオープンソースプロジェクトまですべて）. ただし公開の対象は翻訳文に関する情報に限定されます. その情報の範囲には – 翻訳の原文と訳文、翻訳原文の言語と訳出した言語の情報、さらに文の断片識別子を含みます.

関連項目

 * Project Glow FAQ - ウィキメディア財団の良くある質問 - Google との提携関係
 * インド諸語のウィキペディア支援プログラム - ウィキメディア財団と Google の提携により、特定のパイロット事業で Centre for Internet and Society (CIS)、ウィキメディアのインド協会WMIN＝Wikimedia India chapter）ならびに利用者グループとともにウィキペディアのコミュニティ群でローカルの事情に即したインド諸語の高品質なコンテンツ作成を支援します. この事業は通称をプロジェクト・タイガーとします.
 * コンテンツ翻訳関連のよくある質問
 * さらに機械翻訳サービスを増やす Phabricator タスク