Content translation

The content translation tool is aimed to improve multilingual contribution. It is a Computer Assisted Translation (CAT) tool aimed to make translation of Wikipedia articles easier and better, and to collect a corpus of parallel text. It complements the existing Translate extension.

Translating content present in one language version of Wikipedia to another Wikipedia is a process that involves different steps and tools. Creating a new Wikipedia page based on an existing one from a different language normally requires the use of automatic translation services, dictionaries, reformatting text, tweaking links and references, and a lot of tab switching. This is a process that can be streamlined a lot so that translators spend their time creating high-quality content that reads naturally in their language.

Most Wikipedia users edit only one language edition. However, just over 15% of users also edit multiple language editions. These multilingual users were found to be more prolific than their monolingual counterparts, making about 2.3 times as many edits on average.

Purpose of the tool
Content translation will allow you to create an initial version of a Wikipedia page based on an existing version from a different language. so the main scenario we are addressing can be expressed as "A user creates a new page for language X based on language Y, where languages X and Y can be any pair of different languages".

The tool is targeted to users that know two or more languages, and provides value for different kinds of users. For new users, the tool will provide a new way of contributing that is easier than creating a new page from scratch. or experienced editors, the tool will simplify the process of translating content.

The goals for the tool are the following:
 * Save time. Help translators to create content quickly, without unnecessary copy&paste to external tools.
 * Provide assistance. Prevent errors, and make the user confident of their translations.
 * Encourage quality of translations. The tool should communicate properly the purpose of translations in Wikimedia context and help the user to avoid bad quality translations.
 * Don’t force the user. Since different translators follow different editing patterns, the aids provided should not intrude into the editing process.
 * Focus on content. Translation is more focused on content than text styling. Technical elements such as wikitext should be dealt in a way that do not make the translation harder.

More details about the analytics considered are available.

Initial ideas
During the design stage many ideas have been explored. Some ideas are collected below but the final list may be different depending on the feedback we receive.



Translation center
View prototype (Try to press New translation and search for "Mies van der rohe")
 * Initiate translations
 * List of translations and suggestions

Translation view
View prototype More design details
 * Three-column layout.
 * Automatic translation as template.
 * Warn automatic translation abusers.
 * Adaptation of links and references.
 * Information about words and suggestions for alternatives.

Additional entry points
More design details
 * New translation from page (view prototype)
 * New translation from "contributions" (view prototype)
 * New translation from Wikidata (view prototype)

Future plans
Some aspects are not part of the scope for the first iterations of the tool, but we have them in mind for the future. The include:
 * Real-time collaboration on a shared translation.
 * Add content to an existing translation.

How to participate

 * You can volunteer for our testing sessions.
 * Provide feedback at the talk page.

Development

 * The extension is being developed. Extension:ContentTranslation
 * Architecture