Help:Content translation/Translating/Translation quality/zh

创建翻译时，必须在发布内容之前审阅内容. 您需要确保生成的内容不会改变原始含义，并检查它在目标语言中是否自然. 提供的初始机器翻译有助于加快翻译过程的起步，但该工具鼓励用户仔细查看和编辑初始内容.

不同的机制保证翻译人员适当地编辑初始翻译. 翻译编辑器会检查有多少初始机器翻译被用户修改过，并设置差别阈值，以决定是否阻止用户发表译文，或者只是对其发出警告，建议对内容做更进一步的修正.

这样做可以使用户正确使用机器翻译，而不是生成未经认真审阅的低质量内容. 下面将进一步解释关于差别阈值的细节，如何针对各种语言的具体情况设置差别阈值，以及怎样评价机器翻译结果的质量.

阈值鼓励译文审阅
检查用户对初始机器翻译结果进行修改的百分比. 系统由此得知初始翻译被修改，删除，或者添加了多少单词. 系统在段落和全文两个层面上进行检查. 接下来解释每个层面上的差别阈值

全文范围的阈值
You cannot publish a translation with 99% or more of unmodified contents for the whole document. This limit is intended to block the most clear vandalism. This prevents users from just adding paragraphs to the translation and publishing them with no edits at all. As detailed below, this limit can be adjusted on a per language basis.

每个段落的限制
For each paragraph, the percentage of user modifications is also measured. A paragraph is considered problematic when the paragraph contains more than 85% of the initial machine translation (or, when copying the contents from the source document, it contains more than 60% of unmodified content).

The translation editor will show a warning for each paragraph that is considered problematic, encouraging the user to edit it further. In some cases users are still able to publish, but the resulting page may get added to a tracking category of potentially unreviewed translations for the community to review. In other cases, users may not be allowed to publish.

These are some of the factors considered for determining whether to allow the user to publish or not (some of these are still in development):

Users can still publish translations with less than 50 problematic paragraphs, but translations with 10 to 49 problematic paragraphs will be added to a tracking category of potentially unreviewed translations for the community to review. In those cases, translations with 10 problematic paragraphs or more will be prevented from publishing, while those with 9 or less problematic paragraphs will be added to a tracking category of potentially unreviewed translations for the community to review. For paragraphs where the unmodified content warning was shown but the user marked it as resolved, we apply a less strict threshold (accepting 95% of Machine translation or 75% of source content). This will provide a way to accommodate cases where the automatic translation was exceptionally good, but still avoid potential abuse of the feature (i.e., not following blindly the user confirmation).
 * The number of problematic paragraphs. Users are prevented from publishing translations with 50 or more problematic paragraphs.
 * Previous deleted translations. For users with some translations deleted in the last 30 days, the limits will be much more strict to prevent recurring problems.
 * User confirmation. A less strict threshold is considered for paragraphs that users marked as resolved, as a signal that the user reviewed and confirmed the status of the translation.

Contents not affected by the limits
Some contents are not expected to be edited significantly, and they are not considered when applying the limits described above. Very short section titles, citations, or the list of references are excluded from the checks. Otherwise, users may get misleading warnings because of contents such as the book titles in their references that they were not expected to translate.

调整限制
上述限制提供了一组通用机制，但它们可能需要调整每个维基的特定需求. 根据初始评估，初始机器翻译所需的修改量可以在10％到70％之间，具体取决于语言对. 在某些维基上，默认限制可能过于严格，会产生不必要的干扰或阻止发布完全有效的翻译. 在其他维基上，限制可能不够严格，允许发布编辑不够的翻译.

调整不同的阈值允许根据每个维基的需要或多或少地严格修改这些限制. 母语人士的反馈对于正确调整限制至关重要. 如果根据您创建或审核翻译的经验，目前的限制似乎不能很好地运作，请s分享您的反馈，我们可以探索如何更好地调整它们.

When providing feedback about adjusting the thresholds we recommend trying to create several example translations (make sure to check the publishing options if your test is not intended to be published as regular content). When testing how the limits work for your language, it is useful to keep in mind the following:

In this way you can more easily find the right balance for the limits. Checking only one type of problem can lead you to suggest moving the thresholds too far in the opposite direction. For example, content full of numeric data or technical names may require less editing by users than content with more descriptive text. Make sure to test with different articles, making translations of different lengths and involving different types of content. It may require to make a custom adjustment to the thresholds or improve the general approach. In any case, after each change further testing may be needed to verify the improvements.
 * Check for both cases. Make sure to check how the limits work for translations where the content has not been edited enough and also for those where the initial translation has been edited enough.
 * Check different contents. Content in a wiki is very diverse, and machine translation may work much better for some cases compared to others.
 * Prepare to iterate. Adjusting the thresholds is an iterative process.

Adjusting the limits in collaboration with editors has proven to be effective. For example, initial results show that the Indonesian community has reduced significantly the number of problematic translations by restricting the publication of translations with more than 70% of unmodified machine translation. Similar adjustments have been made for Telugu and Assamese There is no automatic tool that is infallible, and these limits are not an exception.

The process of content review by the community is still essential, but these limits provide communities with tools to reduce the number of translations they have to focus on, making the review process much more effective. Please share your feedback and we can explore how to better adjust them.

Tracking potentially unreviewed translations
A tracking category with the name "cx-unreviewed-translation-category" is provided for the community to easily find the articles that were published with some content exceeding the recommended limits.

You can find this category in the list of tracking categories on each wiki. There you can find the articles that passed the limits that prevent publishing but had still some paragraphs that have been edited less than expected. For example the Indonesian category includes articles that have less than 40% of machine translation overall but have some paragraphs with more than 80% of unmodified machine translation.

衡量翻译质量
自动评估内容质量并非易事. 删除率提供了一个有用的估计，即创建的内容是否足够好以便编辑所在的社群不会删除它. Based on the analysis of deletion ratios, articles that are created as translations are less likely to be deleted compared to an initial version of the article created from scratch. This suggests that it may not be practical to set the limits for participation by translating much higher than those set for other ways of article creation.

Find published translations
Content translation adds a contenttranslation edit tag to the published translations, for communities to be able to focus on the translations created with the tool using Recent changes and similar tools. In addition, data on published translations and the statistics for machine translation use are available for anyone to analyze.

Inspect a specific translation
The Translation debugger is a tool that allows to inspect some metadata for a given translation, including the percentage of machine translation used for the whole document or the translation service used for each paragraph.

基于用户权限的其他限制


一些维基已经基于用户权限实施了其他翻译限制，以减少低质量翻译的创建. For example, English Wikipedia requires users to be extended confirmed, which means they need to make 500 edits on English Wikipedia before they are allowed to publish a translation as an article. Newer editors can still publish translated articles in the  or   namespaces, and then move the article to the mainspace.

This restriction was created before the system of limits described in this page was available, and it is not the recommended approach to encourage the creation of good quality translations.

Before adding restrictions that do not take into account the content created, consider going through the process of adjusting the limits of unmodified content as described above. The limits can be made as strict as needed to prevent low quality translations, while still allowing editors making good translations to publish them.