Content translation/Machine Translation/MinT/id

MinT (Machine in Translation) is a translation service based on open source neural machine translation models. The service is hosted on Wikimedia Foundation infrastructure and will become part of the list of machine translation systems available to users of Content Translation and other Wikimedia projects. The translations provided are based on the NLLB-200, OPUS, IndicTrans2 and Softcatalà translation models, which have been optimized for performance using the OpenNMT CTranslate2 library in order to avoid the need for GPU acceleration. For more details you can check the source code, the API spec, and a test instance.



Key features

 * No nonpublic personal information of users is sent to MinT. The MT system is accessed via an API. Article content (freely licensed) is sent to the MinT server; there is no direct communication between the user and external services, and no nonpublic personal information of users (IP address, username) is sent to the MinT service. The client that contacts MinT is open source and available for inspection. Although the MinT service is hosted in Wikimedia infrastructure, the integration follows the same pattern as other external services (please also see the diagram of this technical setup at the end of the section).
 * Any copyrightable information is returned from MinT under a free license. When MinT is used, a translated version of Wikipedia content is obtained. The copyrightability of such machine-generated content is an open legal question. To the extent that MinT translations are copyrightable, these translations are available under the same free license as the Wikipedia content being translated. Users can modify them and publish them as part of Wikipedia without conflicts with existing policies. The translated content produced by MinT and the users' modifications will be available under the same license used for other articles on Wikipedia.
 * Benefits the wider open source translation community. Translations obtained from MinT and user modifications will be publicly available. The post-edited translations are of special interest for the translation research community who can use this resource to create new translation services to support languages for which open source machine translation is not available yet. This will help developers create and improve machine translation systems.
 * Users can disable it. Automatic translation is an optional tool in Content Translation. Users can disable it if they do not find it useful. Although many Content Translation users have requested translation services, each individual user ultimately decides whether to use them.

[Diagram: technical setup of the MinT integration]



Questions about this service
We have collected some frequently asked questions about MinT in this section. They are also available on the Content Translation FAQ page.



Which languages does MinT handle? Are there plans to add more languages?
MinT is designed to host several open translation models. The number of supported languages depends on those models. The page listing the available machine translation systems contains the up-to-date list.
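As a rough illustration of how the supported languages follow from the hosted models, the sketch below computes the available language pairs as the union of the pairs each model covers. The model names come from this page, but the `MODEL_PAIRS` table, the language pairs in it, and both helper functions are made-up placeholders for illustration, not MinT's actual configuration or code.

```python
# Hypothetical coverage table: model name -> set of (source, target) pairs.
# The pairs are illustrative placeholders, not MinT's real configuration.
MODEL_PAIRS = {
    "nllb200": {("en", "id"), ("en", "ig"), ("id", "en")},
    "opus": {("en", "id"), ("fi", "en")},
    "softcatala": {("en", "ca"), ("ca", "en")},
}


def supported_pairs(model_pairs):
    """Return every (source, target) pair covered by at least one model."""
    pairs = set()
    for covered in model_pairs.values():
        pairs |= covered
    return pairs


def models_for(model_pairs, source, target):
    """Return the sorted names of the models that cover one language pair."""
    return sorted(
        name
        for name, covered in model_pairs.items()
        if (source, target) in covered
    )
```

Under these assumptions, adding a model to the table is enough to extend the set of supported pairs, which is why the list of languages tracks the models rather than the service itself.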

How is using MinT different from using Apertium or other services?
As a user of Content Translation you will not notice any difference in the translation interface, because MinT displays the translated content in the same way Apertium and other services currently do for the supported language pairs. Different services provide different levels of translation quality depending on the language and the specific content. You can switch among the available services to find the one that provides the best initial translation for a given paragraph.

How is the machine translation being done if I choose MinT?
When a user starts translating an article, the HTML content of each section of the source article is sent to MinT. The MinT service processes the request and uses one of the translation models available based on the supported language and configuration. A translated version is obtained and displayed on the respective translation column of Content Translation. Links and references are adapted as usual and users can modify the content as required.

This process continues for all the sections of the article being translated. For better performance, the translations for consecutive sections are pre-fetched. The user can save the unpublished translation (to work on it again at a later time), revise, or publish the article in the usual manner. The article is published on Wikipedia like any other normal article with appropriate attribution and licenses.
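The section-by-section flow with pre-fetching described above can be sketched as follows. This is a minimal illustration, not the Content Translation implementation: `fake_translate` is a hypothetical stand-in for the request to the translation service, and the `lookahead` parameter is an assumption about how many upcoming sections are requested in advance.

```python
from concurrent.futures import ThreadPoolExecutor


def fake_translate(section_html):
    """Hypothetical stand-in for a request to the translation service."""
    return "[translated] " + section_html


def translate_with_prefetch(sections, translate=fake_translate, lookahead=1):
    """Yield translated sections in order, pre-fetching the following ones.

    While the caller works on section i, up to `lookahead` later sections
    are already being translated in background threads.
    """
    with ThreadPoolExecutor(max_workers=lookahead + 1) as pool:
        pending = {}
        for i in range(len(sections)):
            # Schedule the current section plus the look-ahead window.
            for j in range(i, min(i + lookahead + 1, len(sections))):
                if j not in pending:
                    pending[j] = pool.submit(translate, sections[j])
            # Wait only for the section the user needs right now.
            yield pending.pop(i).result()
```

Iterating over `translate_with_prefetch(["<p>a</p>", "<p>b</p>"])` yields the sections in their original order, while the second section is already being translated when the first one is returned.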

The following diagram shows the process.



Is MinT based on open source software?
The MinT service is open source and it integrates models that are released as open source:


 * The AI research team at Meta released the NLLB-200 translation models under an open source license, together with the dataset used for training, as part of the No Language Left Behind project.
 * The OPUS project provides pre-trained neural translation models trained on OPUS data with an open source license.

These models have been optimized for performance using the OpenNMT CTranslate2 library, which is also open source.

Content Translation evolved from a long-standing need to bridge the gap in the amount of content between Wikipedias in different languages. Like all other software used on Wikimedia sites, Content Translation is also open source. In this particular case as well, we are using an open source client to interact with the external service and import freely licensed content in order to help users expand our free knowledge. To use MinT we are not adding any proprietary software in the Content Translation code, or on the Wikimedia websites and servers.

Should I be worried about my personal information when using MinT?
Irrespective of the service being used, you can be sure that only Wikipedia content from existing articles is sent and only freely licensed content will be added back to the translation. Communication with those services happens at the server side, so they are isolated from the user device and they have no access to nonpublic personal information of users. Please refer to this diagram for more details.
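To make the server-side isolation concrete, the sketch below builds a request body that forwards only the article content and the language pair. The field names in `ALLOWED_FIELDS` and the `build_request_body` helper are assumptions for illustration; the real request format is defined by the MinT API spec and may differ.

```python
import json

# Field names assumed for illustration; the real request format may differ.
ALLOWED_FIELDS = ("source_language", "target_language", "content")


def build_request_body(source_language, target_language, content, user=None):
    """Build the JSON body the server would send to the translation service.

    `user` (for example a username or IP address) is accepted only to show
    that it is deliberately dropped: nothing outside ALLOWED_FIELDS is sent.
    """
    candidate = {
        "source_language": source_language,
        "target_language": target_language,
        "content": content,
        "user": user,  # known to the server, but never forwarded
    }
    body = {key: candidate[key] for key in ALLOWED_FIELDS}
    return json.dumps(body, sort_keys=True)
```

Because the body is assembled on the server from an explicit list of fields, the user's device never talks to the translation service and no user details can leak into the request by accident.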

What if MinT is the only machine translation tool available and I don't want to use it?
Machine Translation is an optional feature in Content Translation that you can easily disable at will. If more machine translation systems are added for your languages, you can choose to enable MT again and select the MT service of your choice.



Is the content translated by MinT free to use on Wikipedia?
Yes. The content received from MinT is otherwise freely available on the web translation platform. For ease of use Content Translation receives it via an API to make it seamlessly available on the translation interface. This content can be modified by the users (if necessary) and used in Wikipedia articles under free licenses.



Can this content be used to improve machine translation systems in general?
Yes. Translations created in Content Translation are stored in our database. This information will be publicly available for anyone to use as translation examples to improve their translation services (from university research groups and open source projects to commercial companies, everyone!). The content can be accessed through the Content Translation API. Please note that only information related to the translated text is publicly available. This includes the source text and the translated text, the source and target language information, and the text segment identifiers.
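As an illustration of how this public data could feed a translation system, the sketch below models one translation example with the fields listed above and extracts training pairs for a given language pair. The `TranslationSegment` class, its field names, and `to_training_pairs` are hypothetical; they do not mirror the actual response format of the Content Translation API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TranslationSegment:
    """One public translation example (hypothetical field names)."""

    segment_id: str
    source_language: str
    target_language: str
    source_text: str
    translated_text: str


def to_training_pairs(segments, source_language, target_language):
    """Return (source_text, translated_text) tuples for one language pair."""
    return [
        (seg.source_text, seg.translated_text)
        for seg in segments
        if seg.source_language == source_language
        and seg.target_language == target_language
    ]
```

A research group could, for example, collect such pairs for a language pair that has no open source machine translation yet and use them as parallel training data.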