Extension:PronunciationRecording/GSoC 2013

=GSOC Project Idea : Pronunciation Recording Extension=

I have been engaged with Mediawiki since 2 months now and since GSOC 2013 is up soon i felt its the right time to draft a Rough Proposal

Introduction

 * In Wiktionary many words have pronunciation audio files(.ogg) attached with them, these audio files tell the user how to pronounce a word in specific language. Same words are pronounced differently in different parts of the world. Example : The word Garage is spoken differently all around the world [Garage]
 * The word "minute" is pronounced differently when it is spoken in the time context as compared to when it is spoken in the quantity context, such words are called hetronyms. The audio files attached to each of the etymologies clearly depict this difference.
 * But there are several words that do not have audio files attached to them. Conducting a rough survey I found out that words used extensively in a particular discipline i.e medicine, mathematics, etc don't have audio files attached to them. Example : aggravate, compendium


 * It would be nice if we could have an extension that records pronunciations.
 * There is a thread in the mailing list requesting for a Pronunciation Recording Tool.

Simple workflow

 * The workflow basically consists of 4 steps
 * 1) A Record Pronunciation link is displayed on the Wiktionary page of a word that does not have a pronunciation file attached to it.
 * 2) When the user clicks on the Record Pronunciation link a dialog box pops up. The dialog box basically consists of 4 parts :
 * 3) The Recording Toolbar : It essentially consists of a user friendly toolbar that would help the user to record pronunciations . It essentially consists of buttons like "Record", "Stop", "Play", "Reset". The description of each button is fairly self-explanatory . The Recording Toolbar is not shown in the snapshot, the words Recording Toolbar will be replaced by a working toolbar . The user will get a maximum of 5 seconds in which he can record the pronunciation.
 * 4) IPA : This section consists of the IPA of the word that the user wants to record . It will assist the user in pronouncing the word correctly.
 * 5) Choosing a License : To upload a file to Wikimedia Commons requires licensing . If the file the user wishes to upload is his/her own work then he/she can choose from a variety of licenses . When the user clicks on the "This file is my work", then automatically the radio buttons to the 3 licenses are activated and the radio button corresponding to "This file is not my work" is deactivated .This applies vice-versa too.
 * 6) Upload Button : On clicking this button the file is uploaded to Wikimedia Commons a with a specific file name like en-minute.ogg. For a different etymology of the same word the file name will be en-minute-1.ogg and for a different language the file name will be fr-minute-1.ogg.
 * 7) The Success and Thank you Note : After the user clicks the upload button if the file is successfully uploaded to commons then a dialog box confirming that upload was succesfull will be displayed .This dialog box also consists of a small Thank You note.
 * 8) Refresh The Page : After the user closes the dialog box, he/she must refresh the page so that the pronunciation file is embedded in the wiktionary page.


 * We will be using the word aggravate as a reference ,since it does not have a pronunciation file attached to it. The workflow that I described is illustrated through a UI mockup.

Benefits

 * For words having a particular background (mathematics, biology, etc) experts from that field can record pronunciations.