User:Hardik95/GSoC 2012 Application

(Project title)

 * Public URL : Parser_migration_tool
 * Bugzilla report : Bug Report
 * Announcement :

Name and contact information
Name: Hardik Juneja Email: hardikjuneja.hj@gmail.com IRC or IM networks/handle(s): hardikj Location: India Typical working hours: 12am to 3am until August, 6pm to 2am after August

Project Summary
This GSOC project aims at enhancing the communication of parsoid developers with wikipedia editors through a tool that generates information about all the broken and deprecated wikitext lying all around pages on wikipedia. This will be done by creating a generator for an existing tool CheckWiki that will find issues for the tool and also feed fixup information that will be generated using parsoid. Since parsoid has got lot of fixup information that can help the wikipedia editors to know where broken wikitext is and how they can fix it, this tool might be quite useful for the Community.

The project aims at implementing a generator which would have following features -


 * 1) Finding issues like broken and deprecated wikitext and reporting them to checkwiki
 * 2) Generating fixup information for each issue using parsoid
 * 3) Feeding this information to checkwiki or provide a web service for checkwiki to pull data.

Project Scope

 * 1) Finding issues
 * 2) * Using some infrastructure of logging setup that is used to log production errors and also for  tracing and debugging during development.
 * 3) * Creating log events when particular issue is found
 * 4) Generating fixup information -
 * 5) * Planning the database structure and Create a database.
 * 6) * Creating an interface that listen to the log events and save it into a database.
 * 7) Feeding this information / provide a web service -
 * 8) * Creating web API’s for check wiki so they can pull data from our database
 * 9) * Creating a database sync service that will keep both database in sync
 * 10) Filtering and optimization -
 * 11) * Filter and optimize the process of Generation of issues
 * 12) * Generate fixup information for hard problem like balanced/unbalanced templates using parsoid