User talk:YuviPanda/Archive 1

GSoC proposal
I know we've already talked about this in IRC, but just so you have the link: In order to have your proposal considered for Google Summer of Code, you need to formally submit it at http://www.google-melange.com/gsoc/org/google/gsoc2011/wikimedia. The deadline is the 8th of April. Please ask us for feedback in #mediawiki on Freenode or via the wikitech-l mailing list, then submit! Thanks. Sumanah 21:22, 31 March 2011 (UTC)

Suggestions
"It will still be a (parse assessment data -> store in db -> display) based architecture," I'm not sure about this - let's leave this a little more open-ended. Like we talked about this morning, I think it would be great to approach this from a perspective having n possible assessment aggregators - creating a pluggable assessment parsing system and shipping the initial release with the same (at least from a user-facing perspective) functionality present in the WP 1.0 bot. Part of the planning and development work will be determining what the most sensible way of handling the assessment data found in the assessment templates will be. Otherwise, I think this is off to a good start. Awjrichards 21:37, 5 April 2011 (UTC)

Also, there is some additional functionality that would be really great to include in an initial release of this extension. Here are some user stories describing the desired features: Let me know if you have any questions or concerns about these, or any reservations about being able to complete all of this by the end of GSoC. I should be on IRC probably around 8am PDT tomorrow (6 April) if you want to chat, or shoot me an email. Awjrichards 00:19, 6 April 2011 (UTC)
 * 1) As a Wikipedia offline creator, I want to create sub selections of existing collections so that I don't have to start from scratch each time.
 * 2) As a Wikipedia offline creator, I want to be able to add new articles to sub selections so that I can append anything that might be missing.
 * 3) As a Wikipedia offline creator, I want to be able to mark articles as 'child appropriate' so that inappropriate content doesn't surface.
 * 4) As a Wikipedia offline creator, I want to be able to filter out certain categories so that inappropriate content doesn't show (Adult articles).

Support / workflow diagram
As the current maintainer of the WP 1.0 bot I support this project fully. I will be happy to consult with this project to share my experiences with the offline releases I have helped with.

The workflow at http://i.min.us/imFAWI.png isn't quite accurate. The difficulty is not just making a list of articles, the more difficult part is choosing a good revision ID for each of those articles. The actual workflow for creating the Wikipedia 0.8 release was this:


 * 1) Use the WP 1.0 bot to make a preliminary selection of articles (this requires some manual work from User:CBM)
 * 2) Other volunteers use WikiTrust and other tools to choose a "good" revision ID for each article. They send these to CBM.
 * 3) CBM imports these selected revision IDs into the WP 1.0 database.
 * 4) We announce the selections to the WikiProjects and give them time to suggest changes, additions, and removals.
 * 5) Other volunteers manually go through the suggestions and implement them as appropriate, using a web tool.
 * 6) User:CBM makes the final list containing selected articles and revision IDs.
 * 7) That list goes to other volunteers who use semi-automated tools to create the ZIM file.

We don't directly edit CSV files much; they're mostly used just to move data between one database and another. I don't handle anything about packaging or rendered HTML. I just help create the list of selected articles and revision IDs, using the WP 1.0 bot's tools. – Carl &middot; CBM 21:11, 7 April 2011 (UTC)
 * I'm another 1.0 person, one of the "volunteers" Carl mentioned above. I also fully support this, and I also completely agree with Carl's correction above.  We need a bit of manual intervention as a "reality check", both for revisionIDs and actual articles.  Other than that, your proposal looks very nice.  Any tools that can make the manual steps (4 & 5 above) faster would be appreciated.  Thanks, 173.20.132.189 05:00, 8 April 2011 (UTC) (Walkerma after logging in)

Hello there
I am a Pune wikipedian presently working on assembling a set of articles for Wikipedia for Schools (offline version - Indian edition). Your project interests me very much as everything I do in this regard is manually. Do tell me where I can watch your progress. If you want me to beta-test something, that will be fine. AshLin 08:59, 26 April 2011 (UTC)

Congrats
Congratulations on being accepted for GSOC. I would be happy to help (as much as my schedule will allow) over the summer. I can definitely explain how the current system works and why certain decisions were made over time. Please feel free to contact me at cbm.wikipedia@gmail.com any time; I'd prefer to communicate by email, or over IRC. If you don't have an account on the toolserver, you should probably request one. Once you have an account, I will be happy to give you access to the current WP 1.0 system there so you have access to its code and database. I'm looking forward to seeing what you can do with the system this summer. CBM 01:09, 6 May 2011 (UTC)