Google Books, Internet Archive, Commons upload cycle/Progress

(Automation Tool) Google Books > Internet Archive > Commons upload cycle

 * Public URL: [//www.mediawiki.org/wiki/Google_Books,_Internet_Archive,_Commons_upload_cycle //www.mediawiki.org/wiki/Google_Books,_Internet_Archive,_Commons_upload_cycle]
 * Bugzilla report: Bug - 57813
 * Hosted on tools-lab: http://tools.wmflabs.org/bub/
 * Maintained on github: https://github.com/rohit-dua/bub

Goals for the first half of the internship

 * Create the front-end for the web-tool to be hosted on tools-lab server.
 * Develop bot that handles queries in database (time-out deletion/queue handling/IPC-communication messages).
 * Extract meta-data from Google-Books and introduce system to check if book already present in IA(Internet Archive).
 * Create script to download from Google-books.
 * This will be done by extracting individual page image, and then converting'em to pdf.

Communication plan

 * I find IRC a quick way to contact to my mentors.
 * Email will be used when mentors are not available.
 * plan to have involvement of interested parties for testing/ suggestions.
 * For this announcement on wikitech-l, wikisource-l, commons-l, will be made.

Lessons learned since 21st April

 * Every task becomes a piece of cake, if you love doing it.
 * For queries, google cannot be as good as a real-time chat/email with someone experienced.
 * Before the core-coding, the set-up work does take a lot of time and edits.
 * Discussions and feedback make thing better.

Before Week 1

 * Started the fronted development of the tool (the web face.)
 * Using bootstrap
 * Shifted the workspace to tools-lab.
 * Linked the github repo. to the tools folder.
 * Examined the code-base.

Week 1: May 19 to May 25

 * University Examinations
 * Familiarized myself with tools-lab.

Week 2: May 26 to June 2

 * Worked on the back-end python script.
 * Added script to verify Commons Name and the Google-books ID.
 * Cookie/session handling
 * Linked the DB to the tool.
 * Set up a cron-job to delete unconfirmed requests.
 * The tool can now be tested(for the frontend only) here
 * Understood the redis-queue implementation.