Extension:RDFIO/Template matching for RDFIO

Visual Import and Template Matching for RDFIO

 * Public URL: https://www.mediawiki.org/wiki/User:Zahara/OPW_Proposal_Round_8
 * Bugzilla report: Bug 61999
 * Announcement: http://lists.wikimedia.org/pipermail/wikitech-l/2014-March/075286.html

Name and contact information

 * Name: Ali King
 * Email: 1.alison.king[at]gmail.com
 * IRC or IM networks/handle(s): zahara
 * Web Page / Blog / Microblog / Portfolio: @ali_king
 * Resume (optional):
 * Location: Edinburgh, Scotland
 * Typical working hours: 9:30-18:00 BST (UTC + 1)

Synopsis
Visual display of RDF data being imported using the Extension:RDFIO extension, showing entity relationships and how the entities will be matched to template types. Where the default match is incorrect, the user can select an alternative type before import.

This requires the initial mapping of the most common ontologies (such as FOAF) to Semantic Mediawiki template types. Other specialised ontologies reference these common type in the definition of their own types, which can be used to select likely candidate templates. User input to match these types will then inform the mapping of these ontologies to the templates.

The main benefit of this is to open up the extension to less technically skilled users who may not be used to manipulating RDF data, and encourage the use and import of pre-existing data stores.

Joel Sachs, Samuel Lampa
 * Possible mentors:

Deliverables
Please describe the details and the timeline of the work you plan to accomplish on the project you are most interested in (discuss these first with the mentor of the project):

About you
In September 2012 I quit my public sector job in order to become a professional programmer. I secured a mentor and began working at a startup company on projects using JavaScript, jQuery, XSLT, Ruby, Rails and other gems.

I also gained my first contract work doing report development work in MS SQL Server for the National Health Service. This involved database design, visual dashboard design, training staff in SQL, and remote working.


 * Education completed or in progress:

(all completed - currently undertaking informal learning)

Computing for Data Analysis (Coursera, completed with distinction) CIW Database Design, Web Languages (JavaScript & Perl), Web Design, Web Foundation Studied for a BEng in Electronics with Music at the University of Glasgow.


 * How did you hear about this program?

Tweet from the Ada Initiative


 * Will you have any other time commitments, such as school work, another job, planned vacation, etc., during the duration of the program?

No - I am a contract IT worker, and my current contract will have finished by the time the program begins. I do not plan to start my next contract until September.


 * We advise all candidates eligible to Google Summer of Code and FOSS Outreach Program for Women to apply for both programs. Are you planning to apply to both programs and, if so, with what organization(s)?

I am not a student so am not eligible for GSoC

Past experience

 * Please describe your experience with any other FOSS projects as a user and as a contributor:

I have not previously worked as a contributor to open source, although it's something I've been wanting to do for a while, and now I feel my skills and experience are up to the job. I have however done research into the feasibility of using open data exchange formats, and attended Open Knowledge Foundation events in Edinburgh to this end.


 * Please describe any relevant projects that you have worked on previously and what knowledge you gained from working on them (include links):

The most important thing I have learned from previous projects is the importance of making data import technically accessible to your target users. I worked on a project at Skills Development Scotland implementing a bulk import facility using the XCRI schema. This initially asked for learning providers to submit their course data as a schema-compliant XML file, which few had the skills or resources to do. A second implementation used a spreadsheet template which mapped to the schema, but gave insufficient feedback on any errors in input. I was not in a position to influence the original project design, but gained a great deal of insight into issues faced by the users. In researching other upload interfaces during the course of this and my subsequent work at a startup company in the same field, I have come to understand the importance of visual feedback in the data import process.
 * What project(s) are you interested in (these can be in the same or different organizations)?