EMWCon Spring 2016/Provenance working group

The Provenance working group (Yingjie, BlaueBlüte) at EMWCon Spring 2016 developed some ideas on tracking provenance of data stored in Semantic-MediaWiki installation and on how to make use of such metadata.

Goals

 * Enable users to identify trustworthiness of data in SMW.
 * Capture the Where–When–Who of data to facilitate (content) management.
 * Enable analysis of history of sematic data—lineage.
 * Track connection between “classes” and “instances” over time—has the definition of a “class” changed after it was instantiated?

Use Facets

 * Define provenance metadata along with, e.g., property values.
 * View provenance data alongside page display, query results, etc.
 * Use provenance data in queries, e.g. to restrict queries based on trustworthiness.

Sources of provenance data

 * wiki-internal
 * contributors
 * edit timestamps
 * external
 * editor-provided, like references
 * hybrid (?)
 * external ratings of contributors
 * external ratings of individual pages (e.g., page rank?)

Strategies

 * Amending SMW syntax
 * Subobjects

Example
This shows how external provenance data could be defined:

A query could then look like this:

This query might then return something like: