Extension:Wikibase Quality Extensions

Welcome to WikidataQuality!

We are a team of students from Hasso-Plattner-Institute in Potsdam, Germany. For our Bachelor's project we're working together with the team of Wikidata to ensure their data quality.

In consultation with the Wikidata community, two projects have emerged. On part of our team is currently working on a tool that validates Wikidata by comparing it againts external databases, whereas the second part wants to improve the usage and visualization of constraints.

Improving Constraint Reports
When we started working on this projects, the only way to define constraints was on the talk page of a property, and it could only be done via editing templates. This is neither user-friendly nor easy to maintain. On the contrary: During our studies we found, that there are over 4000 hand-written constraints, but some of them don't match exactly the definition of the templates, e.g. Single_value instead of Single value. It is very difficult for a bot to check the data againts their corresponding constraints, when some of them are written wrong.

So this is the status quo: There are constraints on the talk page of properties and there is a bot checking the data he finds in dumps of Wikidata against those constraints and genereates these constraint reports. While this definetly generates additionally value, it isn't nice to read, the underlaying constraints are a pain to maintain and checking againts a dump is of course not as accurate as checking againts live data.

Luckily, it is now possible to create statements on properties. Based on that feature, we are planning to migrate the constraints from the talk pages, enabling us to generate meaningful constraint reports right where they are needed.

tbd: Vision + Mockups

Currently, we are building a special page where you give an ItemID and we generate a table with the constraint report. Right now, we do this based on the constraints that were defined on the talk pages. To be able to do this on a special page in reasonable time and particulary on live data, we parsed every talk page and build a table with every constraint with their corresponding parameters.

In the end, the result of this check should be displayed right beside the statements when you visit an item page, but this will take a while. Until then, we want to migrate the constraints to the statements on properties, so that our special page can work without the usage of the table we genereated from the talk pages.

And here, we need your help:

tbd: link to property proposal page, explanation

External Validation
tbd: Description of this project

For further information please visit our Github Wiki.