Importing data into Wikibase
Perhaps the greatest challenge of a clean, empty database is that of properly filling it up with what you want to put in it. Wikibase is no exception; this section covers the most common and useful tools for getting data into Wikibase.
QuickStatements is the useful import tool you’ll find already running in your Docker setup. It’s more than an import tool; it can be used to edit and modify data. You can find the easy-to-use QuickStatements interface on your Wikibase instance by navigating to
<your Wikibase url>/tools/quickstatements.
The essential help document linked above is packed with useful information far too intricate to be summarized here, but it’s worth noting that OpenRefine data (see below) can be exported to QuickStatements format.
WikibaseImport is a MediaWiki extension and a flexible, powerful command-line tool that allows you to import into Wikibase using exports of data from other Wikibase instances. It’s best for experienced Wikibase users who have a firm grasp of Wikibase’s data architecture and are comfortable using command-line tools on their instance’s web server. (See the maintenance section for more information on working directly with a container)
Wikibase Integrator, also known as Wikidata Integrator, is a Python tool for creating sophisticated bots that can read from and write to Wikibase. It was developed to improve on Pywikibot’s handling of the MediaWiki API and integrate tightly with the Wikibase SPARQL endpoint.
Read here how the developers put WikibaseIntegrator to work.
Originally developed by Google, OpenRefine is now a community-supported data management tool that can handle large bodies of data and wrangle them into a format suitable for importing into Wikidata.
While OpenRefine support for Wikibase users is not yet available, progress on this feature is likely coming in the future. For more information, see this discussion on adding support for Wikimedia Commons.
The native MediaWiki export and import functionality can be used to transfer Wikibase data between wikis. Some configuration changes are needed, but not extra software or programming work.
RaiseWikibase is a Python tool for fast inserts into a Wikibase instance. It uploads up to a million entities and wikitexts per hour, fills data directly into the MariaDB database and can create a bot account for the wrappers of the Wikibase API.
The main use case is uploading millions of pages into a fresh Wikibase instance. Read how to use it.