Help:Extension:GWToolset

Introduction
You’re probably reading this because you are considering or planning to make a large amount of content available for reuse by publishing it on Wikimedia Commons. This manual will guide you through the necessary steps.

Technical compatibility analyses
The Toolset has been developed to be used by the most common way GLAMs have organised their content. This means that the Toolset is easy to work with for most organisations, but that some will have to take extra measures before they can use it. The diagram in this paragraph can be used to determine how compatible the Toolset is for your organisation. Every question in the diagram is explained underneath.

Is the XML in flat format?
There are several standards that are currently used by organisations to organise their metadata, for example OAI-PMH, EDM, MARC and Lido. The GLAMwiki Toolset accepts all forms of metadata as long as the data complies to the following requirements.

What is flat format?
The metadata of individual objects have to be on the same level of hierarchy in the XML file, that's what 'flat' revers to. Metadata in a deeper level, further in the hierarchy, is not recognised by the Toolset.

The use of attributes
Attributes of declarations are also not recognised with one exception: the language attribute. This attribute can be used to recognise the descriptions of objects in different languages.

For example , is recognised as a description in English.
 * This is a description

Will be seen by the Toolset as The PhotoID in this example will not be read. Information in attributes can cause loss of information.
 * www.example.org
 * www.example.org

Multiple descriptions in one metadata field
Er zijn velden die meerdere keren voor kunnen komen, bijvoorbeeld . Er is voor gekozen om deze samen te voegen, gescheiden door een pipe ( | ). In sommige gevallen heeft een bestand meerdere omschrijvingen, bijvoorbeeld "voertuigen", "rupsvoertuig", "vlammenwerper" en "gevestvoertuig". Als deze allemaal als  zijn opgenomen worden deze toegevoegd. Het is dus goed om deze in verschillende declaraties te onderscheiden.

Dus niet
 * 


 * "voertuigen", "rupsvoertuig", "vlammenwerper" ,"gevestvoertuig"


 * 

maar


 * 


 * voertuigen


 * 


 * 


 * rupsvoertuig


 * 


 * 


 * vlammenwerper


 * 


 * 


 * gevestvoertuig


 * </dc:description>

Door je elementen te scheiden in je xml zorg je ervoor dat deze op de juiste manier worden weergegeven op Wikimedia Commons.

Can the XML be transformed in flat format?
Mocht de metata omgezet worden om aan bovenstaande eisen te voldoen dan kan dat door:

* een specialist een script op te laten stellen die dat omzet

*  gebruik te maken van xslt:  http://www.w3schools.com/xsl/

* Kiezen voor een standaard die xml plat publiceert, bijvoorbeeld OAI-PMH en tot op zekere hoogte de Europeana API

* Google Refine of Open Refine zou ook kunnen helpen:  https://github.com/OpenRefine

Selecting content
Not all content is suited to be published. File formats, copyright restrictions, organisational restrictions, etc. determine if a work can be published on commons. These variables also determine if a content upload can be done in one batch or if it is better to separate the content into separate batches.

Content types
Every type of content needs a different metadata template. It is not possible to upload photos and sound files in one batch, these need to be separated in a batch of photos and a batch of sound files.

License Types
It is not possible to upload content with different licenses in one batch. Let's say you want to upload files that are available under a CC BY and files with a CC BY-SA license, then you'd have to separate the uploads in a batch for every license.

Permissions
Content that was created hater 1923 probably needs a notice that you have permission form the creator to release these files under one of the accepted licenses for Wikimedia Commons. It is not possible to upload files of different creators in one batch because you need an OTRS ticket number for every creator.

Content donation strategies
There have been several large content donations already. All of these were mass donations: one single event where all the content was uploaded to Wikimedia Commons. This is not the only way to do a donation. This chapter discusses different strategies for content donations.

One time mass donation
This is the classic way of donating content: a large scale donation of the content that can be selected with the available sources.

Advantages:

Theme based
Some GLAMs are currently considering theme based uploads. A theme can be an exhibition. This means that selecting the content that will be uploaded to commons can become a part of the process of preparing an exhibition.

Advantages:
 * Ongoing process of uploads, every new upload gains interest
 * Lessons learned from past upload can be

Advantages

Requesting access
To use the toolset you need to:
 * 1) Be a user of the wiki.
 * 2) Be granted access rights to the toolset.

We recommend that you do all testing on the Commons Beta server and only once you feel that the tool is giving you the results you want, use it on the Production server. Because these are two separate environments, you will need to have a user account on each and request access on each. The best way to do this:
 * 1) Commons Production server: leave a message on the Commons notice board to request rights for the GWtoolset. Please introduce yourself and motivate your request.
 * 2) Commons Beta server: contact a developer or bureaucrat on beta to request the rights for the GWToolset user group on beta. You can ask in the commons IRC channel or contact them from these lists:
 * 3) Bureaucrats
 * 4) Developers

Metadata templates
Wikimedia Commons uses templates to map metadata. The amount of metadata that will be displayed on Commons is therefor limited to the fields that are present in the metadata template that is chosen for the upload.

There are several templates available. Some of the templates that are available are: Note: This list is not yet complete
 * Art_Photo: https://commons.wikimedia.org/wiki/Template:Art_Photo
 * Artwork: https://commons.wikimedia.org/wiki/Template:Artwork
 * Book https://commons.wikimedia.org/wiki/Template:Book
 * Musical work https://commons.wikimedia.org/wiki/Template:Musical_work

There is currently no template available for video content. It's not possible (yet) to use a template you created yourself.

The type of work that you want to upload determines the template you ought to use. This also me ands that it is not possible to upload multiple types of content that require different templates. E.g.: if you want to upload photos and sound files you should separate these uploads and XML files in an upload (and XML file) of the photos and an upload (and XML file) of the sound files. It is not possible to upload both file types in one batch.

License template and other metadata sub-templates
Some metadata fields also use templates. An example is the metadata field for the license of a mediafile. A Creative Commons license will be recognised by the Toolset and results in the display of the corresponding banner with the license. It is possible to create your own template. This is useful when you've cleared permission to use the content and received an OTRS ticket to include with the files. See [|this example of an OTRS ticket in a license template]. If the text in the license field is not referring to a template, this information will be shown as plain text.

Note: the Wikimedia Commons community is very strict when it comes to permission of files usage. The content is most likely deleted when there is any doubt about copyright infringement or other restrictions that do not permit the use of the file on the Wikimedia platforms. This is why a good license template is an absolute must.

Institution Template
An institution template is used to show what institution donated and/or uploaded the file to commons. The template makes it possible to add more information about your institution than only the name of the institution. An example of an institutional template is this template of the Amsterdam Museum. Usefull information to include in this template is: This template is not required, but highly recommended to include with your uploads.
 * The logo of your organisation
 * A photo of the building of your organisation
 * The location (City, country, etc)
 * The coordinates
 * The URL to your website

An institution template will be recognised by the Toolset. The template mentioned above will be included by the Toolset if the source tag in the XML file has the same name as the template, in this case: Amsterdam Museum</dc:source>.

Source template
https://commons.wikimedia.org/wiki/Category:Source_templates

https://commons.wikimedia.org/wiki/Template:British_Library_image

Screencast
The following screencast gives you a quick overview of how to use the extension. You can follow along by going to Special:GWToolset and following the wizard instructions. Note: you will need to be a member of the “gwtoolset” group in order to use the extension. Contact a Wikimedia Commons admin to be added to the group.