The data model of WikibaseMediaInfo describes the structure of the data that is used to describe media files and content in Wikibase. The WikibaseMediaInfo extension embeds such machine readable descriptions on file description pages. This data model is conceptual ("Which information do we have to support?") and does not specify how this data should be represented technically ("Which data structures should the software use?") or syntactically ("How should the data be expressed in a file?"). Separate documents describe the serialization of the Wikibase data model in JSON and in RDF (Resource Description Framework). The MediaInfo data model defines basic concepts and relationships needed to describe media files and content, which act as a fixed ontology. This ontology provides a minimal scaffolding that allows Items and Statements to be used for detailed modeling of media files and content. The specification of the MediaInfo data model is based on the Wikibase data model, so the Wikidata glossary and the Wikibase data model primer may be helpful in understanding this document. The MediaInfo data model aims to align with established standards for describing media content and files. TBD: which established standards, and how do we relate to them.
A MediaInfo entity describes some kind of media content or file. Media content and files are described using the following information:
- An ID. MediaInfo entities have IDs starting with an "M" followed by a natural number in decimal notation, e.g.
M3746552. These IDs are unique within the repository that manages the MediaInfo entity. The ID can be combined with a repository's concept base URI to form a unique URI for the MediaInfo entity e.g. https://commons.wikimedia.org/wiki/Special:EntityData/M72261258.json
- A Caption for use as a short name for a media content or file, particularly in listings. The caption may also be used as a sub-title when embedding the image elsewhere.
- A list of Statements to describe properties of the media content or file (e.g. painted by or recorded at or license or material)
The caption is a short name for the media content or file. The intended use is similar to that of a descriptive file names, but without any technical details (like the file extension) and translatable. Note that the caption does not have to be unique, it should just be descriptive.
Captions are not simple strings, but MultilingualTextValues, so users can see media files represented in their preferred language.