Wikibase/Coding conventions

This page describes the coding conventions used within the wider Wikibase codebase. This includes code formatting policies, high level architecture recommendations and everything in between.

A good portion of the Wikibase code is part of MediaWiki extensions, and almost all of the code has been written to support these MediaWiki extensions. As a result the guidelines used are based on the MediaWiki coding conventions, in particular the ones that apply to PHP and JavaScript. This document outlines the additions and modifications compared to these guidelines. In other words, you should generally follow the MediaWiki coding conventions unless they are contradicted by this document.


 * How to use this document

It is also important to consider that the more complicated the subject of the guideline, the more you are recommended to use critical thinking and not blindly apply said guideline. To be concrete, simple things such as code formatting should be viewed as policies and strictly adhered to in almost all cases. Subjects such as component design have a set of applicable recommendations, and rules that should generally be followed. Those rules will of course not always apply, and almost always still require one to make informed trade-offs. Good design cannot be achieved by mindlessly applying rules.

The coding conventions described in this document are also general. They apply mostly to most of the packages in the Wikibase codebase and closely associated projects. Some packages do not follow a specific guideline, which can be by design, or due to legacy reasons. One should carefully consider diverging from policies adhered to in these packages even if they conflict with the ones outlined here. If the package specific documentation does not provide more information on these divergences, it is recommended you contact the package maintainers for more information.

Tests
In general we apply the policy that all new behavior should be tested. There are exceptions for trivial code, such as getters and setters, in particular when the code is not expected to be widely dependent upon. Tests will make it easier for yourself to ensure your code is correct (testing through the user interface takes a lot more effort and is less effective), and will get your code through review quicker.

Use test doubles where appropriate. Try to avoid fakes and mocks if simple stubs or spies will suffice.

Most code should have real unit tests (that just test the unit of code) rather then component, integration or system tests. When writing a test that falls in one of the later categories, keep it separate, so people can run all unit tests without needing to wait for some database or network.

Follow Arrange Act Assert. Test only one logical thing with a single test method. Indicate what is being tested in the method name. does not explain itself well, while  does.

Comments

 * Prefer clean code over bad code with comments


 * "When you feel the need to write a comment, first try to refactor the code so that any comment becomes superfluous." -- Martin Fowler, Refactoring: Improving the Design of Existing Code, 1999

Generally try to avoid comments. Only add those that are really needed, and cannot be made superfluous by making the code itself more clear.


 * Avoid redundant comments

The comments here do not add any value. They therefore should be omitted.

This is much better:


 * Comments themselves should be clean

When running into a situation where a comment is needed, take care to write a short and precise comment that has correct grammar and punctuation.


 * Keep in mind the cost of comments

Comments are prone to being inaccurate, and tend to become more inaccurate over time, as code gets updated, while the comments do not. Unless great care is taken, comments will end up spreading disinformation, often subtly. At point they hurt more then helping. In this regard every comment is a liability.

Class design
The SOLID principles should be kept in mind and generally adhered to.

Package design
The package principles should be kept in mind and generally adhered to.

PHP
We are using PHP CodeSniffer to automatically enforce certain style decisions. Our rule set is mostly based on the [//phabricator.wikimedia.org/diffusion/MCSN/ MediaWiki CodeSniffer rule set], with a few exceptions and additions. The differences are documented at https://github.com/wmde/WikibaseCodeSniffer/blob/master/Wikibase/ruleset.xml.

Basic Coding Standard
You must follow [//github.com/php-fig/fig-standards/blob/master/accepted/PSR-1-basic-coding-standard.md PSR-1]. This includes following an autoloading standard, such as PSR-0 or [//github.com/php-fig/fig-standards/blob/master/accepted/PSR-4-autoloader.md PSR-4].

Namespace names must use SturdyCaps.

Documentation header
The MediaWiki coding guidelines on this topic state one should use a verbose GPL header. Our policies state the opposite and strongly discourage inclusion of such verbose headers.

The general idea behind our policy is that only documentation relevant to developers should be included. Everything else is clutter that gets in the way. When someone opens a file, you do not want them to have to always scroll down first before they can actually see the relevant code. The code is more important then the address of the Free Software Foundation.

This means the following things should not be included:


 * The verbose GPL header
 * @file tags, either with or without file name
 * Any @ingroup or @package tags – organization is already done by the directory structure and namespaces
 * Any dates or other crust your default IDE template might contain

Things that need to be included:


 * @license tag
 * @author tag (there can be multiple)

Other things that are often included:


 * @group tag in tests
 * Any documentation the class or interface requires

Namespace, imports and newlines
Formatting should furthermore follow the above example. This means having the following sections, each separated with a blank line:


 * The PHP opening tag
 * The namespace your class resides in
 * Any use statements to import resources from other namespaces
 * The class documentation header directly followed by the class declaration

The namespace declaration and the use statements all use fully qualified names that do not start with a backslash.

@since tags
@since tags indicate in which version the annotated artefact has been introduced. This is important for people building on top of a component and want to provide compatibility with more then the latest development version.

@since tags should only be used in code bases that follow [//semver.org/ semantic versioning]. This typically excludes extensions like Wikibase itself.

As with all manually maintained information in comments, these tags come with a cost. They are clutter to a lot of people that have no use for them, and they have the potential to be wrong. It is therefore discouraged to add these tags to all classes and methods. Adding @since tags for protected or private methods serves little purpose. The same goes for adding these tags to classes or interfaces that are private to a package. Public classes and interfaces, as well as the public methods defined in them, likely deserve a @since tag.

Referring to resources in other namespaces
Code inside classes should generally not contain any fully qualified names, and rather import the needed classes or interfaces.

Wrong:

Right:

Usage of relative references is also discouraged.

PHPUnit

 * [//phpunit.de/manual/current/en/writing-tests-for-phpunit.html#writing-tests-for-phpunit.data-providers Data providers] should be named in the format . The format   (as seen in the PHPUnit documentation) is also acceptable.
 * Test methods and providers should not be made static unless there is specific reason to do so.

The [//phpunit.de/manual/current/en/test-doubles.html PHPUnit mock API] can be used. For test doubles that are needed in multiple tests, creating a dedicated class is often more clear.

JavaScript style policies
In general, the MediaWiki JavaScript coding conventions shall apply to the Wikibase extension as well. A few differences and additional specifications are outlined below.

Function and variable prefixes
Since JavaScript does not support encapsulation, we are using an underscore to mark functions and variables private discouraging the use out of their scope:

jQuery wrapped elements should be prefixed with a dollar sign to be able to distinguish those easily from other variables. In addition, this allows naming a jQuery node like the instance of a widget that is initialized on that node:

Whitespace
We follow the MediaWiki coding conventions:

Except for comments, spaces should, in general, not be used for the purpose of alignment within the code:

Indentation
When chaining more than one function call, each call should be placed in a new line without applying any indentation to prevent ending up with deep indentation levels:

This convention does not apply when assigning a variable. However, one should not declare other variables along with such a variable declaration. Just separate the variable declarations:

Line breaks
Like to the MediaWiki coding conventions, lines should not exceed the length of 100 characters (tab equals 4 spaces). When a line break within a statement cannot be avoided, the operator should be wrapped to the next line indented by an extra level to indicate the continuation of the statement. Closing parentheses should always be on the same level like their corresponding opening ones.

jQuery DOM element creation
Although quick-closing is not necessary when creating elements,  is preferred over   for consistency throughout the codebase.

Keeping context in callbacks: jQuery.proxy; vs. var self = this;
For consistency throughout the codebase, usage of is preferred over

jQuery.Promise
When dealing with promises, the,   and   methods should be used rather than the less expressive. Callback parameters given when the promise gets resolved should be documented explicitly.

Accessing jQuery widget instances
For accessing jQuery widget instances, the .data function should be used resulting in a more expressive code:

Documentation
Just like in MediaWiki, we use [//github.com/senchalabs/jsduck/ JSDuck] for generating documentation. Consequently, only tags listed in the [//github.com/senchalabs/jsduck/wiki#syntax JSDuck syntax documentation] should be applied. The only additional tag is "@license". Documentation should never be on file level, but on constructor/class level as well as on function and member level. Scoped global functions and variables should be marked with "@ignore" to not have them end up as class members in the documentation.

Types
Do not use scope arguments in the documentation, but be explicit in type and function parameter documentation since the generated documentation does not resolve scope arguments. Instead of …

… apply:

Optional arguments
In function documentations, optional function arguments should be documented with square brackets around the variable name making their optional nature more obvious.

Instead of …

… use:

Mixed arguments
Can be documented as

jQuery Widget documentation
Since there should be not more than one widget per file, the file level documentation should be used to document a widget. In addition to the default documentation tags @license and @author, the documentation header should contain a description of the widget and documentation of the widget's options as well as the events triggered by the widget using the @option and @event tag respectively.

(jQuery) promises
In order to avoid having to analyse a function body of a function returning a promise object, all values forwarded by promises should be documented, for example: The format ensures a properly readable documentation getting generated.

Array iteration
We decided to use [//developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Array/forEach Array.prototype.forEach] and [//developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Array/every Array.prototype.every] instead of $.each.

''Please keep in mind that .forEach( value, key ) and $.each( key, value ) have a different order of parameters and .forEach doesn't support escaping the iteration by returning false. In this case one should use .every instead of .forEach.''