Manual:Tag extensions



Individual projects will often find it useful to extend the built-in wiki markup with additional capabilities, whether simple string processing, or full-blown information retrieval.

Tag Extensions allow users to create new custom tags that do just that.

For example, one might use a tag extension to introduce a simple   tag, which injects a donation form into the page.

Extensions, along with man1>Special:MyLanguage/Manual:Parser Functions|Parser Functions and man2>Special:MyLanguage/Manual:Hooks|Hooks are the most effective way to change or enhance the functionality of MediaWiki.

You should always check the matrix before you start work on an extension to make sure someone else hasn't done exactly what you are trying to do.

A simple tag extension consists of a callback function, which is hooked to the parser so that, when the parser runs, it will find and replace all instances of a specific tag, calling the corresponding callback function to render the actual HTML.

Example
This example registers a callback function for the   tag.

When a user adds this tag to a page like this:  , the parser will call the renderTagSample function, passing in four arguments:

Input between the &lt;sample&gt; and &lt;/sample&gt; tags, or null if the tag is "closed", i.e. &lt;sample /&gt; Tag arguments, which are entered like HTML tag attributes; this is an associative array indexed by attribute name. The parent parser (a Parser object); more advanced extensions use this to obtain the contextual Title, parse wiki text, expand braces, register link relationships and dependencies, etc. The parent frame (a PPFrame object). This is used together with $parser to provide the parser with more complete information on the context in which the extension was called.
 * $input :
 * $args :
 * $parser :
 * $frame :

For a more elaborate example, see 1>Special:MyLanguage/Manual:Tag extensions/Example</>|Tag extension example

Attributes
Let's look at another example:

This example dumps the attributes passed to the tag, along with their values.

It's quite evident that this allows for flexible specification of new, custom tags.

You might, for example, define a tag extension that allows a user to inject a contact form on their user page, using something like &lt;emailform to="User" email="user@foo.com" /&gt;</tt>.

There is a veritable plethora of tag extensions available for MediaWiki, some of which are cat>Special:MyLanguage/Category:Parser extensions</>|listed on this site; others can be found via a quick web search.

While a number of these are quite specialised for their use case, there are a great deal of well-loved and well-used extensions providing varying degrees of functionality.

Conventions
See <tvar|1></> for the general layout and setup of an extension.

Publishing your extensions
A convenient template has been created to hold this information called <tvar|extension></>. See the template page for more information. You should also add as much detail as possible to the body of the page, and it is wise to check back fairly regularly to respond to user questions on the associated talk page. Also, make sure the page belongs to <tvar|cat></>.
 * 1) Create a new page on this wiki named Extension:<extension_name> with information on your extension, how to install it, and screenshots of it in use.


 * 1) Extensions that create new hooks within the extension code should register them on extension hook registry.


 * 1) Notify the mediawiki-l mailing list.

See also publishing your extension.

Security concerns
You'll notice above that the input in the examples above is escaped using htmlspecialchars</tt> before being returned.

It is vital that all user input is treated in this manner before echoing it back to the clients, to avoid introducing vectors for arbitrary HTML injection, which can lead to cross-site scripting vulnerabilities.

Loading modules
The right way to add modules for your extension is to attach them to the ParserOutput rather than to $wgOut.

The module list will then be automatically taken from the ParserOutput object and added to $wgOut even when the page rendering is pre-cached.

If you are directly adding the modules to $wgOut they might not be cached in the parser output.

Timing and extensions
If you change the code for an extension, all pages that use the extension will, theoretically, immediately reflect the results of new code.

Technically speaking, this means your code is executed each and every time a page containing the extension is rendered.

In practice, this is often not the case, due to page caching - either by the MediaWiki software, the browser or by an intermediary proxy or firewall.

To bypass MediaWiki's parser cache and ensure a new version of the page is generated, click on edit, replace "action=edit" in the URL shown in the address bar of your browser by "action=purge" and submit the new URL.

The page and all templates it references will be regenerated, ignoring all cached data.

The purge action is needed if the main page itself is not modified, but the way it must be rendered has changed (the extension was modified, or only a referenced template was modified).

If this is not sufficient to get you a fresh copy of the page, you can normally bypass intermediary caches by adding '&rand=somerandomtext' to the end of the above URL.

Make sure 'somerandomtext' is different every time.

How do I disable caching for pages using my extension?
Since MediaWiki 1.5, the parser is passed as the third parameter to an extension.

This parser can be used to invalidate the cache like this:

Regenerating the page when another page is edited
Maybe you don't want to disable caching entirely, you just want the page to be regenerated whenever another page is edited, similar to the way that template transclusions are handled.

This can be done using the parser object that is passed to your hook function.

The following method was lifted from CoreParserFunctions.php and appears to work for this purpose.

Fine grained adjustment of caching behavior
You can use fine grained caching for your extension by using cache keys to differentiate between different versions of your extension output.

While rendering you can add cache keys for every feature by adding an addExtraKey method to your hook function, e.g.:

However, modifying $parser->getOptions during parse means that the extra option keys aren't included when trying to get a cached page, only when rendering a page to go into cache, so you can use the PageRenderingHash hook to set extra options.

PageRenderingHash is run both when putting a page into cache, and getting it out, so its important to only add new keys to the hash if they're not already there.

e.g:

Some important notes on this:

! is used a separator for different rendering options
 * Using "!setting1=$value" instead of just "!$value" in the confstr ensures that the parser cache does not become messed up if different extensions are installed or their load order changes.

Be warned that addExtraKey does not tell the parser cache that the extra key is in use, and thus can easily result in breaking the cache if you are not careful.
 * Some people use  instead of.

Since version 1.16
Parser hook functions are passed a reference to the parser object and a frame object; these should be used to parse wikitext.

Parser::recursiveTagParse</tt> has been around since version 1.8.

Its advantages include simplicity (it takes just one argument and returns a string) and the fact that it parses extension tags in $text</tt>, so you can nest extension tags.

The second parameter to recursiveTagParse, $frame</tt>, is an optional argument introduced in MW 1.16 alpha (r55682).

In other words, content such as  </tt> will be recognized and converted into the appropriate value.
 * If $frame</tt> is provided (using the value of $frame</tt> passed to your extension), then any template parameters in $text</tt> will be expanded.

Although this unlikely to be the desired behavior, this was the only option available before MW 1.16.
 * If $frame</tt> is not provided (e.g., $parser->recursiveTagParse( $text )</tt>), or if $frame</tt> is set to false, then template parameters will not be expanded;  </tt> will not be altered.

However, one step of parsing that is still skipped for tags, even when using recursiveTagParse, is Parser::preSaveTransform.

preSaveTransform is the first step of parsing, responsible for making permanent changes to the about-to-be saved wikitext, such as:


 * Converting signatures (, ~ ,  )

Without this step, shorthand links such as Help:Contents are considered to be invalid, and are left in their wikitext form when parsed.
 * Expanding link labels, also known as the pipe-trick (e.g., changing Help:Contents into Contents ).


 * Expanding templates.

The original call to preSaveTransform intentionally skips such conversions within all extension tags.

If you need pre save transform to be done, you should consider using a parser function instead.

All tag extensions can also be called as a parser function using  which will have pre save transform applied.

Version 1.8 to version 1.15
The only difference before 1.16 is that the $frame argument was not available for Parser::recursiveTagParse</tt>.

If the resulting inability to recognize template variables is a problem, bug 2257 has more information and workarounds.

Since version 1.5
Since MediaWiki 1.5, XML-style parameters (tag attributes) are supported.

The parameters are passed as the second parameter to the hook function, as an associative array.

The value strings have already had HTML character entities decoded for you, so if you emit them back to HTML, don't forget to use, to avoid the risk of HTML injection.

How can I avoid modification of my extension's HTML output?
The return value of a tag extension is considered almost parsed text, which means its not treated as pure html, but still modified slightly.

There are two main things that are done to the output of a tag extension (Along with a couple other minor things):

Strip markers are certain items which are inserted at various stages of processing wikitext to act as a marker to re-insert removed content at a later time. This is not something extensions usually need to worry about.
 * Replace strip markers.

This can sometimes be an issue in some extensions.
 * Parser::doBlockLevels which turns *'s into lists, and turns any line starting with a leading space into a &lt;pre&gt; among other things.

Tag extensions also support returning an array instead of just a string (Much like parser functions) in order to change how the return value is interpreted.

The 0th value of the array must be the html.

The "markerType" key can be set to nowiki</tt> in order to stop further parsing.

Doing something like  would ensure that the $html value is not further modified and treated as just plain html.

How do I get my extension to show up on Special:Version?
In order for your extension to be displayed on the MediaWiki Special:Version page, you must assign extension credits within the PHP code.

To do this, add a $wgExtensionCredits</tt> variable as the first executable line of code before the hook line or function definition.

An example extension credit is:

Replace <tt>validextensionclass</tt> with one of the following (unless your extension falls under multiple classes&mdash;then create a credit for each class):


 * 'specialpage'&mdash;reserved for additions to MediaWiki Special Pages;


 * 'parserhook'&mdash;used if your extension modifies, complements, or replaces the parser functions in MediaWiki;


 * 'variable'&mdash;extension that add multiple functionality to MediaWiki;


 * 'media'&mdash;used if your extension is a media handler of some sort


 * 'other'&mdash;all other extensions.

The <tt>myextensionmsg</tt> is the name of an interface/i18n message that describes your extension that will need to be defined in your extension's i18n.php file.

If you omit this field, the <tt>description</tt> field will be used instead.

Retrieving the tag name inside of the callback
Suppose you have several tags <tvar|foo> </> and <tvar|bar> </> that share the same callback, and inside the callback function, you want to obtain the name of the tag that invoked the callback.

The short answer is: the tag name (<tvar|foo> </> or <tvar|bar> </>) is not present in any of the callback's arguments.

But you can work around this by dynamically constructing a separate callback for each tag: