Manual:$wgUseTidy

Details
Use "HTML tidy" to make sure HTML output is sane.

HTML tidy is a free tool that fixes broken HTML. See HTML tidy and http://www.w3.org/People/Raggett/tidy/

You may wish to setup this tool, and set $wgUseTidy=true, to ensure that the wiki outputs reasonably clean and compliant HTML, even when malicious or foolish users add corrupt/badly formatted HTML to wiki pages.

Note that MediaWiki already does some built-in checks and corrections to user's HTML, and limits the range of html tags and attributes which can be used (unless you set $wgRawHtml=true Dangerous!) Limitations are described at meta:Help:HTML in wikitext. The logic for this is found in . As such, you may decide that running HTML tidy over the output is not necessary.

Note: HTML tidy will irreversibly and unexpectedly mangle standard HTML markup when it feels like it. For example, wikitext like  will not actually produce a clickable link. There are several dozen particular bugs identified that are likely never to be fixed (see  and its list of blockers). If you enable Tidy, you're in for a world of hurt.

Configuration
The location of the tidy configuration file can be set using $wgTidyConf - before MediaWiki 1.10, this was required. In later versions, a working default is provided.

However, this may not always work: see $wgTidyInternal for some more installation information.

Effects
Tidy is still required to mix wiki table and html table syntax, as well as simple wikicode and html-style markup.
 * Mixed open/close tags.
 * '''foo
 * foo&amp;lt;/b&amp;gt;
 * foo
 * Definition list nesting
 * ; hi
 * one
 * Definition list nesting
 * ; hi
 * one
 * one

 one  hi one</li></ol> </dd></dl> Tidy can correct most bad HTML, which can be bad user input like or conflicting or badly written extensions (and even some bugs in the core software). However, it does not resolve all strict XHTML validation issues, such as duplicate xml ID attribute values, or IDs starting with numbers.
 *  hi
 * }