Parsoid/Deployments

Planned deployments, linked from Deployments. For a list of past deployments, look for 'parsoid' in Server Admin Log.

See Parsoid to learn how to deploy a new version of Parsoid.

Wednesday, Jun 25, 2014: 4ef9d6be will be deployed around 1pm PST

 * : Recognize  and   as html elements
 * Allow comments and spaces before table lines
 * : Empty line with comment eats trailing nl token
 * Small tweaks to indent-pre handling in the serializer

Monday, Jun 23, 2014: ✅

 * : Fixed bug in nowiki-escaping of magic words
 * Additional tweaks and improvements to selective serializer

Wednesday, Jun 18, 2014: ✅

 * : Suppress &lt;nowiki&gt;s for table WT strings outside tables
 * : Add nowiki protection around quotes adjacent to I/B tags
 * : Strip unsupported table tags during serialization
 * Cleanup, fixes and improvement to serializer around handling of leading white-space on new lines.

Monday, Jun 16, 2014: deploy cancelled
Deployment cancelled to investigate issues found in testing.

Wednesday, Jun 11, 2014 around 1pm PST: ✅

 * : DOM support for DISPLAYTITLE magic word.
 * : Invalid links in HTML are serialized to MediaWiki:Badtitletext message.
 * : Escape nowiki when combined with other wiki markup.
 * Some serialization tweaks.
 * More CSS tweaks for Parsoid HTML.

Monday, Jun 9, 2014 around 1 pm PST: ✅

 * : Mediawiki and Parsoid CSS styles linked to Parsoid HTML.
 * : Allow links with angle brackets after anchor.
 * : Accept !! in table data.
 * Parsoid enabled on outreach wiki and wikimania2014 wiki.

Wednesday, Jun 4, 2014 around 1pm PST: ✅

 * : Fix nowiki escaping bug in template args during serialization.
 * Fix to tokenizer to better handle table / indent-pre interactions.

Monday, Jun 2, 2014 around 1pm PST: ✅

 * Bug fix in upright handling for images.
 * : Fix crashers handling pre-like strings.
 * : Fix out-of-stack crashers on some wiki pages.
 * Additional selser tweaks.
 * Additional performance tweaks.

Wednesday, May 28, 2014 around 1:15pm PST: ✅

 * Bunch of tweaks to the selective serializer.
 * Performance fixes to enable parsing humongous pages.
 * Bug fix to the XML serializer around handling of HTML tags.
 * : New empty ExtLinks shouldn't be converted to interwiki.

Wednesday, May 21, 2014 around 1pm PST: ✅

 * : Replace space with underscore in namespace links
 * : Bug fix in nowiki-ing of ";" chars
 * Improved serialization of new language links.
 * Support protocol-relative urls.

Monday, May 19, 2014 around 1pm PST: ✅

 * : Deal with &lt;nowiki/&gt; escaping around url and other magic links (RFC, PMID, ISBN).
 * : Accept  and record info in data-parsoid.
 * Some edge-case improvements to template parsing (see commit summary of https://gerrit.wikimedia.org/r/#/c/133506/).

Monday, May 12, 2014 around 1pm PST: ✅

 * Bug fix nowiki-escaping transclusion args.
 * Improvements to accuracy of DSR information -- eliminates some template wrapping errors.
 * Eliminate crashers when attempting parse of deleted revisions.
 * Edge case tweaks to serializer.

Wednesday, May 7, 2014 around 1pm PST: ✅

 * : Fix for production crashers (edge case).
 * : Fixes bad nesting of formatting and figure elements.
 * Other minor fixes in the tokenizer.

Monday, May 5, 2014 around 1pm PST: ✅

 * : Additional fixes to template encapsulation code based on production crashers.
 * Handling of empty redirects (edge case bug).
 * : Upgrade tokenizer (pegjs) from 0.7 to 0.8 -- required lots of tweaking and fixing of tokenizer.
 * Simple upgrades of other libraries (See https://gerrit.wikimedia.org/r/#/c/130992/)

Thursday, May 1, 2014 around 9:20 am PST: ✅

 * A whole bunch of performance tweaks.
 * : Last set of fixes to template encapsulation code.
 * Use handlebars for ParsoidService views.

Monday, April 28, 2014 around 1pm PST: ✅

 * : Bugfix merging nested template ranges (caused by fostered content in tables) + other fixes.
 * Logging: Suppress stack traces for warnings
 * Several link handling fixes
 * : Handle unescaped single quotes in urls
 * : Correctly handle multiple # chars in links
 * : Serializer: handle full stops in link target
 * Serializer: Underscores not converted to spaces for interwiki links
 * Several other fixes (see https://gerrit.wikimedia.org/r/#/c/126853/ for more)

Wednesday, April 23, 2014 around 1pm PST: ✅

 * Fix oldid logging with error/fatal log entries.
 * : Fix bug merging overlapping template ranges (caused by fostered content in tables)

Monday, April 21, 2014 around 1:45pm PST: ✅

 * : Accept comments in eofl position
 * Support comments before table lines
 * Improved handling of "bogus" image options

Wednesday, April 16, 2014 around 1:30pm PST: ✅

 * : Serialize links with wikitext chars correctly (ex: foo '' bar)
 * : Match fixed PHP behavior for framed images with a height specification
 * : Multiple commits to fix crashers found in RT testing
 * Accept entities in ref attributes
 * Improvements to wrapping of fostered transclusions

Monday, April 14, 2014 around 1pm PST : ✅

 * : Handle multiple colons in titles in subpage-supporting namespaces
 * , : Improvements to serialization of interwiki links
 * : Fix parsing and serialization of invalid wikilinks
 * : Fix some edge case template encapsulation scenarios

Wednesday, April 2, 2014 around 1pm PST : ✅

 * : Improved serialization of empty i/b nodes.
 * : Fix serialization of headings, etc. after categories.
 * : Accept multiple comments in start-of-line context (headings, etc.).
 * : Accept multi-line comments after headings.
 * Accept comments in template targets.
 * Sanitizer fix for handling protocols like news: and javascript: (no security issue, validation happens elsewhere too)

Monday, Mar 31, 2014 around 1pm PST: deploy canceled
Canceled deployment to investigate issues caught in testing.

Monday, Mar 24, 2014 around 1pm PST: ✅

 * Dont generate NaN dimensions after edits.
 * Fixed bug in detecting unresolvable tpl targets.

Wednesday, Mar 19, 2014 around 1pm PST ✅

 * Improved connection timeout handling
 * Handle non-string extension attribute values
 * Allow scaling of Vector images

Monday, Mar 17, 2014 around 1pm PST ✅

 * Support for manual thumbnail option (thumb=) on images.
 * Roundtrip empty image attributes.
 * : Improvements to RT-ing of fostered content.

Thursday, Mar 13, 2014 around 4pm PST ✅

 * Redeployed.

Wednesday, Mar 12, 2014 around 1pm PST ✅

 * Parse and roundtrip invalid image options
 * Fix image up-scaling for 'format unspecified' images
 * A bunch of code cleanup.

Because of a bug in the deployment system, the deployment did not happen and Parsoid remained stuck at 98936e7a according to http://parsoid-lb.eqiad.wikimedia.org/_version.

Monday, Mar 10, 2014 around 1pm PST ✅

 * New logging framework deployed with improved error reporting to production logs.
 * Eat > and [ in table / tr attribute names -- improves parsing / serialization of pages with broken wikitext.

Because of a bug in the deployment system, the deployment did not happen and Parsoid remained stuck at 98936e7a according to http://parsoid-lb.eqiad.wikimedia.org/_version.

Monday, Mar 3, 2014 around 1pm PST ✅

 * Treat all block tags identically in pre-handler
 * DSR computation: Properly handle tags nested in.
 * New tags are now serialized on their own line.
 * More liberal parsing of broken table and table-row attributes in wikitext.
 * Fixed regression dealing with fostered text nodes from tables.

Wednesday, Feb 26, 2014 around 1pm PST ✅

 * Emit | chars outside tables as | text
 * Handle multiple conflicting image options properly
 * Handle templated image options in inline images
 * Bug fixes in pre-handling and DSR output.

Monday, Feb 24, 2014 around 1 pm PST ✅

 * Enabled CORS on all API endpoints.
 * : Support trailing 'pxpx' in image size options.
 * : Correctly handle duplicate options in image wikitext.

Wednesday, Feb 19, 2014 around 1 pm PST ✅

 * Additional fixes for link trail / template interaction.
 * Support link trails for interwiki links.
 * Allow template attributes for image attributes.
 * Support image options that have the "|" char in them.
 * template on nlwiki pages handled properly: parses as expected and serialized properly.

Wednesday, Feb 12, 2014 around 1 pm PST ✅

 * HTML PRE tsr calculation fixes
 * Several clean-up and refactor patches
 * Error logging clean-up
 * Further fixes for link trail / template interaction

Monday, Feb 10, 2014 around 3 pm PST ✅

 * Emit non-piped links for edited redirects
 * Handle linktrails/prefixes correctly for templated links
 * Correctly render p-tags in blockquotes

Thursday, Feb 6, 2014 around 12:30 am PST ✅
Deployed after failed code update of Feb 3, 2014 was fixed.


 * Add Wikiversity to site list in ParsoidConfig

Monday, Feb 3, 2014 @ 11:30 am PST ✅
This was the first deploy from our new repository /mediawiki/services/parsoid/deploy. This deploy includes all fixes over the last 6 weeks (from December 16th, 2013).

This deployment saw the following code improvements go out:
 * Fixes to GC issues that led to memory leaks in node 0.10
 * First pass over long-standing image handling cleanup.
 * First steps implementing a logging subsystem in Parsoid.
 * Code quality fixes to improve robustness of code.

Besides these changes, these specific bugs were fixed.

Images
 * Wikitext tables inside image captions accepted
 * ,, , Use edited image attributes over original values.

Links
 * Interwiki links pointing to current wiki parsed as plain links
 * Update to linktrail/prefix regexp code
 * Update to linktrail/prefix regexp code
 * Serialization of new link redirects serialized fixed
 * handled correctly
 * [[Foo]] handled correctly
 * Trailing extlink-like text in wikilink handled correctly ..[Foo]

Refs &amp; extensions
 * Accept unclosed &lt;references&gt; tag
 * Multiple &lt;references /&gt; tags handled properly
 * Non-standard WS in extension tags accepted

Misc Tokenizer
 * Stray table-end tags ignored in some contexts
 * ISBN with an X recognition

Wikitext escaping fixes
 * Url parsing fix during nowiki escaping
 * Fixes for nowiki escaping of ext-tag like text
 * Fixes to wikitext escaping of link text

Misc edit/serializer fixes
 * Fixed serialization of edited magic words
 * Table end tags always serialized on new lines
 * Whitespace edits properly recognized

Misc
 * Improvements to handling of fostered table content
 * Parsoid binding to specific IP or interface
 * Parsoid now handles OBJECT element

Thursday, December 26, 2013 @ 20:45 UTC

 * Pushed updated Parsoid config to fix broken support for wikis with "-" in their prefix (ex: nds-nl and others).

Thursday, December 19, 2013 @ 00:10 UTC

 * Pushed updated Parsoid config to add support for tyv and min wikipedias.

Monday, December 16, 2013 @ 13:00-14:00 PST ✅

 * Fix for production crashers.
 * Fix for indent-pre parsing in the presence of block tags.
 * Support for per-wiki API proxies.

Wednesday, December 11, 2013 @ 13:00-14:00 PST ✅

 * Fix for broken HTML-pre serialization that lost newlines after opening tag in some cases.

Tuesday, December 10, 2013 around 10 am PST

 * Reverted Parsoid cluster to node 0.8 after discovering memory leaks in production

Monday, December 9, 2013 @ 15:00-16:00 PST

 * Upgraded Parsoid cluster to node 0.10 after running it in round-trip testing without issues since last week (and for months locally)
 * Configured Parsoid to use api.svc.eqiad.wmnet directly rather than going through the Varnishes (51273)

Monday, December 9, 2013 @ 13:00-14:00 PST ✅

 * Fixes to HTML and Indent-Pre handling
 * Serialization improvements
 * Additional tweaks to the DOMDiff algorithm
 * Tweaks to newline separator handling to minimize dirty diffs
 * URL link parenthesis heuristic
 * Performance: Added API proxy configuration to bypass caching layers in front of Mediawiki API (Config change deployed, but proxy not yet enabled)
 * Changed default thumbnail size to 220px (matching WMF site defaults, bug 50523)
 * Add Wiktionary as /enwiktionary/, /dewiktionary/ etc (bug 58212)

Wednesday, December 4, 2013 @ 13:00-14:00 PST ✅

 * Fix for crasher that was filling up production log
 * Enable gzip compression support
 * Handle page names starting with a slash
 * Serialize new headings with spaces around '=' char
 * Initial support for time/data/mark HTML5 elts
 * ISBN links now assigned mw:ExtLink type to conform to Parsoid DOM Spec
 * A bunch of fixes to the selective serializer
 * Improvements to DOMDiff algorithm
 * Bug-fixes in nowiki escaping before/after linktrails/prefixes
 * Parse attributes in a case-insensitive manner
 * A bunch of other assorted fixes

Wednesday, November 20, 2013 @ 13:00-14:00 PST ✅

 * Correctly serialize magic words added on client
 * Various DSR fixes (suppress spurious warnings, fix errors)
 * Bug parsing indent-pres following a html-pre

Monday, November 18, 2013 @ 13:00-14:00 PST ✅

 * Fix for (eliminates whitespace diffs on frwiki on template edits)
 * Fix for (incorrect use of TSR while detecting stray closing tags)
 * Fix for serialization of new categories
 * API fixes
 * Improved error handling

Wednesday, November 13, 2013 @ 13:00–14:00 PST: ✅

 * Improvements to image option parsing, DOM diffing and Wikitext escaping

Thursday, November 7, 2013: ✅

 * DOM spec clean-up (delayed deploy for VE compat)
 * A lot of fixes and performance improvements