Talk:Parsoid/Todo

In this page we track and report Parsoid parsing, round-tripping or serialization issues. Problematic wikitext snippets can be added in Parsoid/Bug_test_cases for direct testing.

VE testing issues
[2:12:41 AM] Gabriel Wicke: something generates new paragraphs in the nowiki section without any edits in that area: https://www.mediawiki.org/w/index.php?title=VisualEditor%3ATest&diff=552860&oldid=552858

[2:25:10 AM] Gabriel Wicke: I believe that those paragraphs are inserted on the VE side, since they seem to round-trip ok: http://parsoid.wmflabs.org/_rt/mw:Parsoid/Test/VE

Likely solved once the change to nowiki as span is deployed and/or nowiki is handled in the VE -- Gabriel Wicke (GWicke) (talk) 13:37, 28 June 2012 (UTC)

Misc issues
* foo ** bar *** baz with master revision 33dc9abb0db364bb41ca0b06d368bde386719d6a. This is a problem with diffWords, which swallows newlines. diffChars works better, but it takes too long and uses too much memory. An alternative would be to use diffChars on "small" lines.
 * Try to emulate PHP parser in treating foo as foo (low priority)
 * search for 'listItem' in http://parsoid.wmflabs.org/_rt/Takeda%20clan.
 * SSS: This is a "syntax error" with mismatched ref tags in wikitext. The specific segment that crashes it is this:  . Note the error in <ref name="enc-shingen"/>. This ref tag should not be closed. This bug is similar to the previous one, where mismatched tags are usually handled by the Tidy post-processor. We need a general strategy for this. Here is the smallest test case to reproduce this:  boo yahoo 
 * Two issues reported in Thread:User talk:GWicke/Normalization of wiki text
 * Preserve sort order in category links: Thread:User talk:GWicke/Normalization of wiki text/reply (6)
 * Thread:User talk:GWicke/Normalization of wiki text/reply (7) is expected behavior for a yet-unhandled extension tag
 * The weather box in http://parsoid.wmflabs.org/Broken_Hill,_New_South_Wales is rendered incorrectly
 * Parser: Text not wrapped in <p> tags. Look at the HTML output for http://parsoid.wmflabs.org/_rt/mw:Parsoid/Todo  In several sections, text after headings in certain contexts appears bare. I haven't yet reduced this to a small test case.
 * Diffing bug: Try roundtrip diff on a page with content
 * Roundtripping of html attributes -- needs fixing
 * Anything in particular? Attributes on plain HTML tags seem to work fine. -- Gabriel Wicke (GWicke) (talk) 13:34, 28 June 2012 (UTC)
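
To illustrate the "diffChars on small lines" idea from the first item in this list, here is a minimal sketch. It is not the round-trip tester's code: it uses Python's standard `difflib` as a stand-in for the diffWords/diffChars functions mentioned above, and the function name `hybrid_diff` and the `small` threshold are hypothetical. The diff runs per line first, so newlines are never swallowed, and only short changed lines are refined character by character.

```python
import difflib

# Hypothetical cut-off: changed chunks longer than this are not refined.
SMALL_LINE_LIMIT = 200


def hybrid_diff(old, new, small=SMALL_LINE_LIMIT):
    """Return a list of (tag, old_text, new_text) changes.

    Lines are compared first (newlines kept), then replaced chunks that are
    short enough are compared again character by character.
    """
    old_lines = old.splitlines(keepends=True)
    new_lines = new.splitlines(keepends=True)
    changes = []
    line_matcher = difflib.SequenceMatcher(None, old_lines, new_lines, autojunk=False)
    for tag, i1, i2, j1, j2 in line_matcher.get_opcodes():
        if tag == "equal":
            continue
        old_chunk = "".join(old_lines[i1:i2])
        new_chunk = "".join(new_lines[j1:j2])
        if tag == "replace" and len(old_chunk) <= small and len(new_chunk) <= small:
            # Cheap enough: refine the changed chunk at character level.
            char_matcher = difflib.SequenceMatcher(None, old_chunk, new_chunk, autojunk=False)
            for ctag, a1, a2, b1, b2 in char_matcher.get_opcodes():
                if ctag != "equal":
                    changes.append((ctag, old_chunk[a1:a2], new_chunk[b1:b2]))
        else:
            # Pure insertions/deletions, or chunks too large to refine.
            changes.append((tag, old_chunk, new_chunk))
    return changes
```

With this approach an extra blank line between two list items shows up as a single inserted newline, while the cost of the character-level pass is bounded by the threshold.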

Issue on http://parsoid.wmflabs.org/_rt/pt:Foo
Is it possible to have an https or protocol-relative link for reporting bugs on this page? The address https://parsoid.wmflabs.org/_rt/pt:Foo doesn't seem to work. Helder 15:38, 8 June 2012 (UTC)


 * It certainly is possible, but not really our top priority right now. There is no authentication info involved, and all the content is public. -- Gabriel Wicke (GWicke) (talk) 22:17, 20 June 2012 (UTC)

Issue on http://parsoid.wmflabs.org/_rt/pt:HTML
The article [//pt.wikipedia.org/w/index.php?title=HTML&oldid=30591267 pt:HTML] uses the non-existent " " to exemplify the way HTML works, but the code There is no " " in HTML is converted back to something else: There is no " should produce something like   rather than marking the entire paragraph as template-generated.
 * This is pretty much what we intend to do: see Parsoid/HTML5_DOM_with_microdata. -- Gabriel Wicke (GWicke) (talk) 10:03, 20 June 2012 (UTC)


 * Add round-tripping of category links and the like, right now these are lost
 * Fix the newline-at-the-end-of-an-li-or-before-a-ul behavior such that
 * the parser doesn't output newlines before each  and before each  -within-a-
 * the serializer doesn't depend on these newlines to output correct wikitext
 * (newline handling in general is slated to be revamped but I wanted to document this case specifically because VE works around it)
 * Feature request: the first  inside an   should be ignored, whereas every subsequent   should be treated as if it had stx=html (the latter is already done). This means that   should be serialized to
 * This is because the parser doesn't wrap the text in a list item in a paragraph (i.e. the text is directly in the list item) whereas VE's linear model does wrap it in a paragraph because listItem nodes can't contain text directly. The HTML->linmod converter can deal with adding the paragraphs quite easily and cleanly, but removing these paragraphs in the linmod->HTML converter with the conditions being this specific is a pain (we currently do do this as a workaround, but it's ugly). So we can tolerate input that doesn't have wrapped first paragraphs, but Parsoid doesn't tolerate input that does have wrapped first paragraphs; it would make our lives easier if it did
 * -> Listed in the Parsoid/Todo. Will also be needed for table cells. -- Gabriel Wicke (GWicke) (talk) 10:03, 20 June 2012 (UTC)
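
A rough sketch of the first-paragraph handling requested above, for illustration only. It assumes the elements in question are a `<p>` directly inside an `<li>` (the tag names are inferred from the explanation, not quoted from it), uses Python's standard `xml.etree.ElementTree` rather than Parsoid's own DOM code, and the function name `unwrap_first_paragraph` is hypothetical. Only a leading paragraph is unwrapped; later paragraphs are left alone so they can be serialized as HTML-syntax paragraphs.

```python
import xml.etree.ElementTree as ET


def unwrap_first_paragraph(li):
    """If the first node inside <li> is a <p>, splice its content into the <li>."""
    # Leave the item alone if it starts with text or does not start with <p>.
    if (li.text or "").strip() or len(li) == 0 or li[0].tag != "p":
        return
    p = li[0]
    children = list(p)
    li.remove(p)
    li.text = p.text or ""
    for offset, child in enumerate(children):
        li.insert(offset, child)
    # Text that followed </p> must now follow the last spliced node.
    tail = p.tail or ""
    if children:
        children[-1].tail = (children[-1].tail or "") + tail
    else:
        li.text += tail
```

For example, applying it to an item parsed from `<li><p>foo</p><p>second</p></li>` leaves `<li>foo<p>second</p></li>`, which matches the unwrapped shape the parser already produces for the first paragraph.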

Issue with indented tables
Compare the results of the following on http://parsoid.wmflabs.org/_rtform/: {| class="wikitable" ! Wiki code ! Expected result :
 * {| border="1"
 * {| border="1"


 * a
 * b
 * c
 * d
 * }
 * }
 * }

This kind of "indented table" is used in some articles ([//pt.wikipedia.org/w/index.php?title=Sequ%C3%AAncia_principal&oldid=30969308&uselang=en#Dados_da_sequ.C3.AAncia_principal example] / round-trip). Helder 13:25, 28 June 2012 (UTC)


 * --Fixed in c5f99614 Ssastry (talk) 17:03, 30 July 2012 (UTC)

JSON in the rendered HTML
For some reason this test shows

in the HTML version... Helder 03:06, 8 July 2012 (UTC)


 * Fixed since then ;) Gabriel Wicke (GWicke) (talk) 01:28, 29 January 2013 (UTC)

Ordered lists inside of tables
See Bug 40274 and this test case. Helder 19:21, 15 September 2012 (UTC)

Parsoid server different from MediaWiki server
Which parameters do I have to change if the Parsoid server is on a different host from the MediaWiki host?

LocalSettings.php? VisualEditor.php? ...

Thanks.


 * You need to set  or similar if you want to use VisualEditor with your own Parsoid instance - see Extension:VisualEditor. Jdforrester (WMF) (talk) 17:59, 20 September 2012 (UTC)

Bolds and italics
This topic may be relevant: Helder 00:21, 6 October 2012 (UTC)
 * w:Wikipedia:Village pump (technical)

Issue on http://parsoid.wmflabs.org/_rt/pl/Zręczyce
So this is a non-English wiki and this might be an issue by itself, but there's something funny with the HTML of that location map. Just letting you know. Matma Rex (talk) 21:12, 20 January 2013 (UTC)


 * We made some stabs at this, but sadly it does not work 100% yet in that page. Needs further investigation. Gabriel Wicke (GWicke) (talk) 01:26, 29 January 2013 (UTC)

It cannot be displayed well in IE8
It cannot be displayed well in IE8. Why?
 * Parsoid HTML can be displayed in any browser. You might be referring to the VisualEditor, which has a FAQ at Help:VisualEditor/FAQ. --Gabriel Wicke (GWicke) (talk) 17:51, 25 August 2013 (UTC)

Add an "id" to the "h2" elements
Hi!

Could you add ids to the h2 elements at so that I can link specifically to the "Diff between original Wikitext (green) and round-tripped wikitext (red)" section? 17:37, 28 November 2013 (UTC)

A few various issues
Some of these might already be known. I found these looking at various articles and talk pages on enwiki: See http://parsoid.wmflabs.org/_rt/enwiki/Wikipedia:Sandbox?oldid=596441905 for examples. Jackmcbarn (talk) 04:14, 21 February 2014 (UTC)
 * A line that opens bold but doesn't close it gets it closed
 * The case of attribute names is lost, spaces in tags are lost, and the case of closing tags is assumed to be the same as that of opening ones
 * Double-quotes get used in all attributes
 * Interwiki link prefixes are changed and leading colon is removed
 * External links with weird nesting break
 * Spaces are added before the / in self-closing tags
 * Spaces are added to the beginning of table attribute lines
 * ISBN numbers with two spaces between the word and the number get condensed

Issue when parsoid tries to expand scary transclusions
I have scary interwiki transclusion enabled on my MediaWiki install. Parsoid and VisualEditor work for pages without transclusions like. I can supply logs if you need them. Jaeol (talk) 05:39, 4 May 2014 (UTC)

Issue on http://parsoid.wmflabs.org/_rt/: horizontal scrollbar
It would be better if the wikitext wasn't shown in a block element with, which leads to horizontal scrollbars when (as it usually does) the wikitext has long runs of text without newlines in them. It would be better to use something with  (see https://developer.mozilla.org/en-US/docs/Web/CSS/white-space), or perhaps even better, render it more like mediawiki renders diffs.

Also, the tool is actually at http://parsoid-lb.eqiad.wikimedia.org/_rt/ now; you should perhaps fix the "Report a parser issue" link? -- SamB (talk) 17:53, 11 April 2015 (UTC)

Issue on http://parsoid.wmflabs.org/_rtform/
We get this error on CentOS 6:
 [warning/api/etimedout][mediawikiwiki/Parsoid] Failed API request, {"error":{"code":"ETIMEDOUT"},"retries-remaining":1}
 [info][mediawikiwiki/Parsoid?oldid=1953689] started parsing
 [warning/empty/tr][mediawikiwiki/Parsoid?oldid=1953689] Template Wikimedia engineering project information produces stripped empty elements
 [info][mediawikiwiki/Parsoid?oldid=1953689] completed parsing in 10265 ms