Parsoid/Parser Unification/Confidence Framework/Reports

From mediawiki.org

Questions to answer for report assessment[edit]

  • What’s the visual diff score? What will make us confident to deploy based on this score?
  • Are there specific extensions critical to Wikitech DiscussionTools that are not yet supported?
  • Are there missing functionalities in Parsoid that will impact this deployment?
  • What’s the rollback plan in case needed?
  • What would be the main concern and reason to postpone this deployment?
  • What’s your proposed confidence score?
    • Not confident at all
    • Not confident
    • Somewhat confident
    • Confident
    • Extremely confident

Deployment Readiness for Wikitech DiscussionTools[edit]

Visual Difference Score Assessment[edit]

Current Status:[edit]

  • Pixel Perfect Rendering: Achieved for 95.7% of approximately 6250 talk pages tested.
  • Minor Issues: 99.9% of pages show pixel perfect or only minor vertical whitespace shifts.
  • Known Differences: 0.1% of pages have known differences (not planned for fixing).
  • Critical Differences: None.

Considerations for Deployment:[edit]

  • Reliability: The high percentage of pixel-perfect rendering (98.55%) is a strong indicator of the system’s reliability.
  • Pending Reassessment: None
  • Blocker Evaluation: None

Extension and Functionality Support[edit]

Critical Extensions:[edit]

  • Assumption: Extensions critical for Wikitech DiscussionTools are presumed to be identified in the visual diff process.

Functionality Concerns:[edit]

  • Login vs. Logout Issue: None
  • Legacy Parser Compatibility: A strategy is needed to ensure that deployment of the new system does not disrupt the recording of metadata by the legacy parser.

Rollback Strategy[edit]

  • Procedure: A full rollback is planned if any significant issues are observed post-deployment.

Main Concerns and Potential Delay[edit]

  • Current Stance: As of now, there are no major concerns that would necessitate postponing the deployment.
  • Dependent Factors: None.

Proposed Confidence Score for Deployment[edit]

  • Score: Confidence in the rollout is extremely high.