User talk:Legoktm/parsoid re-parse

Update this project?
@Legoktm @SSastry (WMF) Hello.

We are using this script to refresh Lint errors on our own wikis (since this is the only way to do so). However, this script lacks functionality.

I found this page via Google, and it says that WMF is internally using a modified version of this script. Is it possible to make it public or publish it to GitHub & pip so that people can contribute?

Thanks. TripleCamera2022 (talk) 08:30, 20 October 2023 (UTC)


 * Known issues:
 * How to run this script? My current approach is  and then  . I don't think this is the right way.
 * Cannot change number of threads in arguments (32 is too big).
 * Parsoid URL (in ) suits WMF wikis, but not non-WMF wikis.
 * Cannot be interrupted by Ctrl + C.
 * I don't know if they are fixed in the internal version, I'm just listing them here.
 * TripleCamera2022 (talk) 16:32, 24 October 2023 (UTC) (edited)
 * I suspect this script comes from the Tidy days, so my memory is faint since we no longer use this. But, can you post a link to the script (I imagine you found this in some repo -- Linter / core / Parsoid?) and I can then better offer suggestions. SSastry (WMF) (talk) 20:03, 23 October 2023 (UTC)
 * OK. The source code is at https://git.legoktm.com/legoktm/parsoid-reparse. There are also a few related links:
 * Extension:Linter, which mentioned the phabricator task.
 * T161556, which mentioned the source code repository.
 * User:Legoktm/parsoid_re-parse, which mentioned the source code repository.
 * By the way, may I ask why this script is no longer used? Is it because there is an alternative way to clear Parsoid cache? TripleCamera2022 (talk) 16:29, 24 October 2023 (UTC)
 * This script was used back in the day when we were still fine tuning linting code and linter categories and we needed to quickly populate all lints across an entire wiki (for all wikis on the cluster) without having to wait for the pages to be edited (and hence reparsed and relinted). Once the lints have been initialized, we don't need to do this again -- as pages are edited, lints get updated when the pages are reparsed by Parsoid. When we introduce new lints, we might want to reparse everything, but (a) that is very infrequent (b) we are happy to have the lint repopulate organically.
 * It is possible this may be needed some time in the future again, but we will cross that bridge if / when we get there. SSastry (WMF) (talk) 16:57, 24 October 2023 (UTC)
 * My friends have configured Linter recently, and they are using this script to refresh Lint errors. According to them, there are only two ways to clear Parsoid cache: one is to access, another is to edit a page using Visual Editor. Besides, they say running refreshLinks.php wouldn't help (as opposed to what Extension:Linter says). Is this true? TripleCamera2022 (talk) 08:42, 26 October 2023 (UTC)
 * A gentle ping~ TripleCamera2022 (talk) 16:43, 8 November 2023 (UTC)