Topic on Project:Support desk

MarcoAurelio (talkcontribs)

Hello. This relates to cleanupSpam.php (Manual:cleanupSpam.php). Am I right assuming that if we replace DB_REPLICA with DB_MASTER the sysadmin using the extension will get more actual results? I asked because trying to run this script I tried to get some spam deleted but the script said that "no pages found using <domain>" while those pages existed. Thanks.

Ciencia Al Poder (talkcontribs)

This would only be needed if the slaves are very lagged and didn't get the updated links. Otherwise, the script may be broken.

Assuming it uses the externallinks table, if it doesn't work, may be because the links haven't been populated on the table (the job queue is broken or has a long delay), or something has changed and the script no longer works.

MarcoAurelio (talkcontribs)

Thanks @Ciencia Al Poder - I was trying to do that on the Beta Cluster. I've been running the script regularly without issues until yesterday. Any hints where I should be looking to see if it is a JobQueue/DB issue? Beta has been a bit rough these days IIRC. Thanks.

Ciencia Al Poder (talkcontribs)

You can query the API:Extlinks to see if one of the pages you know it shoud target, has the link registered in the externallinks table. If not, it's a problem with the job queue or the parser.

If the link is present, then something must be wrong with the cleanupSpam script.

MarcoAurelio (talkcontribs)

Further investigation showed that it was an issue with the script not recognizing all URL protocols but only http://. A patch has been submitted to fix this. Thanks.