Topic on Talk:Cross-wiki Search Result Improvements

How is the ranking of sister projects determined?

5
Summary by 197.218.81.79

The wiki blocks are ordered by recall (most to least number of articles returned from each project). . Large wikis are likely to be ordered first frequently. Concerning wikivoyage there's a small variation, a filter wikivoyage results on title.

Commons uses boosted-templates (https://phabricator.wikimedia.org/T163223).

Tbayer (WMF) (talkcontribs)

I just tried out searching for some popular travel locations, and Wikivoyage came out last or next to last (i.e. below the fold) in each of these examples:

(Examples are not cherry-picked, these were just the first few that that came to my mind.)

197.218.81.178 (talkcontribs)
MPopov (WMF) (talkcontribs)

Projects should – in theory – be ordered according to recall (most to least number of articles returned from each project). And this is mostly true if you open each sister project's search results page separately.

Looking at InterwikiSearchResultSetWidget in MediaWiki Core, it does not appear there is explicit front-end code for ordering the projects when the SERP is rendered and the order is determined by Cirrus (IIRC) on the back-end when it returns the interwiki results – which should be according to recall.

Looking at InterwikiSearcher.php in Cirrus source, we still have code from the first cross-wiki search A/B test where results can be returned in a random order if the configuration requests it, but they should be returned according to recall in production. Although maybe we're accidentally still using the static order that the switch statement defaults to. I'll reach out to @EBernhardson (WMF) and @DCausse (WMF) for clarification.

DCausse (WMF) (talkcontribs)

Absolutely, the wiki blocks are ordered by recall. Large wikis are likely to be ordered first frequently. Concerning wikivoyage there's a small variation. During the RFC it was requested to strongly filter wikivoyage results on title. Today we ensure that 80% of the search terms (stop-words excluded) appears in a title for wikivoyage results. In other words it decreases recall for wikivoyage and probably one of the reason you feel that wikivoyage is ranked so badly. Without the title filter wikivoyage would be ranked #3 (just below wiktionary) for the query Alaska.

197.218.81.79 (talkcontribs)
Reply to "How is the ranking of sister projects determined?"