User talk:Svemir Brkic

From MediaWiki.org
Jump to: navigation, search
An archive box Archives 

Old talks

Contents

[edit] Sphinx on ProyectoFedora.org

Svemir, I implemented Sphinx on ProyectoFedora.org/wiki which is the spanish Fedora Project wiki. I've been modifying the way it displays the search page and results, and wanted to share these back with you. You can look at the changes here: http://proyectofedora.org/wiki/Especial:SphinxSearch

Thanks for your work! --Nushio 03:58, 8 July 2009 (UTC)

Cool, thanks! Feel free to add a link to your project on the extension pages, or I can do it later. Are there some changes or improvements you would like to contribute to the project? Svemir Brkic 12:50, 8 July 2009 (UTC)
Like I said, I made some visual changes, mainly to SphinxSearch_body.php to display the namespaces as in Wikipedia's search (Based on user feedback). I'm also starting to tweak the results page to display more relevant stuff, and moving things around, like formatting.

I'm still a n00b when it comes to hacking wikipedia extensions(and php, for that matter), so most of the code is pretty "dirty", but functional. But of course I'd love to contribute back to the project! --Nushio 22:15, 8 July 2009 (UTC)

[edit] Reshuffle the SphinxSearch documentation

The talk page does contain some useful information which are independent of any SphinxSearch release, so it might be a good idea to strip some of those topics, group them and create a subpage. Topics that seems relevant:

People always wonder about the differences of MW's search solutions, so on the main the page, a comparison matrix could describe those features in comparison (syntax support, performance, signle unique feature etc.).

Priority on the ToDo list would help people to see what comes next, I mean SPH_SORT_EXTENDED mode by @relevance and by number of times the page would be a killer and single out any feature against the other search engines available. PDF indexing could an item too as proposed by [1]

[edit] Sphinx, text excerpts, xmlpipe2 and pdf/djvu indexing

We found a work-around and made a proof-of-concept that allows to convert pdf/dvju files into a text otuput which than can be transformed into sphinx xmlpipe2 type like xml. Indexing through xmlpipe2 to generated content, merging with the main index all goes quite smooth, the only issue is that while searching for a term SphinxSearch would point to the correct NS_IMAGE page ID in MediaWiki (and display the correct file) but of course since no content information is stored in MediaWiki to this file, search result can't display any text excerpts.

What we thought was to try to store those related text information in another database table independent from MW but with the necessary field identifier. The question now is where in SphinxMWSearch could we have a chance to re-read missing text excerpt so that the result display would behave as if the text would come directly from MediaWiki's text table.

We found a database select in line 344 (SphinxMWSearch.php, 0.8.5) which only fetches information from the page table but we could not pinpoint the location for when/where the text excerpts are actually fetched. Help would be much appreciated --MWJames 09:15, 23 December 2011 (UTC)

That happens in the core SearchResult class, method initText in include/search/SearchEngine.php Svemir Brkic 21:50, 23 December 2011 (UTC)

[edit] Category filter

Hello, I have installed the SphinxSearch extension. Can you tell me how to make a category filter like the one that is used in New World Encyclopedia? Thank you. --GnuDoyng 00:47, 27 December 2011 (UTC)

That functionality has been deprecated. Current (SVN trunk) version of the extension supports "intitle:", "incategory:", "prefix:", and other advanced Wikipedia search techniques described here and it also supports extended sphinx search syntax. --Svemir Brkic 15:09, 27 December 2011 (UTC)
Personal tools
Namespaces
Variants
Actions
Site
Support
Download
Development
Communication
Print/export
Toolbox