Topic on Help talk:CirrusSearch

CirrusSearch search in text files

9
PushpendraJadaun12 (talkcontribs)

I use documentation versioning to show pages based on different versions.So some text files are not being stored in database is there any way by which I can search text file's content using cirrussearch extension as it searches only data present in database. Please let me know the way to do it.

DTankersley (WMF) (talkcontribs)

Can you give an explicit sample url that you're using?

Thanks!

PushpendraJadaun12 (talkcontribs)

@DTankersley (WMF) Sorry the question has been modified with some more detail. Please have a look at it.

DCausse (WMF) (talkcontribs)

CirrusSearch can search file content as long as a MediaHandler extension supporting the file types you want is installed. As far as I know only Pdf are currently supported this way, this is handled by Extension:PdfHandler.

Ksfield (talkcontribs)

@DCausse (WMF) can you elaborate on what Extension:PdfHandler is doing that allows for the pdfs to be indexed by CirrusSearch? I am interested in add capability for other types of files to be indexed but am having a hard time figuring out what is needed. Thanks!

DCausse (WMF) (talkcontribs)

You just need to install CirrusSearch and PdfHandler to be able to index and search pdf and djvu content (please see their respective documentations). Other types of files are not supported (MS office/libreoffice docs) are not supported as far as I know.

PushpendraJadaun12 (talkcontribs)

Below mentioned commands are supposed to rebuild the search index from scratch: 1) php updateSearchIndexConfig.php --startOver 2) php forceSearchIndex.php

But it seems like they doesn't work either as I have tried them so they index my text files and let me allow to search in text file content but though these commands run successfully it doesn't search.

Note : When I add a new text file and search its content then new file's content get displayed in search results.

Please let me know if anyone can help me out as soon as possible.

DCausse (WMF) (talkcontribs)

Please read the documentation at Extension:PdfHandler esp. the "Debugging" part. You may have to run other commands before rebuilding the index from scratch.

PushpendraJadaun12 (talkcontribs)

@DCausse (WMF) My search is on txt files.Can you please let me know whether Extension:PdfHandler will work for search in txt files ?

Reply to "CirrusSearch search in text files"