Topic on Help talk:CirrusSearch

Saharma (talkcontribs)

I have followed the installation procedure. Searching returns the files I have uploaded; however, I am still unable to search inside the uploaded documents themselves.

TheDJ (talkcontribs)

What kind of documents? I think we only support searching inside DjVu and PDF files. As far as I'm aware, other document types don't have media handlers that can extract text from the documents for search indexing.

Saharma (talkcontribs)

A PDF file, generated from a Word document. This is the output from ?action=cirrusDump:

Note that "file_text": false


[
  {
    "_index": "my_wiki_general_first",
    "_type": "page",
    "_id": "5",
    "_version": [],
    "_source": {
      "version": 9,
      "wiki": "my_wiki",
      "namespace": 6,
      "namespace_text": "File",
      "title": "Pdf doc.pdf",
      "timestamp": "2019-08-15T12:16:24Z",
      "create_timestamp": "2019-08-14T07:25:58Z",
      "category": [],
      "external_link": [],
      "outgoing_link": [],
      "template": [],
      "text": "An invalid user was specified to permission testing to embed this PDF. This is a PDF Document.",
      "source_text": "<pdf>File: Pdf doc.pdf</pdf>\n\nThis is a PDF Document.",
      "text_bytes": 53,
      "content_model": "wikitext",
      "language": "en",
      "heading": [],
      "opening_text": null,
      "auxiliary_text": [],
      "defaultsort": false,
      "file_text": false,
      "file_media_type": "OFFICE",
      "file_mime": "application/pdf",
      "file_size": 6499,
      "file_width": 0,
      "file_height": 0,
      "file_bits": 0,
      "file_resolution": 0,
      "display_title": null,
      "redirect": [],
      "incoming_links": 0
    }
  }
]
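
For anyone who wants to check several files at once, here is a rough Python sketch that scans a saved cirrusDump result and lists the File pages with no extracted text. The function name pages_missing_file_text is made up for illustration, and the sample string is a trimmed copy of the dump above. It assumes that a successfully indexed file carries a non-empty string in "file_text", which I have not confirmed against the CirrusSearch source.

```python
import json


def pages_missing_file_text(dump_json):
    """Return titles of File-namespace pages whose dump shows no extracted text.

    Hypothetical helper: assumes "file_text" holds a non-empty string
    when text extraction worked, and false/null/"" otherwise.
    """
    docs = json.loads(dump_json)
    missing = []
    for doc in docs:
        source = doc.get("_source", {})
        # Namespace 6 is the File namespace, as shown in the dump.
        if source.get("namespace") != 6:
            continue
        # Treat false, null, or an empty string as "nothing extracted".
        if not source.get("file_text"):
            missing.append(source.get("title"))
    return missing


# Trimmed copy of the dump posted in this thread.
sample = (
    '[{"_id": "5", "_source": {"namespace": 6, "title": "Pdf doc.pdf", '
    '"file_text": false, "file_mime": "application/pdf"}}]'
)
print(pages_missing_file_text(sample))  # ['Pdf doc.pdf']
```

This only reports what the index contains; it does not say why extraction failed.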

TheDJ (talkcontribs)

When I get to a desktop, I'll check whether this requires setting a specific config variable or something.

Reply to "Search in a document"