Manual talk:File metadata handling

From mediawiki.org

I would highlight what the current article mentions: "some obvious downsides of not being searchable whatsoever." Indeed this is a downside: even when the metadata is rich and correct, there is no easy way to use it or find it. I guess a bot would have to be written to read this metadata and re-inject it as text in the media page. It would be great to have an option that enables indexing metadata, and also a bot that reads the metadata and writes it into the wiki.--Uwe a (talk) 11:50, 10 August 2015 (UTC)

@Uwe a: Nothing is stopping the search engine code from indexing the image metadata separately from the page (similar to how the OCR layer is handled for PDF files with search). I would view that as the proper fix to the issue. Bawolff (talk) 09:52, 12 August 2015 (UTC)
@Bawolff: Frankly, I do not understand your response. When we are talking about search, are we both talking about MediaWiki search?--Uwe a (talk) 22:53, 4 November 2015 (UTC)
@Uwe a: I was thinking about how Extension:CirrusSearch includes the text content of PDFs when doing a search (unfortunately, the default search engine for MediaWiki does not do this). There's no reason why MediaWiki's search engine couldn't be modified to include information beyond the contents of the page. Bawolff (talk) 23:00, 4 November 2015 (UTC)
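
For anyone who wants to try the bot workaround from the first comment in the meantime, here is a rough sketch in Python. It assumes the metadata is fetched through the Action API with `prop=imageinfo` and `iiprop=metadata`, and that each entry arrives as a `name`/`value` pair. The function names `build_query` and `metadata_to_wikitext` are made up for illustration, and the network request and the page edit are left out, so treat this as a starting point and not a working bot.

```python
def build_query(title):
    """Build Action API parameters requesting a file's metadata.

    Assumption: the wiki exposes prop=imageinfo with iiprop=metadata.
    """
    return {
        "action": "query",
        "titles": title,
        "prop": "imageinfo",
        "iiprop": "metadata",
        "format": "json",
    }


def metadata_to_wikitext(metadata):
    """Render name/value metadata entries as a wikitext bullet list.

    Assumption: `metadata` is a list of {"name": ..., "value": ...} dicts.
    Nested values (lists or dicts) and empty values are skipped.
    """
    lines = []
    for entry in metadata:
        name = entry.get("name")
        value = entry.get("value")
        if name is None or isinstance(value, (list, dict)):
            continue
        if value is None or value == "":
            continue
        lines.append(f"* {name}: {value}")
    return "\n".join(lines)
```

The resulting text could then be appended to the file description page, where the default search would pick it up like any other page text. Indexing the metadata in the search engine itself, as suggested above, would still be the cleaner fix.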