Thread:Project:Support desk/How to detect content (or non-content) categories from wiki dump files?

Here is some description of non-content categories: https://en.wikipedia.org/wiki/Wikipedia:Categorization#Wikipedia_administrative_categories.

For most categories probably its enough to parse template part in xml category dump to see eg. "" but for non-content category: https://en.wikipedia.org/wiki/Category:Articles_that_mention_track_gauge_2286_mm, I can see only: "", for which case probably I should parse template page "Track gauge"? What if there is a chain of template files?

I can see also, there are additionally possible some magic words which doesn't make the task simpler. https://en.wikipedia.org/wiki/Help:Magic_words_for_beginners#Categories_and_Indexing

Any ideas?