Topic on Talk:Growth/Feature summary

Czar (talkcontribs)

>3,262 mentor questions have been asked as of March 2020

Is this dataset publicly available somewhere?

MMiller (WMF) (talkcontribs)

@Czar -- I think you're asking for the list of questions so you can see what they say and get a sense for common questions. If you're willing to use Google Translate (or if you read other languages!) an easy thing to do is to go to the Recent Changes feed on one of the wikis that have the feature and filter to the "mentorship module question" edit tag. Here's a link where I've done that in French Wikipedia. You could also look in any of these Wikipedias: Czech, Korean, Vietnamese, Arabic, Ukrainian, Armenian, Serbian, Hungarian, and Basque.

I'll also tag our Arabic ambassador, @Dyolf77 (WMF), who once read through and categorized all the Arabic questions. He may have some interesting counts for you.

Dyolf77 (WMF) (talkcontribs)

@MMiller (WMF) Thanks for bringing this point about questions from newcomers. @Czar Hi, I analysed more than 1300 questions from Arabic contributors and made this table. I found that the autobiography/biography question had the biggest number. Followed by nonsense questions and in the 3rd position, people just saying "Hello".

Czar (talkcontribs)

@Dyolf77 (WMF) Thanks! Is there a Quarry query I could use to run this again for another language's wiki? Also didn't know about the googletranslate function in Sheets—very nice. Isn't "general questions" the biggest bucket, even before "biography" questions?

Dyolf77 (WMF) (talkcontribs)

@Czar Good point, maybe it is a bad label from me, I meant by "General questions", other different questions with various topics. Biography is the most asked question, still. So in general questions, newcomers asked about editing, sources, reviewing edits, deleted articles etc. About the table, it was the work of @Martin Urbanec (WMF), he can tell about the code.

Martin Urbanec (WMF) (talkcontribs)

@Czar Hello, thanks for your question! I was the one who made the data for Dyolf77. I did that by running https://gist.github.com/urbanecm/ec8d74604f584d2272edaf92a3a3711f. It should be pretty easy to run it elsewhere, but toolforge access is needed to be able to do so without changing the script significantly. If you don't have it, I can generate the dataset for a different wiki for you. As Marshall stated. the questions are available as MediaWiki revision, my script merely puts that into a simple table for easier analysis. Hope this helps,

Reply to "Question data"