Analytics/Wikistats/DumpReports/Future per report

Work in progress

March 2016: This page provides an overview of existing Wikistats reports that focus on Wikimedia content and content creators (better known to insiders as Wikistats' dump-based reports), and seeks your input on which reports are most valuable to you. With your help the WMF Analytics Team can determine which reports should be migrated or replaced first, which later, and which not at all.

Please add your signature (three tildes, if logged in) to the reports you want to keep in some form in a new setup.

Reports per wiki
For more than 800 Wikimedia wikis there is a dedicated page with monthly counts on content and content creators. Arguably, for many wikis some of these metrics are vital for assessing the health of that wiki's editing community. But the presentation is overcrowded, static, and somewhat disorganized. Broadly speaking, these tables fall into two categories: 1) focus on content, and 2) focus on contributors. The first table on the page (also the oldest) is a hybrid of the two.

Main monthly trends, and quarterly rankings (A)
The oldest Wikistats report, with several presentation layers. As noted above, it is a hybrid between content and content contributors.
 * Year-over-year (YoY) for recent months
 * Absolute values for every month (or every first month of the quarter)
 * Rankings within this project (e.g. Wikipedia), with tiny wikis filtered
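
To make the year-over-year comparison concrete, here is a minimal sketch in Python. The function name `yoy_change`, the "YYYY-MM" key format, and the sample counts are all assumptions made for illustration; Wikistats itself may compute and present this differently.

```python
def yoy_change(monthly, month):
    """Relative change between `month` and the same month one year earlier.

    `monthly` maps "YYYY-MM" strings to counts (hypothetical format).
    Returns None when either month is missing or the earlier count is zero.
    """
    year, mon = month.split("-")
    previous = f"{int(year) - 1:04d}-{mon}"
    if month not in monthly or not monthly.get(previous):
        return None
    return (monthly[month] - monthly[previous]) / monthly[previous]


# Made-up active-editor counts, for illustration only.
active_editors = {"2015-02": 400, "2016-01": 450, "2016-02": 440}

print(yoy_change(active_editors, "2016-02"))  # 440 vs 400 a year earlier
print(yoy_change(active_editors, "2016-01"))  # None: no data for 2015-01
```

A month-over-month figure would follow the same pattern, comparing against the preceding month instead of the same month in the preceding year.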

Note: some metrics in this first section are not up to date for large Wikipedias, because the run time of data collection became an obstacle.


 * Keep
 * 1) Erik Zachte (talk) Oldest Wikistats table. Some metrics are very often referred to, others just occasionally. The table is overly complex. I suggest redesign, strip down to essentials, or make it dynamic, where the user can specify which metrics to present, and in which way.

Breakdown of editors by activity level per month (B)

 * Keep
 * 1) Erik Zachte (talk) Combine with chart version (Summary Report, see below).

Breakdown of editors by activity level for all time (C)

 * Keep
 * 1) Erik Zachte (talk) Occasionally very useful, to show how relatively few people do most editing.

Most prolific recently active editors (D)

 * Keep
 * 1) Erik Zachte (talk) Rebuild as dynamic report where user can choose to show active/sleeping users and bots (reports D, E and F) in one table.

Most prolific recently absent editors (E)

 * Keep

Most prolific bots (F)

 * Keep

Most prolific anonymous editors (~ IP addresses) (G)
Currently out of order.
 * Revive

Breakdown of articles by size (H)

 * Keep
 * Drop
 * 1) Erik Zachte (talk)

Article count per namespace (I)

 * Keep
 * 1) Erik Zachte (talk)

Most edited articles (J)
Currently out of order.
 * Revive
 * 1) Erik Zachte (talk)

Articles with most contributors (aka ZeitGeist) (K)

 * Keep
 * 1) Erik Zachte (talk) Best section on this page to bring some 'color' to the wiki. Note how this is not about most edited articles, but articles which have most contributors.
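
The difference between "most edited" and "most contributors" can be sketched as follows. This is only an illustration: the function `rank_articles`, the (title, editor) input format, and the sample data are hypothetical and do not reflect how Wikistats processes dumps.

```python
from collections import defaultdict


def rank_articles(revisions):
    """Rank articles by edit count and by number of distinct contributors.

    `revisions` is an iterable of (article_title, editor_name) pairs,
    one pair per edit (hypothetical input format).
    """
    edits = defaultdict(int)
    contributors = defaultdict(set)
    for title, editor in revisions:
        edits[title] += 1
        contributors[title].add(editor)
    # Sort descending by count, breaking ties alphabetically by title.
    by_edits = sorted(edits, key=lambda t: (-edits[t], t))
    by_contributors = sorted(
        contributors, key=lambda t: (-len(contributors[t]), t)
    )
    return by_edits, by_contributors


# Made-up edit history: "Alpha" has more edits, "Beta" has more editors.
revisions = [
    ("Alpha", "Ann"), ("Alpha", "Ann"), ("Alpha", "Ann"),
    ("Beta", "Ann"), ("Beta", "Bob"),
]
by_edits, by_contributors = rank_articles(revisions)
print(by_edits)         # ranking for a "most edited" report
print(by_contributors)  # ranking for a "most contributors" report
```

In this toy data the two rankings disagree, which is the point of the distinction: one persistent editor can push an article up the first list but not the second.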

Summary reports
Some key metrics, with month-over-month (MoM) and year-over-year (YoY) comparisons, but mostly charts.

Scope:
 * A set of metrics, for one wiki (e.g. Commons)
 * A set of metrics, for all wikis in one project combined (e.g. Wikivoyage, see first table)
 * One metric, across all projects (e.g. Active Wikis Per Project)




 * Keep
 * 1) Erik Zachte (talk) (I would love to see a mobile version)

Bar charts per wiki
These metrics correspond 1:1 to the columns in the first table above (the hybrid table): Main monthly trends, and quarterly rankings (A).

These charts, with one bar per month, have become unwieldy and span several screens, even on a large monitor.




 * Keep
 * Drop
 * 1) Erik Zachte (talk) Either drop, or make more compact. Quarterly (Jan/Apr/Jul/Oct) or half-yearly samples (or averages) could still work.
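
As a rough illustration of the quarterly options mentioned above, the following Python sketch reduces a monthly series either to quarterly samples (Jan/Apr/Jul/Oct) or to quarterly averages. The function names, the "YYYY-MM" key format, and the numbers are assumptions for illustration only, not part of Wikistats.

```python
from collections import defaultdict

QUARTER_STARTS = {"01", "04", "07", "10"}  # Jan, Apr, Jul, Oct


def quarterly_samples(monthly):
    """Keep only the first month of each quarter."""
    return {
        month: value
        for month, value in sorted(monthly.items())
        if month[5:7] in QUARTER_STARTS
    }


def quarterly_averages(monthly):
    """Average the available months within each calendar quarter."""
    buckets = defaultdict(list)
    for month, value in monthly.items():
        year, mon = month.split("-")
        quarter = (int(mon) - 1) // 3 + 1
        buckets[f"{year}-Q{quarter}"].append(value)
    return {q: sum(v) / len(v) for q, v in sorted(buckets.items())}


# Made-up monthly counts, for illustration only.
monthly = {"2016-01": 90, "2016-02": 120, "2016-03": 150, "2016-04": 100}

print(quarterly_samples(monthly))
print(quarterly_averages(monthly))
```

Either reduction cuts the number of bars to roughly a third; sampling keeps real observed values, while averaging smooths out single-month spikes.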