Analytics/Data Processing

Vision
Have a big data processing platform to produce reports and metrics and facilitate research while complying with the privacy policy and expectations of the MediaWiki community.

Roadmap

 * 1) Use Kafkatee in lieu of udp2log (eventually decommissioning udp2log)
 * 2) * Benefit: get rid of packet loss problems for consumers of data from udp2log
 * 3) Total PageView Prototype (top level metrics on total page views)
 * 4) * Benefits: 2014-15 Q1 goal - produce metrics for executives
 * 5) * gain experience towards fully implementing ETL and page view counting
 * 6) Replace Webstats Collector
 * 7) * Benefits: implement robust (scalable & without packet loss) system to feed web stats scripts.
 * 8) Fully Dimensionned PageViews and ImageViews
 * 9) * Benefits: for executives and community.
 * 10) * Replaces reliance on Comscore data
 * 11) Wikipedia Zero filtering & Page Views
 * 12) * Benefits: reliable & robust generation of data for WP Zero reports. Replace temporary solution in use.