Analytics/Data Processing

Vision
Provide a big data processing platform that produces reports and metrics and facilitates research, while complying with the privacy policy and the expectations of the MediaWiki community.

Roadmap

 * 1) Use Kafkatee in lieu of udp2log (towards eventually decommissioning udp2log)
 * 2) Total PageView Prototype (top-level metrics on total pageviews)
 * 3) Replace Webstats Collector
 * 4) Fully Dimensioned PageViews and ImageViews
 * 5) Wikipedia Zero filtering & Page Views

Project
Move data into Hadoop for research and reporting.

This project includes *Refinery* (formerly known as Analytics/Kraken) and Kafka.