User:LouisDang

Hi I'm Louis Dang and I'm volunteering with the Analytics team. I will use this page to index my work and reference material for easy access for myself and others.

My Work

 * https://github.com/louisdang/kraken/tree/master/src/org/wikimedia/analytics/kraken/pig Pig UDFs for regular expressions matching and simple IPv4 and IPv6 address validation.
 * https://github.com/louisdang/kraken/blob/master/src/pig/lib.pig Pig macro library for date conversion and geocoding
 * https://github.com/louisdang/kraken/blob/master/src/pig/geocode_and_group_by_date.pig Example script for using the macro library
 * https://github.com/louisdang/kraken/blob/master/src/org/wikimedia/analytics/kraken/pig/ParseWikiUrl.java Parse Wiki URL UDF.

Pig

 * http://pig.apache.org/docs/r0.10.0/ Pig 0.10 Documentation
 * https://cwiki.apache.org/confluence/display/PIG/Index Pig Wiki
 * http://www.cloudera.com/blog/2009/06/analyzing-apache-logs-with-pig/ Cloudera tutorial on geocoding with Pig

Oozie

 * http://nosql.mypopescu.com/post/8436633131/a-detailed-guide-to-oozie InfoQ guides on Oozie.

Misc

 * https://docs.google.com/folder/d/0B1unTxaXLQeARGhBTjlySDlUVFk/edit Useful research papers.
 * http://meta.wikimedia.org/wiki/User:Stu/comScore_data_on_Wikimedia comScore data analysis by user Stu.