User:LouisDang

Hi I'm Louis Dang and I'm volunteering with the Analytics team. I will use this page to index my work and reference material for easy access for myself and others.

My Work

 * https://github.com/louisdang/kraken/tree/master/src/org/wikimedia/analytics/kraken/pig Pig UDFs for regular expressions matching and simple IPv4 and IPv6 address validation.
 * https://github.com/louisdang/kraken/blob/master/src/pig/combinedLog_geocode_and_group_by_country_and_date.pig Script for Apache Combined Logs geographical and month/date aggregation.

Pig

 * http://pig.apache.org/docs/r0.10.0/ Pig 0.10 Documentation
 * https://cwiki.apache.org/confluence/display/PIG/Index Pig Wiki
 * http://www.cloudera.com/blog/2009/06/analyzing-apache-logs-with-pig/ Cloudera tutorial on geocoding with Pig

Oozie

 * http://nosql.mypopescu.com/post/8436633131/a-detailed-guide-to-oozie InfoQ guides on Oozie.