Analytics/Kraken/Mediawiki to Hive

=Background=
Kraken will allow you to import data from the MySQL databases. However, not all MySQL datatypes are natively supported by Hive (which provides the SQL-like language used to query the data). I have added a table with a suggested mapping between MySQL datatypes and Hive datatypes. If the 'requires feedback' column says True, that is a conversion I would particularly like feedback on. Also, please let me know which five tables you use most often.

=Suggested datatype mapping for the Revision table=
 +------------------+----------------+-------------+----------------+---------------+----------------+-------------------+
 | canonical_column | column         | mysql_table | mysql_datatype | hive_datatype | native mapping | requires feedback |
 +==================+================+=============+================+===============+================+===================+
 | text_id          | rev_text_id    | revision    | int            | int           | True           |                   |
 | comment          | rev_comment    | revision    | varbinary      | string        | False          | True              |
 | sha1             | rev_sha1       | revision    | varbinary      | string        | False          | True              |
 | parent_id        | rev_parent_id  | revision    | int            | int           | True           |                   |
 | user_text        | rev_user_text  | revision    | varbinary      | string        | False          | True              |
 | timestamp        | rev_timestamp  | revision    | varbinary      | timestamp     | False          | True              |
 | deleted          | rev_deleted    | revision    | tinyint        | smallint      | False          | True              |
 | len              | rev_len        | revision    | int            | int           | True           |                   |
 | page             | rev_page       | revision    | int            | int           | True           |                   |
 | minor_edit       | rev_minor_edit | revision    | tinyint        | smallint      | False          | True              |
 | user             | rev_user       | revision    | int            | int           | True           |                   |
 | id               | rev_id         | revision    | int            | int           | True           |                   |
 +------------------+----------------+-------------+----------------+---------------+----------------+-------------------+
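
To make the suggested mapping easier to comment on, here is a minimal Python sketch that turns the table above into a Hive CREATE TABLE statement. It is an illustration only, not the actual Kraken import code. The names REVISION_MAPPING, hive_create_table and mediawiki_timestamp_to_hive are hypothetical, as is the choice of canonical_column as the Hive column name. The timestamp helper assumes that rev_timestamp holds MediaWiki's 14-digit YYYYMMDDHHMMSS form and that the target is Hive's 'yyyy-MM-dd HH:mm:ss' text representation.

```python
from datetime import datetime

# Suggested mapping for the revision table, copied from the table above.
# Each entry: (canonical_column, mysql_column, mysql_datatype, hive_datatype)
REVISION_MAPPING = [
    ("text_id", "rev_text_id", "int", "int"),
    ("comment", "rev_comment", "varbinary", "string"),
    ("sha1", "rev_sha1", "varbinary", "string"),
    ("parent_id", "rev_parent_id", "int", "int"),
    ("user_text", "rev_user_text", "varbinary", "string"),
    ("timestamp", "rev_timestamp", "varbinary", "timestamp"),
    ("deleted", "rev_deleted", "tinyint", "smallint"),
    ("len", "rev_len", "int", "int"),
    ("page", "rev_page", "int", "int"),
    ("minor_edit", "rev_minor_edit", "tinyint", "smallint"),
    ("user", "rev_user", "int", "int"),
    ("id", "rev_id", "int", "int"),
]


def hive_create_table(table_name, mapping):
    """Build a Hive CREATE TABLE statement from the mapping (hypothetical helper)."""
    # Column names are backtick-quoted because some of them (for example
    # timestamp) may clash with Hive keywords.
    columns = ",\n".join(
        "  `{}` {}".format(canonical, hive_type.upper())
        for canonical, _mysql_column, _mysql_type, hive_type in mapping
    )
    return "CREATE TABLE {} (\n{}\n)".format(table_name, columns)


def mediawiki_timestamp_to_hive(raw):
    """Convert a 14-digit YYYYMMDDHHMMSS value to 'yyyy-MM-dd HH:mm:ss'."""
    if isinstance(raw, bytes):  # varbinary values may arrive as bytes
        raw = raw.decode("ascii")
    return datetime.strptime(raw, "%Y%m%d%H%M%S").strftime("%Y-%m-%d %H:%M:%S")


if __name__ == "__main__":
    print(hive_create_table("revision", REVISION_MAPPING))
    print(mediawiki_timestamp_to_hive(b"20130115093000"))
```

The rows marked 'requires feedback' are the ones where the sketch has to make a choice, such as decoding varbinary values to strings or widening tinyint to smallint, so those are the lines to change if the suggested mapping is revised.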