Wikimedia Product/Data dictionary/mediawiki_history_reduced

From mediawiki.org


This page describes the data set mediawiki_history_reduced that stores on Druid Datasources, which can be accessed via Superset/Turnilo. It's directly loaded from wmf.mediawiki_history_reduced on Hive. wmf.mediawiki_history_reduced on Hive is a reduced version of wmf.mediawiki_history.

Schema[edit]

Field name Data type Description Data example Source schema Source field
event_entity string event entity revision, user, page wmf.mediawiki_history event entity
event_type string create, move, delete, etc with specific digest types. Detailed explanation in the docs under #Event_types monthly_digest, create, etc. wmf.mediawiki_history event_type
events int Number of events 1 wmf.mediawiki_history COUNT(1)
other_tags array<string> Can contain: deleted (and deleted_day, deleted_month, deleted_year if deleted within the given time period), revetered and revert (for revisions), self_created (for users), user_first_24_hours if a revision is made during the first 24 hours of a user registration, redirect (for pages) ["redirect"], ["self_created"] wmf.mediawiki_history page_is_redirect,user_is_created_by_self
page_namespace int The page namespace of the event 0 wmf.mediawiki_history page_namespace
page_title string The page_title of the event, prefixed with canonical namespace if any Galileo Galilei wmf.mediawiki_history page_title, page_title_historical
page_type string content or non_content based on namespace being in content space or not content wmf.mediawiki_history page_namespace_is_content, page_namespace_is_content_historical
project string The project this event belongs to en.wikipedia wmf_raw.mediawiki_

project_namespace_map

hostname
revisions bigint 1 if the event is entity revision, or sum of revisions in case of digests 1 wmf.mediawiki_history event_entity
revisions_sum bigint Number of revisions 100 wmf.mediawiki_history COUNT(1)
text_bytes_diff bigint The text-bytes difference of the event 4077 wmf.mediawiki_history revision_text_bytes_diff
text_bytes_diff_abs bigint The absolute value of text-bytes difference for the event 4077 wmf.mediawiki_history revision_text_bytes_diff
text_bytes_diff_abs_sum bigint Sum of the text-bytes difference of the event 4077 wmf.mediawiki_history revision_text_bytes_diff
text_bytes_diff_sum bigint Sum of the absolute value of text-bytes difference for the event 4077 wmf.mediawiki_history revision_text_bytes_diff
user_text string user_text of user performing the event, whether registered or anonymous (IP) user_12345 wmf.mediawiki_history event_user_text, event_user_text_historical
user_type string anonymous, group_bot, name_bot or user all wmf.mediawiki_history event_user_is_anonymous, event_user_is_bot_by, event_user_is_bot_by_historical

Dashboards which use this table[edit]

(Collecting...)

Known issues and changes[edit]