Wikimedia Product/Data dictionary/virtualpageviews_hourly

From mediawiki.org


This page describes the data set virtualpageviews_hourly that stores on Druid Datasources, which can be accessed via Superset/Turnilo. virtualpageviews_hourly on Druid is directly loaded from wmf.virtualpageview_hourly on Hive, while wmf.virtualpageview_hourly on Hive is originally extracted from event.VirtualPageView.

Schema[edit]

Field name data type description data example source schema source field
project string Project name from hostname hu.wikipedia event.VirtualPageView event.source_url
agent_type string Agent accessing the pages, can be spider or user user event.VirtualPageView IF(useragent.is_bot, 'spider', 'user')
ua_browser_family string Name of web browser (if not using an official Wikipedia mobile app), extracted from the client device's User-Agent Chrome event.VirtualPageView user_agent_map['browser_family']
ua_device_family string Client device family (e.g. brand of manufacturer, product name), extracted from the client device's User-Agent if provided Mac event.VirtualPageView user_agent_map['device_family']
city string City iso code of the accessing agents (maxmind GeoIP database) Springfield event.VirtualPageView geocoded_data['city']
subdivision string Subdivision of the accessing agents (maxmind GeoIP database) Ohio event.VirtualPageView geocoded_data['subdivision']
ua_wmf_app_version string Version of official Wikipedia mobile app (for iOS, Android, and KaiOS), extracted from the client device's User-Agent - event.VirtualPageView user_agent_map['wmf_app_version']
country string Country (text) of the accessing agents (maxmind GeoIP database) Hungary event.VirtualPageView geocoded_data['country']
country_code string Country iso code of the accessing agents (maxmind GeoIP database) HU event.VirtualPageView geocoded_data['country_code']
ua_os_major string Major version of that Operating System, extracted from the client device's User-Agent 10 event.VirtualPageView user_agent_map['os_major']
continent string Continent of the accessing agents (maxmind GeoIP database) Europe event.VirtualPageView geocoded_data['continent']
ua_os_family string Operating System family used by the client device, extracted from the User-Agent Windows 10 event.VirtualPageView user_agent_map['os_family']
language_variant string Language variant from path (not set if present in project name) 2017-es_Eurov

%C3%ADzi%C3%

B3s_Dalfesztiv%C3%A1l

event.VirtualPageView event.source_url
ua_os_minor string Minor version of that Operating System, extracted from the client device's User-Agent 13 event.VirtualPageView user_agent_map['os_minor']
referer_class string Always internal (virtualpageviews are always shown in wiki pages) internal 'internal'
access_method string Always desktop (virtualpageviews are a desktop only feature for now) desktop event.VirtualPageView event.access_method, 'desktop'
ua_browser_major string Major version of the client browser, extracted from the client device's User-Agent 64 event.VirtualPageView user_agent_map['browser_major']
view_count bigint Number of views 1 event.VirtualPageView count(*)

Dashboards which use this table[edit]

Readers Dashboard

Known issues and changes[edit]