Usertagging

What's usertagging
Usertagging is the equivalent of revtagging for user accounts. It's the ability to associate an arbitrary set of metadata (or tags) to a user_id at the time of the account creation for analytics purposes. Every account registration is stored in MediaWiki with a unique user_id in the user table: usertagging allows us to attach supplementary, schema-free data to a user_id.

What is it for
MediaWiki stores by default a series of data about registered users in the user table. The data held in this table is needed to operate the website. Some of this data (for example the user registration date, edit_count) is also used for analytics purposes, to study editor activity and retention. With several projects and editor engagement experiments generating new account registration, we need to store richer set of user metadata to be able to assess the impact of these projects. Only data that is compliant with the Wikimedia Foundation's privacy policy will be stored.

The focus of usertagging is primarily on storing metadata about account registrations, but it can be in principle extended to include supplementary data (like participating in an experiment). The following are examples of user metadata that are currently not included in the user table and that we may need to capture as part of analytics for product experimentation and community outreach activities:


 * user 3456 signed up via the Article Feedback "signup" call-to-action
 * user 6789 was bucketed as a participant in experimental condition X of experiment Y
 * user 2468 created an account via the Global Education portal
 * user 1357 registered as part of an outreach event

What usertagging is not
Usertagging will not include user data that is already captured by MediaWiki. It will not include any data that conflicts with our privacy policy.

E3 experiments

 * Motivation
 * Foo


 * Required data
 * Campaign: Bot
 * Subcampaign: bot name
 * Additional info: bot version
 * Refs
 * Foo
 * Bar