Wikimedia Technology/Annual Plans/FY2019/CDP3: Knowledge Integrity/Goals

=Program Goals and Status for FY18/19=

Segment 1 - Research
 * Goal Owner: Dario Taraborelli
 * Program Goals for FY18/19: Wikimedia sites provide the most trustworthy, comprehensive, neutral information across topics and languages by referencing this information to vetted reliable sources and linking it to external content providers and metadata repositories, making Wikimedia projects the central gateway to access citable information in the knowledge ecosystem.
 * Annual Plan: Segment 1 - Research
 * Primary Goal is Knowledge as a Service: increase reach



=Q1 Goals=

Outcome 1 / Output 1
Wikimedia contributors are better able to focus and prioritize their sourcing efforts and Product teams can build the best user experiences to support readers’ learning goals and their digital literacy.
 * A map of verifiability of information in Wikimedia projects

Goal(s)

 * Design and test an end-to-end machine learning framework to identify statements in need of a citation. ✅
 * Improve the taxonomy of reasons why editors add citations to Wikipedia statements ✅
 * Design the experiment and collect larger-scale data about reasons why people add citations ✅

Status
July 2018

August 21, 2018

September 13, 2018
 * Details: We expect this goal to be fully completed before the end of Q1. The first bullet point is expected to be done by the end of the month. The third bullet point is done, and we have done extensive additional work on it as well. What remains is documentation, which we expect to finish by 2018-09-18.
 * Update on Sept 18: all goals for this outcome are ✅

Outcome 1 / Output 2
Wikimedia contributors are better able to focus and prioritize their sourcing efforts and Product teams can build the best user experiences to support readers’ learning goals and their digital literacy.
 * Research to understand how readers use citations

Goal(s)

 * Prepare the data and do preliminary analysis on the first data collection on citation usage based on data gathered via Citation Usage schema ✅
 * Develop a survey to better understand the role of citations in Wikipedia readers’ evaluations of Wikipedia articles and to identify opportunities for supporting their learning goals and increasing their digital literacy. ✅

Status
July 2018

August 21, 2018
 * Data collection is done and only the documentation remains to be finished ✅. Developing the survey is in progress; more information is in T199188

September 18, 2018
 * The survey wording and goals are now ✅

Outcome 4 / Output 6
More knowledge professionals and other contributors are motivated to join the effort to build an open citation ecosystem, and are more able to actively improve the structure, quantity, and quality of citations on Wikimedia projects.
 * Funding the WikiCite event series

Goal(s)

 * Fundraise for the annual meeting in the WikiCite series and a set of satellite events, to improve the sustainability and global reach of the initiative. ✅
 * Organize the event, open the application process and design the program ✅

Status
July 2018
 * ✅ Fundraising is completed!

August 22, 2018
 * Organizing the event is underway

September 18, 2018
 * ✅ The selection process has been completed; notifications to applicants are being sent out as of October 1. The chairs of individual days of the event are now collecting information from selected attendees to finalize the agenda.



=Q2 Goals=

Outcome 1 / Output 1
Wikimedia contributors are better able to focus and prioritize their sourcing efforts and Product teams can build the best user experiences to support readers’ learning goals and their digital literacy.


 * A map of verifiability of information in Wikimedia projects

Goal(s)

 * Design a machine learning framework to identify why statements need a citation in English Wikipedia.
 * [Stretch] Submit a paper summarizing the modeling work for unsourced statement detection

Status
October 2018
 * Discussed...

November 2018
 * Discussed...

December 2018
 * Discussed...

Outcome 1 / Output 2
Wikimedia contributors are better able to focus and prioritize their sourcing efforts and Product teams can build the best user experiences to support readers’ learning goals and their digital literacy.


 * Research to understand how readers use citations

Goal(s)

 * Run the second round of data collection to understand Wikipedia citation usage.
 * Prepare and analyze the data collected in the second round.
 * Perform the first round of survey data collection on reader citation usage on English Wikipedia.
 * Analyze the first-round survey data on reader citation usage.

Status
October 18, 2018


 * The survey work has been ported to Qualtrics and a privacy statement has been submitted to Legal for review.

November 2018


 * Discussed...

December 2018


 * Discussed...

Outcome 4 / Output 6
More knowledge professionals and other contributors are motivated to join the effort to build an open citation ecosystem, and are more able to actively improve the structure, quantity, and quality of citations on Wikimedia projects.
 * Host the WikiCite 2018 event

Goal(s)

 * Host the WikiCite 2018 event in Berkeley, CA (November 27-29, 2018)

Status
October 2018


 * Discussed...

November 2018


 * Discussed...

December 2018


 * Discussed...