JADE/Intro blog

At Wikimedia, we develop AIs that are trained to emulate human judgments. These AIs help us support quality control work, task routing, and other critical infrastructure for maintaining Wikipedia and other Wikimedia wikis. Recently, the media and the research literature have been discussing the insidious ways in which bias creeps into AIs like ours. To address these biases, we're trying a novel strategy, but one that will be familiar to Wikipedians. In this blog post, we'll discuss JADE, an AI auditing support system that we're building to allow Wikipedians to collaboratively track and discuss the biases in our AIs.

Context & purpose

We’re designing a companion project to ORES, but so far there is little awareness of it outside our immediate circle. The goal of this blog article is to highlight the new project and bring in collaborators. We have a vision for JADE and for the importance of auditing support for users of advanced AIs. In this blog post, we'd like to explain that vision and argue for its inherent Wikipedian-ness.

Audiences
  • Tool authors who will integrate with JADE.
  • Wiki editors interested in new initiatives, many of whom are already invested in ORES.
  • Technical contributors to MediaWiki, both WMF staff and volunteers.
  • Peripheral readers of Wikipedia who are interested in tech announcements.
  • Researchers in human-computer interaction and machine learning.

Outline

  1. Problem statement: what is lacking?
    1. Patrollers are working in an ad-hoc space.
      1. Current toolbox mostly applies to content:
        1. flagging that patrolling happened, but not the outcome;
        2. freeform discussion;
        3. patrol tags;
        4. category tags;
        5. fixing content.
      2. Outside of content:
        1. users can be blocked.
        2. revisions can be suppressed.
    2. ORES must be audited (a sketch of today's one-way scoring follows this section).
      1. Systemic biases cannot be self-diagnosed.
      2. Auditing should be diverse.
      3. Auditing should be transparent.
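
To make the problem concrete, here is roughly what a patrolling tool gets from ORES today: a one-way score, with no structured channel for pushing back. This is a minimal sketch against the public ORES v3 scores API; the revision ID is made up, and the exact response shape may differ:

    import requests

    # Ask ORES whether an English Wikipedia revision is damaging.
    # The revision ID is invented for illustration.
    response = requests.get(
        "https://ores.wikimedia.org/v3/scores/enwiki/",
        params={"models": "damaging", "revids": "123456789"},
    )
    response.raise_for_status()

    score = response.json()["enwiki"]["scores"]["123456789"]["damaging"]["score"]
    print(score["prediction"], score["probability"])
    # A patroller who disagrees with this prediction currently has no
    # structured place to record that disagreement.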
  2. How are we fixing this?
    1. JADE gives a quantitative space in which to do patrolling work and have discussions.
      1. Rich patroller flag to prevent duplicate work.
      2. Patrolling artifacts become wiki entities and are available for meta-moderation.
        1. Patrolling can be suppressed, discussed, and analyzed by ORES.
      3. Patrollers can work with revisions directly, rather than only article content.
      4. JADE will democratize auditing by making it accessible and transparent.
      5. Mock-up of a judgment, maybe of the UI (a data-shape sketch follows this list).
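
Until we have real UI screenshots, something like the following could serve as the mock-up. Every field name here is hypothetical; JADE's actual schema is still being designed:

    # Hypothetical shape of a JADE judgment as a structured wiki entity.
    # All field names are illustrative, not the final schema.
    judgment = {
        "target": {"type": "revision", "rev_id": 123456789},
        "schema": "damaging",
        "judgment": False,  # the patrollers' verdict: not damaging
        "notes": "Good-faith copyedit; the model was likely thrown by slang.",
        "endorsements": ["PatrollerA", "PatrollerB"],
        "discussion": "Talk:JADE/Judgment/123456789",
    }

Because the judgment is an ordinary wiki entity, it can itself be discussed, suppressed, and analyzed, which is what makes meta-moderation possible.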
    2. JADE closes the loop between patrollers and ORES by providing structured feedback that can be used to tune our algorithms.  This is the audit cycle, and it will help us identify biases.
      1. Output from a human consensus discussion is the gold standard for truth: multi-rater reliability, plus rational discourse that can be reviewed and that improves the reviewers.  Could consensus formation itself lead to shared biases, however?
      2. This is a generalization of Wiki Labels.
      3. Examples from JADE can be fed back into ORES with minimal technician intervention (sketched after this list).
      4. External observers are able to point out our biases.
      5. Give examples where we’ve discovered bias thanks to feedback.
      6. User feedback is known to bring potentially dramatic improvements to classifier accuracy, and structured feedback is even better.  Asking for comments has been shown to improve users' judgments.
      7. Easy diagram of the ecosystem.
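
As a sketch of the audit cycle itself: collect the revisions where a JADE consensus judgment disagrees with the ORES prediction, and hand those back as candidate training data or bias reports. This is hypothetical plumbing, assuming judgments shaped like the mock-up above:

    # Hypothetical audit step: surface revisions where human consensus
    # disagrees with the model, as candidates for retraining or bias review.
    def find_disagreements(judgments, predictions):
        """Both arguments map rev_id -> bool ("is this edit damaging?")."""
        return [
            rev_id
            for rev_id, human_says in judgments.items()
            if rev_id in predictions and predictions[rev_id] != human_says
        ]

    judgments = {101: False, 102: True, 103: False}    # JADE consensus
    predictions = {101: True, 102: True, 103: False}   # ORES predictions
    print(find_disagreements(judgments, predictions))  # -> [101]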
  3. Conclusion: we should build JADE because we have a rare opportunity to create a widely used, transparent AI algorithm, governed by democratic practices in which our users participate.
    1. What are normal feedback mechanisms in this industry?
      1. Google Jigsaw's Conversation AI is also providing feedback mechanisms and values the data they receive.  Theirs is write-only: scalar judgments and optional comments.
      2. Humans are treated as data-entry bots.
      3. Mediation is done entirely by technicians.
    2. What are the innovative and interesting mechanisms?
      1. Rationales: pick out keywords or features which explain the judgment.
      2. Learn by example: New class defined by hand-picked samples.
      3. Active learning: we choose what will be judged (see the sketch after this list).
      4. Learn from adversarial data: meta-vandalism.
      5. User co-training.
      6. Immediate algorithmic feedback: retraining or modulation of the AI to incorporate feedback in real time.
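
To pick one of these apart: active learning would mean asking patrollers to judge the edits the model is least sure about, rather than a random sample. A toy uncertainty-sampling sketch, with invented scores:

    # Toy uncertainty sampling: pick the revisions whose "damaging"
    # probability is closest to 0.5, i.e. where the model is least certain.
    scores = {101: 0.97, 102: 0.51, 103: 0.08, 104: 0.46}  # invented

    def most_uncertain(scores, k=2):
        return sorted(scores, key=lambda rev: abs(scores[rev] - 0.5))[:k]

    print(most_uncertain(scores))  # -> [102, 104]: route these to patrollers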
    3. What’s so democratic about JADE?
      1. We allow discussion about judgments.  Other systems only have individual judgments, and the reviewers usually can’t even read each other’s replies.
      2. We can pair democratized auditing with transparency about the pipeline itself.