New Developers/Quarterly/2017-10

Overview
Goal, possible candidates, scope, etc.

Key findings

 * Finding 1
 * Finding 2
 * Finding 3

New developers metrics & trends (July - September 2017)

 * In the previous 12 months (Jul 2016 - Jun 2017)
 * Contributions
 * New developers we attracted
 * 237 new authors contributed to 92 repositories
 * New developers active one year after their first contribution
 * In the last quarter (July - September 2017)
 * Contributions
 * Number of new developers we attracted
 * 38 new authors contributed to 22 repositories
 * Number of new developers who actively contributed
 ** More than one commitset / patchset / contribs, etc.
 * Number of commits we received and merged from new developers
 * Project that received most contributions from new developers
 * Projects to which new developers mostly contribute:
 ** mediawiki/extensions/examples 10
 ** mediawiki/core 5
 ** apps/android/wikipedia 3
 * What do we infer from this pattern? Anything?
 * Documentation resources page views
 * Number of developers who landed on our documentation pages targeted at newcomers. Referral paths (page views)
 * For pages like: how to contribute, new developers, how to become a MediaWiki hacker, etc.
 * Developer outreach programs and events
 * Number of new developers onboarded and retained at:
 * Wikimedia Hackathon 2017 / Onboarded and retained
 * Wikimania Hackathon 2017 / Onboarded
 * Could consider tallying the email addresses and matching them with the ones in the newcomers spreadsheet
 * Outreachy Round 13 (Dec 2016 - Mar 2017) / Google Summer of Code 2016 / Onboarded and retained
 * Commit pattern during events and programs
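
The idea above of tallying hackathon email addresses against the newcomers spreadsheet could be sketched roughly as follows. This is only an illustration: the function name `match_emails` and the sample addresses are hypothetical, and it assumes both sources can be exported as plain lists of email addresses that differ only in letter case or stray whitespace.

```python
def match_emails(event_emails, newcomer_emails):
    """Return the sorted event emails that also appear in the newcomers list.

    Assumption: comparing lowercased, whitespace-stripped addresses is
    good enough to identify the same person in both sources.
    """
    def normalize(email):
        return email.strip().lower()

    # Skip empty cells that a spreadsheet export might contain.
    newcomers = {normalize(e) for e in newcomer_emails if e.strip()}
    attendees = {normalize(e) for e in event_emails if e.strip()}
    return sorted(attendees & newcomers)


# Hypothetical sample data, not real addresses from any event.
onboarded = match_emails(
    [" Alice@Example.org", "bob@example.org", ""],
    ["alice@example.org", "carol@example.org"],
)
print(len(onboarded), onboarded)
```

The length of the returned list would give the "onboarded" tally for an event; people who registered with a different address than the one they commit with would be missed.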

Survey
Brief..

Doubts
(Questions for Andre)

Time to get a Bitergia overview with a specific focus on topics mentioned below:

General
 * For the new developers, how do we filter out the contributions made to third-party repositories (as in T146135#3176718)?
 ** Go to C_Gerrit_Demo and copy the second block from T146135 into the search bar. The results will not change much, though; this is just a safety measure. Andre should probably check the entries in that manual blacklist again.
 * Pulling email addresses: could Srishti learn this process as well? Or should Srishti email Andre 2-3 times in September asking for more emails?
 ** Andre to ask Bitergia a week before September ends for updated and complete quarter data, as the data on C_Gerrit_Demo is not yet automatically updated.

About specific metrics
 * Number of commits we received and merged from new developers in the last quarter: how do we pull the proposed commits from new developers that landed in Gerrit and the ones that were merged (landed in git)?
 ** There is no way to do this on C_Gerrit_Demo. The only workaround is to take all the names of new authors from the "new Authors" list on C_Gerrit_Demo (CSV export) and construct a query for the search field on https://wikimedia.biterg.io/app/kibana#/dashboard/Gerrit by entering "author_name=X OR author_name=Y". Then hover the mouse pointer over the "Status" circle in the middle and read the "Count" numbers for NEW, ABANDONED, and MERGED.
 * Number of new developers who actively contributed in the last quarter (more than one commitset / patchset / contribs, etc.): what is “contribs” in the "New Authors" widget on the C_Gerrit_Demo page?
 ** contribs = changesets in Gerrit, not patchsets within one changeset.
 * New developers active one year after their first contribution: how do we calculate this?
 ** By following the complicated steps on https://phabricator.wikimedia.org/T160430#3383647

Note to self
 * Figure out a way to present stats / numbers
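
The workaround described above (turning the CSV export of new authors into one long "author_name=X OR author_name=Y" query) is tedious to do by hand, so a small script might help. The sketch below is an assumption-laden illustration: the function name `build_author_query` and the column header `Author` are hypothetical, since the actual header in the C_Gerrit_Demo export is not stated here, and the query is emitted in exactly the form quoted above, which may need quoting for names that contain spaces.

```python
import csv
import io


def build_author_query(csv_file, name_column="Author"):
    """Build an 'author_name=X OR author_name=Y' string from a CSV export.

    csv_file is any open text file (or file-like object) with a header row.
    name_column is a hypothetical header name; adjust it to the real export.
    """
    reader = csv.DictReader(csv_file)
    names = []
    seen = set()
    for row in reader:
        name = (row.get(name_column) or "").strip()
        # Keep each author once, in the order they appear in the export.
        if name and name not in seen:
            seen.add(name)
            names.append(name)
    return " OR ".join(f"author_name={name}" for name in names)


# Placeholder data standing in for the real CSV export.
sample = io.StringIO("Author,Contribs\nX,3\nY,1\nX,2\n")
print(build_author_query(sample))  # author_name=X OR author_name=Y
```

With a real export, the function would be called on `open("new_authors.csv", newline="")` (a hypothetical file name) and the printed string pasted into the dashboard's search field.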