New Developers/Quarterly/2017-10

Overview
Goal, possible candidates, scope, etc.

Key findings

 * Finding 1
 * Finding 2
 * Finding 3

New developers metrics and trends
Trends in onboarding and retention per quarter for the past one year:

Coming soon below

Specific metrics are below:

Survey analysis
Brief..

Doubts
(New)

Andre asks: (Resolved)
 * Under "Developer outreach programs and events > Number of new developers onboarded and retained at: > Wikimedia Hackathon 2017 / Onboarded and retained", how is 'retained' defined exactly? Check for any activity between 20170701 and 20170930 after Hackathon in 201705?
 * Yes, check for any activity at all between now and the hackathon (20170701 and 20170930)
 * Andre asks: All links above include any new developers, also staff, as the phrasing does not imply 'volunteers'. Should I changes those queries to only include author_org_name:"Independent" OR author_org_name:"Unknown"?
 * For a moment, I was about to say that it makes total sense to me and that exclude the staffers. But, I think if there are staffers new to Wikimedia code contribution process and trying to contribute in their volunteer time only, it would be good to include them in this study as well, unless you have a different opinion.

Time to get a Bitergia overview with a specific focus on topics mentioned below:

General About specific metrics Note to self:
 * For the new developers, how do we filter out the contributions made to third party repositories (as in T146135#3176718)?
 * Go to C_Gerrit_Demo and copy the second block from T146135 into the search bar. The results will not change much though, this is just a safety measure. And Andre should probably check the entries in that manual blacklist again.
 * Pulling email addresses.. Is it possible that Srishti understands this process as well? Or Srishti emails Andre 2-3 times in the month of September asking for more emails?
 * Andre to ask Bitergia a week before September ends for updated and complete quarter data, as the data on C_Gerrit_Demo is not yet automatically updated.
 * Number of commits we received and merged from new developers in the last quarter - how do we pull proposed commits that landed in Gerrit and the ones that got merged/landed in git from new developers
 * There is no way to do this on C_Gerrit_Demo. The only workaround is taking all the names of new authors from the "new Authors" list on C_Gerrit_Demo (CSV export) and constructing a query for the search field on https://wikimedia.biterg.io/app/kibana#/dashboard/Gerrit by entering author_name:"foo" OR author_name:"bar". Then use the "Status" circle in the middle by hovering your mouse pointer over it and take the "Count" numbers for NEW, ABANDONED, MERGED from there.
 * Number of new developers who actively contributed in the last quarter - (More than one commitset / patchset / contribs) etc. What is “contribs” in the "New Authors" widget on the C_Gerrit_Demo page?
 * contribs = changesets in Gerrit. Not patchsets within one changeset.
 * New developers active 1 year after their first contribution - How to calculate this?
 * By following the complicated steps on https://phabricator.wikimedia.org/T160430#3383647
 * Figure out a way to present stats / numbers