Wikimedia Release Engineering Team/Checkin archive/20180122

Pad for the Release Engineering Team weekly meeting. Archive at: https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Checkin_archive

= 2018-01-22 =

Vacations/Important dates

 * https://office.wikimedia.org/wiki/HR_Corner/Holiday_List
 * How to do it


 * Feb 2nd: Mukunda vacation day
 * Feb 9th: thcipriani half-day
 * Feb 19 (Mon): President's Day (All US Staff)
 * Mar 30 (Fri): WMF Holiday
 * April 14 (Fri): WMF Holiday
 * May 15?/16/17: Team offsite in Barcelona
 * May 18-20: Wikimedia Hackathon in Barcelona
 * May 21 (Mon): Tech-Mgt F2F

Rotating positions and absences
Maniphest query for deployment blocker tasks: https://phabricator.wikimedia.org/maniphest/?project=PHID-PROJ-fmcvjrkfvvzz3gxavs3a&statuses=open%28%29&group=none&order=newest#R

Jan 15 and Jan 22

 * Train: Tyler
 * wmf.17
 * wmf.18 - NO TRAIN THIS WEEK
 * SoS: Mukunda
 * Out
 * Jan 15 (Mon): Martin Luther King Day (All US Staff)
 * Jan 22/23: Dev Summit
 * Jan 24: Tech Management F2F
 * Jan 25/26: WMF All Hands

Jan 29 and Feb 05

 * Train: Chad
 * wmf.19 - NO TRAIN THIS WEEK
 * wmf.20
 * SoS: Tyler
 * Out
 * Jan 29-31: Team offsite

Feb 12 and Feb 19

 * Train: Mukunda
 * wmf.21
 * wmf.22
 * SoS: Chad
 * Out:
 * Feb 19 (Mon): President's Day (All US Staff)

Scrum of Scrums

 * Greg to copy to etherpad after meeting: https://etherpad.wikimedia.org/p/Scrum-of-Scrums

Release Engineering

 * Blocking
 * None?
 * Blocked
 * Ops: "Stack overflow when Redis is down" - https://phabricator.wikimedia.org/T185055
 * Would like some guidance from Ops on the documentation bits Gergo identifies
 * Updates

Last week

 * Blocking
 * None?
 * Blocked
 * "Stack overflow when Redis is down" - https://phabricator.wikimedia.org/T185055
 * Need help from Operations and/or Performance
 * Updates
 * Catching up the train this week and rolling out the last version before DevSummit/All Hands and RelEng team offsite weeks. [wiki][email]
 * https://phabricator.wikimedia.org/T180749#3897321
 * We moved Wednesday morning’s SWAT window 1 hour earlier (to 10am) to give us an hour's break before the new MW version rolls to the second set of wikis (all non-Wikipedias). This was a follow-up from a recent post-mortem. [wiki][email]
 * https://lists.wikimedia.org/pipermail/wikitech-l/2018-January/089404.html
 * https://phabricator.wikimedia.org/T182733
 * We broke git-fat deploy repos in scap (old config no longer valid); a workaround/fix is available in all relevant repos.
 * https://phabricator.wikimedia.org/T184882#3899710
 * (Yes, we’re re-doing how the CI for scap is done, see: https://phabricator.wikimedia.org/T184628 )
 * Updated the Debian packaging for Zuul (CI task scheduler) and released 2.5.0-8-gcbc7f62-wmf6, unblocking an upgrade of Gerrit.
 * https://phabricator.wikimedia.org/T158243
 * Converted our home-grown docker image builder to `docker-pkg` from Giuseppe
 * https://phabricator.wikimedia.org/T177276
 * Getting started with the basics of planning our team offsite before the Barcelona Hackathon. Submitt

Puppet SWAT

 * list of patches you want to submit to Puppet SWAT

Logspam / Last week's train updates

 * https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Roles#Train_Conductor

Other Team Business

 * Offsite schedule:
 * https://docs.google.com/document/d/1Ku2-djYs11U1CAlkTZmdt0YUbZq82aakp-74D6Gdhv4/edit?pli=1#heading=h.5gpr30v7c4il

Q3 goal/project check-in

 * All of it in table form: https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Goals/201718Q3

Program 1: Outcome 5: Milestone 1: Develop and migrate to a JavaScript-based browser testing stack

 * Due: End of this quarter
 * What: Specific improvements to the now-canonical framework, see: task T182421, notably:
 * Upgrade WebdriverIO to version 4.9
 * Investigate replacing nodemw with mwbot
 * Video recording for Selenium tests in Node.js
 * Task:


 * T175179 Create selenium-CirrusSearch-jessie daily Jenkins job
 * Done.

Program 1: Outcome 5: Objective 1: Maintain existing shared Continuous Integration infrastructure

 * Goals
 * Draft requirements for a Kubernetes based solution for CI -
 * Migrate MediaWiki PHPUnit tests to Shipyard (docker-based CI) (~40% of Nodepool usage) -
 * Unify production and CI docker image build process -
 * ✅ 01/15


 * Lack of progress due to oddities in Antoine's personal schedule. Requires a MySQL backend, which is not trivial to handle.

Program 3: Outcome 1: Objective 2: Identify and find stewards for high-priority/high use code segment orphans

 * Due: End of quarter


 * Meeting with Toby and Victoria scheduled from Feb 1

Program 3: Outcome 2: Objective 2: Define and implement a process to regularly address technical debt across the Foundation

 * Due: End of quarter


 * Started investigating approaches to syncing up the various tech debt activities across WMF.

Program 3: Outcome 2: Objective 3: Promote and surface important technical debt topics at large gatherings of Wikimedia developers (e.g., DevSummit and Hackathon(s))

 * Due: End of next quarter


 * No progress.

Program 6: Outcome 2: Objective 2: Set up a continuous integration and deployment pipeline

 * Due: End of this quarter
 * Keyword: SSD
 * phab project: https://phabricator.wikimedia.org/project/view/2453/
 * Goal:
 * Verify basic functionality of 'production' deployment and image (initially targeting mathoid):
 * Functional PoC within integration in the deployment-pipeline
 * Deploy to isolated k8s


 * Helm test is working with service-checker
 * Minikube packaging still in process: https://github.com/thcipriani/releng-debs-minikube-stuff

Program 1: Outcome 1: Objective 1: Scap (Tech Debt Sprint FY201718-Q2)

 * workboard


 * http://uftp-multicast.sourceforge.net/

Program 1: Outcome 6: Milestone 2: Maintain Phabricator

 * Streamline logspam workflows by adding some integration with phabricator
 * Store git-lfs (and other phab uploads) in swift:


 * Phabricator upgrade happened without complications. Security bug fixed:
 * https://phabricator.wikimedia.org/T144184
 * We now have translatewiki translations in production Phabricator and a currently manual process for updating the strings:
 * https://phabricator.wikimedia.org/T184459

Other work

 * Legacy Encoding Train Blocker post-mortem meeting scheduled.
 * Three stewardship review items have been submitted. Greg and I to review tomorrow.

Team Kanban Board Review and Triage

 * Closed and touched in the last 7 days
 * No update for 4 weeks
 * No update for 3 weeks
 * No update for 2 weeks
 * No update for 1 week
 * All Open
 * Review To Triage column of #releng
 * Assigned
 * Unassigned

Once / month-ish review of backlog(s)

 * releng Review To Triage column of #releng
 * releng-kanban Review unassigned in kanban
 * releng-kanban Review 'backlog' column of -kanban
 * releng-next - Review for things we need to put on our kanban backlog
 * releng-backlog - oh my, the huge backlog of things...

Kanban stats

 * Burnup chart